aicoolies logo
Hopsworks logo

Hopsworks

AI Lakehouse with Feature Store for real-time ML

Share
freemiumOpen Source
Visit Website →

Hopsworks is a data-intensive AI platform combining a Python-centric Feature Store with MLOps capabilities for production ML systems. Provides sub-millisecond feature retrieval powered by RonDB, dual offline and online storage for batch and real-time inference, experiment tracking, model registry, and deployment pipelines. Available as managed cloud on AWS, Azure, and GCP, self-hosted on Kubernetes, or serverless platform.

Hopsworks is an enterprise AI Lakehouse platform unifying feature management, model development, and model serving in a single integrated system. The Feature Store solves a critical ML operations problem of managing, versioning, and serving features with consistent semantics across training and inference. Using RonDB it provides sub-millisecond feature retrieval for real-time inference enabling low-latency AI systems in fraud detection, recommendation, and personalization use cases.

The dual storage architecture with offline storage for batch training and online storage for real-time serving eliminates training-serving skew, a common source of model performance degradation in production. Feature reuse at organizational scale creates massive leverage as top features get used across hundreds of different models. Python-centric APIs make it natural for data scientists and the platform integrates with popular ML tools like Spark, Pandas, and Kafka for data workflows.

Beyond feature management Hopsworks provides complete ML lifecycle governance with experiment tracking, model registry with version control, and deployment pipelines. The freemium pricing model with generous free credits lowers the barrier for experimentation. For regulated industries self-hosted and serverless deployment options ensure data sovereignty. The combination of sub-millisecond latency, operational maturity, and flexible deployment positions Hopsworks as foundational infrastructure for production ML.

Pricing

Free tier with $4k credits; pay-as-you-go after

Platforms

AI Lakehouse with Feature Store for batch and real-time ML operations

Categories

Tags

Use Cases

Alternatives

Related Tools

Traceway logo

Traceway

OpenTelemetry-native observability with AI tracing, logs, traces, metrics, and session replay — self-hosted in 90 seconds.

Traceway is an open-source, OpenTelemetry-native observability platform that combines logs, traces, metrics, exceptions, session replay, and AI tracing in a single self-hosted system. MIT licensed with no open-core restrictions, it deploys in 90 seconds via Docker Compose and accepts OTLP/HTTP from any OTel SDK without a Collector or per-language vendor SDK.

open-sourceOpen Source
Marqo logo

Marqo

Embedding-first search and discovery engine for AI-powered product experiences.

Marqo is an open-source tensor search engine that combines embedding generation and vector search in a single API, removing the need to manage separate embedding pipelines and vector databases. Built for product discovery and multi-modal search, it lets teams index text, images, and structured data together, returning ranked results based on semantic similarity rather than keyword overlap.

freemium
Judgeval logo

Judgeval

Open-source post-building layer for agents — tracing, evals, and online monitoring

Judgeval is the open-source post-building layer for AI agents from Judgment Labs, providing OpenTelemetry-based tracing, hosted and custom evaluation scorers, and online behavior monitoring for LLM-powered applications. Instrument any function with a single decorator, score live production traffic against faithfulness and instruction-adherence checks, and feed real-world failures back into reinforcement learning or supervised fine-tuning loops.

open-sourceOpen Source
TraceRoot logo

TraceRoot

Open-source observability and self-healing layer for AI agents

TraceRoot is a YC S25-backed open-source observability platform purpose-built for AI agents and LLM apps. It combines OpenTelemetry-compatible tracing with an agentic debugging runtime that reads your source code, correlates failures with recent commits, and proposes fix PRs automatically. BYOK support spans seven LLM providers; the entire stack runs self-hosted via Docker Compose, with TraceRoot Cloud available for managed deployments.

open-sourceOpen Source
OpenSRE logo

OpenSRE

Open-source toolkit for building AI SRE incident response agents

OpenSRE is an open-source Python toolkit from Tracer Cloud for building AI SRE agents that investigate and respond to production incidents. It ships with connectors to Prometheus, Grafana, Kubernetes and incident platforms, plus a simulation harness that replays past incidents so teams can benchmark agent accuracy before trusting it on live pager rotations.

open-sourceOpen Source
Evolver logo

Evolver

Self-evolution engine for AI agents with auditable updates

Evolver is an open-source self-evolution engine for AI agents that turns run logs into auditable, reviewable updates via its Genome Evolution Protocol. Instead of ad hoc prompt tweaking, teams collect traces and Evolver proposes versioned diffs to prompts, tools and workflows that engineers can approve, reject or roll back like code.

open-sourceOpen Source