# search-engine
13 tools tagged
Showing 13 of 13 tools
SeekDB
AI-native state store with hybrid vector and full-text search
SeekDB is an open-source AI-native state store from the OceanBase ecosystem that combines MySQL-compatible data access with hybrid vector and full-text retrieval. It targets agent and AI application teams that need embedded or server deployment, copy-on-write style sandboxes, and searchable state without gluing together several separate storage layers.
GitNexus
Graph RAG code knowledge graph for repository exploration
GitNexus is a code-knowledge-graph and Graph RAG app for exploring repository structure before humans or AI coding agents make changes. Write-time source checks support graph/RAG and local-server signals, but not hard local/server-architecture, 14-language, MCP, pricing, or licensing claims; evaluate privacy, scale, and integrations directly.
QMD
On-device hybrid search engine for your docs and notes
QMD is an on-device search engine built by Tobi Lütke (Shopify CEO) that indexes markdown notes, meeting transcripts, and documentation locally. It combines BM25 full-text search, vector semantic search, and LLM-powered re-ranking into a single hybrid pipeline. Ships with a built-in MCP server for seamless integration with Claude Code, Cursor, and other AI editors. All processing happens on your machine via node-llama-cpp with GGUF models — zero cloud dependency.
R2R
Production RAG engine with hybrid search and knowledge graphs
R2R is a production-grade RAG engine from SciPhi AI that combines hybrid search with knowledge graph extraction and agentic retrieval capabilities. It provides a complete pipeline from document ingestion through retrieval and generation, supporting vector, keyword, and graph-based search strategies. The managed API and self-hosted options make it accessible for both rapid prototyping and production deployments requiring advanced retrieval beyond simple vector similarity.
ParadeDB
Elasticsearch-quality full-text and hybrid search inside Postgres
ParadeDB brings Elasticsearch-quality full-text search, BM25 ranking, and hybrid vector-keyword search directly into PostgreSQL as native extensions. Backed by a 12 million dollar Series A with over 500,000 Docker deployments, it eliminates the overhead of running separate search infrastructure. Teams get powerful search within their existing Postgres stack without managing additional clusters.
turbopuffer
Serverless vector and full-text search on object storage
turbopuffer is a serverless vector and full-text search engine built on object storage and vendor-positioned as roughly 10x cheaper than traditional vector databases. Used by Anthropic, Cursor, Notion, and Atlassian for production search workloads. Official site reports 4T+ documents, 10M+ writes/s, and 25k+ queries/s in production systems. Funded by Thrive Capital.
Exa MCP Server
Real-time web search and retrieval via MCP
Exa MCP Server provides AI coding agents with real-time web search and content crawling capabilities through the Model Context Protocol. It leverages Exa's neural search API for semantic understanding of queries, returning clean, structured results with full page content extraction. Supports both remote hosted MCP endpoints and local client configurations.
Firecrawl MCP Server
Web scraping and crawling via MCP for AI agents
Firecrawl MCP Server is the official MCP integration for Firecrawl, giving Cursor, Claude, Windsurf, and other MCP clients scrape, crawl, map, search, extract, and agent-style web research tools. It now supports a hosted remote endpoint, keyless rate-limited scrape/search/interact use, API-key/OAuth access for the full tool set, and self-hosted Firecrawl deployments.
Pinecone
Fully managed vector database built for AI applications at production scale.
Pinecone is a leading managed vector database designed for high-performance similarity search at scale. Purpose-built for AI applications including RAG, recommendation systems, and semantic search. Offers managed serverless infrastructure with automatic scaling, filtering, hybrid retrieval, and namespacing. No infrastructure management required.
Typesense
Open-source search engine — fast, typo-tolerant, easy to use.
Typesense is an open-source, typo-tolerant search engine optimized for instant search experiences. Written in C++ for maximum performance. Features built-in vector search for semantic/hybrid queries, geo-search, faceting, and curation. Popular for e-commerce search, documentation sites, and SaaS applications.
Qdrant
High-performance vector database written in Rust for similarity search at scale.
Qdrant is a high-performance vector similarity search engine and database written in Rust. Designed for production-grade AI applications with advanced filtering, payload indexing, and distributed deployment. Supports billion-scale vector collections with sub-second query times. Popular choice for RAG, recommendation systems, and anomaly detection.
Weaviate
Open-source vector database for AI-native applications and semantic search.
Weaviate is an open-source vector database purpose-built for AI applications. Supports vector, keyword, and hybrid search with built-in vectorization modules for OpenAI, Cohere, Hugging Face, and more. Used for RAG pipelines, semantic search, recommendation engines, and multimodal search. Written in Go for high performance.
Elasticsearch
Distributed search and analytics engine for all types of data.
Elasticsearch is the world's most popular open-source search and analytics engine, powering search experiences for companies like Wikipedia, GitHub, Netflix, and Uber. Built on Apache Lucene, it provides near-real-time search, structured and unstructured data analysis, and machine learning capabilities. Part of the Elastic Stack (ELK), it handles log analytics, application search, security analytics, and observability at scale. Supports vector search for AI/RAG applications.