Open-source · Self-hosted · Full control

Document ingestion and vector search you can self-host

bigRAG is a complete RAG pipeline — upload documents, auto-chunk, embed, and search. One API for your entire retrieval-augmented generation stack. Deploy on your infrastructure in minutes.

Simple integration

Upload, embed, and search in minutes

bigRAG handles the entire RAG pipeline. Upload any document format, and it automatically parses, chunks, embeds, and indexes for vector search. Use the TypeScript SDK or REST API.

TypeScript SDK with zero dependencies and full type safety
Milvus vector database with IVF_FLAT indexing and cosine similarity
Full API reference with Swagger docs at /docs
app.ts
import { BigRAG } from "@bigrag/client";

const client = new BigRAG({
  apiSecret: "your-api-secret",
  baseUrl: "http://localhost:6100",
});

// Upload, embed, and search — all in one platform
const { results } = await client.query("knowledge_base", {
  query: "What is the PTO policy?",
  top_k: 5,
});

15+

Document formats

PDF, DOCX, images, and more

6

Embedding models

OpenAI and Cohere

5 min

To deploy

Docker Compose

Deploy on your infrastructure

Your documents, embeddings, and search data never leave your servers. One command to get started.

$ curl -O https://raw.githubusercontent.com/bigint/bigrag/main/docker-compose.yml
$ docker compose up -d