Calculator
RAG Total Cost Calculator
All-in-one RAG bill — embedding pass + vector DB + reranker + LLM generation. Plug in document count and query volume to see the full monthly stack.
Pricing refreshed: 2026-05-01
Total monthly
$789
One-time embed cost
$6
Per query
$0.0053
Year 1 total
$9,469
Monthly cost breakdown
Embedding query (Voyage 3)
0%$0
Vector DB (pinecone-serverless)
0%$2
Reranker (Cohere Rerank 3)
38%$300
Generation (Claude Haiku 4.5)
62%$486
RAG bill = embedding query + vector DB storage/retrieval + reranker (optional) + LLM generation. Generation typically dominates above 50k queries/day. At MVP scale, vector DB minimums dominate.
Frequently Asked Questions
How accurate are these calculators?+
Pricing is sourced from official provider documentation and refreshed monthly. Real bills can vary by 5–15% due to caching, batching, and region.
Are prices in USD?+
Yes, all prices are quoted in USD per the providers' billing currency.
How often is data updated?+
Pricing tables are reviewed and updated on the first of every month.
Can I trust these for budgeting?+
Use them as estimates. For production budgets, always validate with a 1-week pilot using your real workload.