Kalkulator
Pembanding Harga Token
Perkirakan biaya token input/output OpenAI, Anthropic, Google, xAI, Mistral termasuk hemat prompt cache.
Harga diperbarui:
AITOT Token & Pricing Comparator memungkinkan Anda membandingkan biaya per token pada 22 LLM terdepan 2026 — termasuk OpenAI GPT-5, Claude Opus 4.7, Gemini 2.5 Pro, Llama 4 70B, DeepSeek V3, Mistral Large 2, dan Amazon Nova.
Token output mendominasi mayoritas tagihan — biaya 3-5× input token di setiap provider utama. Comparator mengurutkan berdasarkan total cost. Toggle prompt caching memotong biaya input 60-90% di Anthropic dan 50% di OpenAI.
Semua pricing dari dokumentasi resmi dan diperbarui tanggal 1 setiap bulan. Tagihan nyata jatuh dalam 5-15% dari estimasi. Tanpa login; hasil dihitung client-side.
Termurah
Amazon · Nova Lite
$14.40
Per bulan
| Provider | Model | Input / 1M | Output / 1M | Per request | Per bulan |
|---|---|---|---|---|---|
| Amazon | Nova Lite | $0.06 | $0.24 | $0.0001 | $14.40 |
| OpenAI | GPT-5 nano | $0.05 | $0.40 | $0.0002 | $20.00 |
| Gemini 2.5 Flash-Lite | $0.10 | $0.40 | $0.0002 | $24.00 | |
| Cohere | Command R | $0.15 | $0.60 | $0.0004 | $36.00 |
| Mistral | Mistral Small 3 | $0.20 | $0.60 | $0.0004 | $40.00 |
| DeepSeek | DeepSeek V3 | $0.27 | $1.10 | $0.0007 | $65.60 |
| OpenAI | GPT-5.4 nano | $0.20 | $1.25 | $0.0007 | $66.00 |
| Gemini 3.1 Flash-Lite | $0.25 | $1.50 | $0.0008 | $80.00 | |
| OpenAI | GPT-5 mini | $0.25 | $2.00 | $0.001 | $100.00 |
| Meta (Together) | Llama 4 70B | $0.88 | $0.88 | $0.0011 | $105.60 |
| Gemini 2.5 Flash | $0.30 | $2.50 | $0.0012 | $124.00 | |
| DeepSeek | DeepSeek R1 | $0.55 | $2.19 | $0.0013 | $131.60 |
| xAI | Grok 4 mini | $0.60 | $2.40 | $0.0014 | $144.00 |
| Amazon | Nova Pro | $0.80 | $3.20 | $0.0019 | $192.00 |
| OpenAI | GPT-5.4 mini | $0.75 | $4.50 | $0.0024 | $240.00 |
| Anthropic | Claude Haiku 4.5 | $1.00 | $5.00 | $0.0028 | $280.00 |
| Mistral | Mistral Large 2 | $2.00 | $6.00 | $0.004 | $400.00 |
| Meta (Together) | Llama 4 405B | $3.50 | $3.50 | $0.0042 | $420.00 |
| OpenAI | o3 | $2.00 | $8.00 | $0.0048 | $480.00 |
| Gemini 3.5 Flash | $1.50 | $9.00 | $0.0048 | $480.00 | |
| OpenAI | GPT-5 | $1.25 | $10.00 | $0.005 | $500.00 |
| Gemini 2.5 Pro | $1.25 | $10.00 | $0.005 | $500.00 | |
| Cohere | Command R+ | $2.50 | $10.00 | $0.006 | $600.00 |
| Gemini 3.1 Pro | $2.00 | $12.00 | $0.0064 | $640.00 | |
| OpenAI | GPT-5.4 | $2.50 | $15.00 | $0.008 | $800.00 |
| Gemini 2.5 Pro (long ctx >200K) | $2.50 | $15.00 | $0.008 | $800.00 | |
| Anthropic | Claude Sonnet 4.6 | $3.00 | $15.00 | $0.0084 | $840.00 |
| Anthropic | Claude Opus 4.8 | $5.00 | $25.00 | $0.014 | $1,400.00 |
| xAI | Grok 4 | $5.00 | $25.00 | $0.014 | $1,400.00 |
| OpenAI | GPT-5.5 | $5.00 | $30.00 | $0.016 | $1,600.00 |
| OpenAI | GPT-5.5 Pro | $30.00 | $180.00 | $0.096 | $9,600.00 |
Hanya estimasi. Tagihan nyata dapat bervariasi 5–15% tergantung caching, batching, dan region.
Yang dilakukan kalkulator ini
22 LLM dalam satu tabel
GPT-5, Opus 4.7, Gemini 2.5 Pro, Llama 4, DeepSeek V3, Mistral, Nova, Cohere — semua bisa dibandingkan.
Modeling prompt cache
Toggle cache hit rate 0-100% untuk lihat tarif efektif.
Per-request + per-month
Kalkulator tampilkan biaya per request dan total bulanan.
Workload presets
Chat, RAG, agent, summarization, code-gen presets preset ratio input/output realistis.
Rasio output:input
Chat 4:1; code-gen 3:1; summarization 10:1.
Export + share
Simpan skenario di localStorage, ekspor CSV, bagikan permalink.
Perbandingan cepat
Harga token pada LLM teratas (per 1M token)
| Model | Input | Output | Blended 50:50 |
|---|---|---|---|
| Amazon Nova Lite | $0.06 | $0.24 | $0.15 |
| DeepSeek V3 | $0.27 | $1.10 | $0.69 |
| Gemini 2.5 Flash | $0.30 | $2.50 | $1.40 |
| GPT-5 mini | $0.40 | $1.60 | $1.00 |
| Claude Haiku 4.5 | $0.80 | $4.00 | $2.40 |
| Claude Sonnet 4.6 | $3.00 | $15.00 | $9.00 |
| OpenAI GPT-5 | $10.00 | $30.00 | $20.00 |
| Claude Opus 4.7 | $15.00 | $75.00 | $45.00 |
Output mendominasi mayoritas workload. Pakai kalkulator dengan rasio nyata Anda.
Cara menggunakan kalkulator
Perkirakan biaya token input + output untuk workload Anda di 22 LLM dalam <60 detik.
- 1
Pilih workload preset
Pilih chat, RAG, agent, summarization, atau code-gen.
- 2
Set request per bulan
Masukkan volume bulanan diharapkan.
- 3
Toggle prompt caching
Jika system prompt stabil, set cache hit rate 50-80%.
- 4
Bandingkan dan pilih
Sort berdasarkan biaya bulanan. Pilih model termurah yang memenuhi standar kualitas.
Kenapa pakai kalkulator ini
- ✓Gratis selamanya — tanpa login, tanpa kartu
- ✓22 LLM diperbarui bulanan
- ✓Jalan client-side — input pribadi
- ✓Workload presets, bukan rata-rata generic
- ✓Termasuk prompt cache + batch discounts
- ✓Permalinks untuk berbagi