Meta (Llama) API pricing
Open-weight Llama models. No first-party API — prices below are a representative hosted endpoint (Together AI).
Meta (Llama) models — price per 1M tokens
All prices in USD, last verified 2026-06-26. Cheapest reference cost first; click a column to re-sort.
| Model | Input /1M | Output /1M | Cost / call* | Context | Type |
|---|---|---|---|---|---|
| Llama 3.1 8B Open weights | $0.200 | $0.200 | $0.0004 | 128K | Open weights open |
| Llama 4 Scout Open weights (Groq) | $0.110 | $0.340 | $0.0005 | 128K | Open weights open |
| Llama 4 Maverick Open weights (Together AI) | $0.270 | $0.850 | $0.0011 | 500K | Open weights open |
| Llama 3.3 70B Open weights | $1.04 | $1.04 | $0.0021 | 128K | Open weights open |
*Reference cost of one call with 1,000 input + 1,000 output tokens — a neutral yardstick. Use the calculator for your real usage. Cheapest row highlighted.
How Meta (Llama) API pricing works
Meta (Llama) bills per token: an input price for everything you send and a higher output price for what the model generates, both quoted per million tokens. To estimate a real bill, multiply by your monthly request volume — the cost calculator does this across every model at once.
Before committing to volume, always confirm the current numbers on the official Meta (Llama) pricing page — the list prices here are verified periodically but the market moves fast.