| Model | Provider | Cost per Request |
|---|---|---|
| Llama 3.3 70B | Meta | $0.0015 (cheapest) |
| GPT-4.1 Nano | OpenAI | $0.0030 |
| Gemini 2.5 Flash Lite | Google | $0.0030 |

The per-request costs correspond to 10,000 input tokens and 5,000 output tokens per request.

Pricing Breakdown
| Metric | Llama 3.3 70B | GPT-4.1 Nano | Gemini 2.5 Flash Lite |
|---|---|---|---|
| Input Price | $0.10/MTok | $0.10/MTok | $0.10/MTok |
| Output Price | $0.10/MTok | $0.40/MTok | $0.40/MTok |
| Cached Input | — | $0.03/MTok | $0.01/MTok |
| Cost / Request | $0.0015 | $0.0030 | $0.0030 |
| Daily (100 req) | $0.15 | $0.30 | $0.30 |
| Monthly (100 req/day) | $4.50 | $9.00 | $9.00 |
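
The derived rows in the table can be reproduced from the per-token prices. The following is a minimal sketch, not the page's own calculator: it assumes 10,000 input and 5,000 output tokens per request (values that reproduce the table's per-request figures) and a 30-day month. The names `Pricing`, `cost_per_request`, and `projected_costs` are illustrative, and cached-input pricing is left out.

```python
from dataclasses import dataclass

TOKENS_PER_MTOK = 1_000_000


@dataclass(frozen=True)
class Pricing:
    input_per_mtok: float   # USD per million input tokens
    output_per_mtok: float  # USD per million output tokens


# Prices copied from the Pricing Breakdown table.
PRICES = {
    "Llama 3.3 70B": Pricing(0.10, 0.10),
    "GPT-4.1 Nano": Pricing(0.10, 0.40),
    "Gemini 2.5 Flash Lite": Pricing(0.10, 0.40),
}


def cost_per_request(p: Pricing, input_tokens: int = 10_000,
                     output_tokens: int = 5_000) -> float:
    """Cost in USD of one request; the token counts are assumed defaults."""
    total = input_tokens * p.input_per_mtok + output_tokens * p.output_per_mtok
    return total / TOKENS_PER_MTOK


def projected_costs(p: Pricing, requests_per_day: int = 100,
                    days_per_month: int = 30) -> tuple[float, float, float]:
    """Return (per-request, daily, monthly) cost; a 30-day month is assumed."""
    per_request = cost_per_request(p)
    daily = per_request * requests_per_day
    return per_request, daily, daily * days_per_month


if __name__ == "__main__":
    for name, pricing in PRICES.items():
        per_request, daily, monthly = projected_costs(pricing)
        print(f"{name}: ${per_request:.4f}/request, "
              f"${daily:.2f}/day, ${monthly:.2f}/month")
```

Because output tokens cost four times as much as input tokens on GPT-4.1 Nano and Gemini 2.5 Flash Lite, the gap to Llama 3.3 70B widens as responses get longer. Passing different `input_tokens` and `output_tokens` values shows how the ranking shifts for a given workload.
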
Features & Capabilities
| Feature | Llama 3.3 70B | GPT-4.1 Nano | Gemini 2.5 Flash Lite |
|---|---|---|---|
| Context Window | 128K | 1M | 1M |
| Max Output | 8K | 32K | 8.2K |
| Caching | No | Yes | Yes |
| Batch API | No | Yes | No |
| Vision | No | Yes | Yes |
| Tool Use | Yes | Yes | Yes |
| Best For | chat, general | classification, routing | classification, budget |
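
One way to use the two tables together is to filter on required capabilities and then sort by cost. The sketch below is illustrative only: the `Model` record and the `candidates` helper are hypothetical names, and the token limits are rounded readings of the table (128K as 128,000, 1M as 1,000,000, 8.2K as 8,200) rather than exact provider limits.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    cost_per_request: float  # USD, from the pricing table
    context_window: int      # tokens, rounded from the table
    max_output: int          # tokens, rounded from the table
    caching: bool
    batch_api: bool
    vision: bool
    tool_use: bool


# Values transcribed from the two tables; token counts are approximations.
MODELS = [
    Model("Llama 3.3 70B", 0.0015, 128_000, 8_000, False, False, False, True),
    Model("GPT-4.1 Nano", 0.0030, 1_000_000, 32_000, True, True, True, True),
    Model("Gemini 2.5 Flash Lite", 0.0030, 1_000_000, 8_200, True, False, True, True),
]


def candidates(models, *, min_context=0, min_output=0, needs_caching=False,
               needs_batch=False, needs_vision=False, needs_tools=False):
    """Return the models meeting every requirement, cheapest first."""
    matching = [
        m for m in models
        if m.context_window >= min_context
        and m.max_output >= min_output
        and (m.caching or not needs_caching)
        and (m.batch_api or not needs_batch)
        and (m.vision or not needs_vision)
        and (m.tool_use or not needs_tools)
    ]
    # sorted() is stable, so models with equal cost keep their table order.
    return sorted(matching, key=lambda m: m.cost_per_request)


if __name__ == "__main__":
    for m in candidates(MODELS, needs_vision=True, min_context=200_000):
        print(m.name, m.cost_per_request)
```

With no requirements, Llama 3.3 70B comes first on cost. Requiring vision or a context window above 128K leaves only GPT-4.1 Nano and Gemini 2.5 Flash Lite, which tie on per-request cost, so the remaining features (Batch API, maximum output length) decide between them.
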