Model Comparison

Comparing 3 models side by side

ML

Llama 3.3 70B

Meta

$0.0015

per request

Cheapest
OA

GPT-4.1 Nano

OpenAI

$0.0030

per request

GO

Gemini 2.5 Flash Lite

Google

$0.0030

per request

10,000
5,000

Cost per Request

Context Window

Pricing Breakdown

MetricLlama 3.3 70BGPT-4.1 NanoGemini 2.5 Flash Lite
Input Price$0.10/MTok$0.10/MTok$0.10/MTok
Output Price$0.10/MTok$0.40/MTok$0.40/MTok
Cached Input$0.03/MTok$0.01/MTok
Cost / Request$0.0015$0.0030$0.0030
Daily (100 req)$0.15$0.30$0.30
Monthly (100 req/day)$4.50$9.00$9.00

Features & Capabilities

FeatureLlama 3.3 70BGPT-4.1 NanoGemini 2.5 Flash Lite
Context Window128K1M1M
Max Output8K32K8.2K
CachingNoYesYes
Batch APINoYesNo
VisionNoYesYes
Tool UseYesYesYes
Best For
chatgeneral
classificationrouting
classificationbudget