Model Catalog

54 models across 10 providers.

Prices in the table are listed in USD per 1 million tokens. Because 1 credit = $0.001 USD, the same number is also the price in credits per 1K tokens. Prices updated March 29, 2026.

| Model | Provider | Tier | Context | Max Out | $/1M In | $/1M Out | Notes |
|---|---|---|---|---|---|---|---|
| gpt-4.1 | OpenAI | ultra | 1M | 32,768 | $2.0 | $8.0 | Latest flagship, 1M context |
| gpt-4.1-mini | OpenAI | pro | 1M | 32,768 | $0.40 | $1.6 | Fast, cheap, 1M context |
| gpt-4.1-nano | OpenAI | core | 1M | 32,768 | $0.10 | $0.40 | Cheapest GPT-4 class |
| gpt-4o | OpenAI | ultra | 128K | 16,384 | $2.5 | $10.0 | Vision + text flagship |
| gpt-4o-mini | OpenAI | core | 128K | 16,384 | $0.15 | $0.60 | Best value general model |
| o1 | OpenAI | ultra | 200K | 100,000 | $15.0 | $60.0 | Deep reasoning, expensive |
| o3 | OpenAI | ultra | 200K | 100,000 | $2.0 | $8.0 | Reasoning, great value |
| o3-mini | OpenAI | pro | 200K | 100,000 | $1.1 | $4.4 | Fast reasoning |
| o4-mini | OpenAI | pro | 200K | 100,000 | $1.1 | $4.4 | Newest reasoning, 75% cached savings |
| codex-mini-latest | OpenAI | pro | 200K | 100,000 | $1.5 | $6.0 | Fine-tuned for coding (Codex CLI) |
| claude-opus-4.6 | Anthropic | ultra | 1M | 128,000 | $5.0 | $25.0 | Most intelligent, extended thinking |
| claude-sonnet-4.6 | Anthropic | ultra | 1M | 64,000 | $3.0 | $15.0 | Best balance of IQ and cost |
| claude-haiku-4.5 | Anthropic | core | 200K | 64,000 | $1.0 | $5.0 | Fast, cheap Claude |
| claude-opus-4.5 | Anthropic | ultra | 1M | 128,000 | $5.0 | $25.0 | Previous flagship |
| claude-sonnet-4.5 | Anthropic | ultra | 1M | 64,000 | $3.0 | $15.0 | Previous gen Sonnet |
| claude-3.5-haiku | Anthropic | core | 200K | 8,192 | $0.80 | $4.0 | Legacy fast model |
| gemini-2.5-pro | Google | ultra | 1M | 65,536 | $1.3 | $10.0 | Context-tiered pricing at 200K |
| gemini-2.5-flash | Google | pro | 1M | 65,536 | $0.30 | $2.5 | Best value large context |
| gemini-2.5-flash-lite | Google | core | 1M | 65,536 | $0.10 | $0.40 | Absurdly cheap, 1M context |
| grok-4.20 | xAI | ultra | 2M | 16,384 | $2.0 | $6.0 | Latest flagship, lowest hallucination |
| grok-4.1-fast | xAI | core | 2M | 16,384 | $0.20 | $0.50 | 2M context at $0.20, insane value |
| grok-3 | xAI | ultra | 131K | 16,384 | $3.0 | $15.0 | Premium reasoning |
| grok-3-mini | xAI | core | 131K | 16,384 | $0.25 | $0.50 | Affordable reasoning |
| grok-code-fast | xAI | pro | 256K | 16,384 | $0.20 | $1.5 | Coding optimized |
| deepseek-chat | DeepSeek | core | 64K | 8,192 | $0.27 | $1.1 | Ultra-cheap general use ($0.07 cached) |
| deepseek-reasoner | DeepSeek | pro | 64K | 8,192 | $0.55 | $2.2 | Cheapest reasoning model |
| mistral-large-3 | Mistral | pro | 128K | 4,096 | $0.50 | $1.5 | Flagship, EU data residency |
| mistral-medium-3.1 | Mistral | pro | 32K | 4,096 | $0.40 | $2.0 | Balanced mid-tier |
| mistral-small-3.2 | Mistral | core | 32K | 4,096 | $0.07 | $0.20 | Cheapest Mistral |
| codestral-2508 | Mistral | pro | 32K | 8,192 | $0.30 | $0.90 | Code generation specialist |
| pixtral-12b | Mistral | core | 32K | 4,096 | $0.10 | $0.10 | Vision model |
| ministral-3b | Mistral | core | 32K | 4,096 | $0.10 | $0.10 | Tiny, ultra-cheap |
| ministral-8b | Mistral | core | 32K | 4,096 | $0.15 | $0.15 | Small, cheap |
| ministral-14b | Mistral | core | 32K | 4,096 | $0.20 | $0.20 | Mid-small, cheap |
| gpt-oss-120b-cerebras | Cerebras | pro | 128K | 8,192 | $0.22 | $0.68 | Open-source 120B, 3K t/s |
| llama-3.1-8b-cerebras | Cerebras | core | 128K | 8,192 | $0.06 | $0.14 | Cheapest, 2.2K t/s, free tier |
| glm-4.7-cerebras | Cerebras | ultra | 128K | 8,192 | $1.2 | $3.6 | Highest IQ on Cerebras |
| llama-3.3-70b-groq | Groq | core | 128K | 32,768 | $0.59 | $0.79 | 394 t/s, fast inference |
| llama-4-scout-groq | Groq | core | 128K | 32,768 | $0.11 | $0.34 | 594 t/s, newest Llama |
| llama-3.1-8b-groq | Groq | core | 128K | 32,768 | $0.05 | $0.08 | Cheapest Groq model |
| gpt-oss-120b-groq | Groq | pro | 128K | 32,768 | $0.15 | $0.60 | Open-source 120B at 500 t/s |
| qwen3-32b-groq | Groq | pro | 128K | 32,768 | $0.29 | $0.59 | Qwen on Groq hardware |
| kimi-k2-groq | Groq | pro | 128K | 32,768 | $1.0 | $3.0 | Moonshot AI Kimi |
| llama-3.3-70b-cf | Cloudflare | core | 128K | 8,192 | $0.29 | $2.3 | 70B on edge, FP8 |
| llama-4-scout-cf | Cloudflare | core | 128K | 8,192 | $0.10 | $0.44 | Llama 4 with vision |
| gpt-oss-120b-cf | Cloudflare | pro | 128K | 8,192 | $0.35 | $0.75 | Open-source 120B on edge |
| llama-3.1-8b-cf | Cloudflare | core | 128K | 8,192 | $0.04 | $0.38 | Cheapest Cloudflare model |
| mistral-small-cf | Cloudflare | core | 32K | 8,192 | $0.35 | $0.56 | Mistral on Cloudflare edge |
| qwen-coder-cf | Cloudflare | pro | 32K | 8,192 | $0.30 | $0.55 | Qwen coding model on edge |
| deepseek-r1-cf | Cloudflare | pro | 32K | 8,192 | $0.50 | $4.9 | Distilled reasoning on edge |
| gemma-3-12b-cf | Cloudflare | core | 32K | 8,192 | $0.15 | $0.75 | Google Gemma on edge |
| mimo-v2-flash | Mimo | core | 256K | 8,192 | $0.09 | $0.29 | 309B MoE, cheapest Xiaomi model |
| mimo-v2-pro | Mimo | ultra | 1M | 16,384 | $1.0 | $3.0 | 1T+ params, 1M context flagship |
| mimo-v2-omni | Mimo | pro | 262K | 8,192 | $0.40 | $2.0 | Multimodal: vision, audio, video |

Tier Guide

Core — Fast, cheap. Good for simple tasks.

Pro — Balanced quality/cost. Workhorses.

Ultra — Best quality. Use sparingly.
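
As a rough illustration of choosing a model by tier, the sketch below filters a small hand-copied subset of the catalog and picks the cheapest match. The `Model` class, the `cheapest` helper, ranking by input price plus output price, and reading "1M" as 1,000,000 tokens and "128K" as 128,000 tokens are all assumptions made for this example, not part of any official client.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    provider: str
    tier: str            # "core", "pro", or "ultra"
    context: int         # context window in tokens (assumed: 1M = 1_000_000)
    usd_per_m_in: float  # USD per 1M input tokens
    usd_per_m_out: float # USD per 1M output tokens


# A few rows copied from the catalog table; not the full list.
CATALOG = [
    Model("gpt-4.1-nano", "OpenAI", "core", 1_000_000, 0.10, 0.40),
    Model("gpt-4o-mini", "OpenAI", "core", 128_000, 0.15, 0.60),
    Model("llama-3.1-8b-groq", "Groq", "core", 128_000, 0.05, 0.08),
    Model("gemini-2.5-flash", "Google", "pro", 1_000_000, 0.30, 2.5),
    Model("o3-mini", "OpenAI", "pro", 200_000, 1.1, 4.4),
    Model("claude-sonnet-4.6", "Anthropic", "ultra", 1_000_000, 3.0, 15.0),
]


def cheapest(models, tier, min_context=0):
    """Return the cheapest model in a tier with at least min_context tokens.

    "Cheapest" here means the lowest input + output list price, which is an
    assumed ranking; a real workload might weight input and output differently.
    Returns None when nothing matches.
    """
    candidates = [m for m in models if m.tier == tier and m.context >= min_context]
    return min(candidates, key=lambda m: m.usd_per_m_in + m.usd_per_m_out, default=None)


print(cheapest(CATALOG, "core").name)                       # llama-3.1-8b-groq
print(cheapest(CATALOG, "core", min_context=500_000).name)  # gpt-4.1-nano
```

Starting from the Core tier and moving up to Pro or Ultra only when quality falls short matches the "use sparingly" guidance above.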

Pricing

Prices are in USD per 1 million tokens.

1 credit = $0.001 USD.
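
The two pricing statements above can be combined into a small cost estimate. The following is a minimal sketch using list prices only; the function names are hypothetical, and the example prices are the gpt-4o-mini row from the table.

```python
from decimal import Decimal

USD_PER_CREDIT = Decimal("0.001")  # from the catalog: 1 credit = $0.001 USD
TOKENS_PER_MILLION = Decimal(1_000_000)


def request_cost_usd(input_tokens, output_tokens, usd_per_m_in, usd_per_m_out):
    """USD cost of one request at list (non-cached, non-batch) prices."""
    return (
        Decimal(input_tokens) * Decimal(usd_per_m_in)
        + Decimal(output_tokens) * Decimal(usd_per_m_out)
    ) / TOKENS_PER_MILLION


def usd_to_credits(usd):
    """Convert a USD amount to credits."""
    return usd / USD_PER_CREDIT


def credits_per_1k(usd_per_m):
    """Convert a USD-per-1M-token price to credits per 1K tokens."""
    return Decimal(usd_per_m) / Decimal(1000) / USD_PER_CREDIT


# gpt-4o-mini: $0.15 per 1M input tokens, $0.60 per 1M output tokens.
cost = request_cost_usd(10_000, 2_000, "0.15", "0.60")
print(cost)                   # 0.0027 (USD)
print(usd_to_credits(cost))   # 2.7 (credits)
print(credits_per_1k("2.0"))  # 2.0, the same number as the USD-per-1M price
```

`Decimal` is used instead of `float` so that small per-token amounts do not pick up binary rounding noise.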

Cached/batch pricing may be lower.
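
To see how cached pricing might change the effective input price, here is a hedged sketch. It uses two figures from the table notes: the $0.07 cached price for deepseek-chat and the "75% cached savings" for o4-mini. Treating the 75% as a discount on cached input tokens only, and the 50% cache hit ratio, are assumptions for illustration; the helper names are hypothetical.

```python
from decimal import Decimal


def cached_price(list_price, savings_fraction):
    """Price per 1M cached input tokens, given a fractional discount."""
    return list_price * (Decimal(1) - savings_fraction)


def blended_input_price(list_price, cached, hit_ratio):
    """Average input price per 1M tokens when hit_ratio of tokens are cached."""
    return hit_ratio * cached + (Decimal(1) - hit_ratio) * list_price


# deepseek-chat: $0.27 list, $0.07 cached (from the table), assumed 50% hit ratio.
deepseek_blended = blended_input_price(Decimal("0.27"), Decimal("0.07"), Decimal("0.5"))

# o4-mini: $1.1 list with an assumed 75% discount on cached input tokens.
o4_mini_cached = cached_price(Decimal("1.1"), Decimal("0.75"))

print(deepseek_blended)  # 0.170 USD per 1M input tokens
print(o4_mini_cached)    # 0.275 USD per 1M cached input tokens
```

Actual cached and batch rates depend on the provider, so these numbers are estimates, not quoted prices.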

Need a Model?

We add new models weekly based on demand.

Contact us to request a specific model.