Model Comparison & Benchmarks

Models

Access the best AI models from leading providers through a single API. Compare pricing, capabilities, and performance benchmarks to find the perfect model for your use case.

GPT-4o

OpenAI

Most capable GPT-4 model with vision capabilities

Vision
Chat
Code

Context

128K

Input

$2.50/1M

Output

$10.00/1M

Speed

Fast

GPT-4o Mini

OpenAI

Affordable small model for fast, lightweight tasks

Chat
Code

Context

128K

Input

$0.15/1M

Output

$0.60/1M

Speed

Very Fast

Claude 3.5 Sonnet

Anthropic

Best balance of intelligence and speed

Vision
Chat
Code
Analysis

Context

200K

Input

$3.00/1M

Output

$15.00/1M

Speed

Fast

Claude 3 Haiku

Anthropic

Fastest model for quick, efficient responses

Chat
Code

Context

200K

Input

$0.25/1M

Output

$1.25/1M

Speed

Very Fast

DeepSeek Chat

DeepSeek

High-quality open-source model for general chat

Chat
Code

Context

64K

Input

$0.14/1M

Output

$0.28/1M

Speed

Fast

DeepSeek Coder

DeepSeek

Specialized coding model with strong performance

Code

Context

64K

Input

$0.14/1M

Output

$0.28/1M

Speed

Fast

Mistral Large

Mistral

Flagship model with top-tier reasoning

Chat
Code
Analysis

Context

128K

Input

$2.00/1M

Output

$6.00/1M

Speed

Fast

Mixtral 8x7B

Mistral

Efficient mixture-of-experts model

Chat
Code

Context

32K

Input

$0.24/1M

Output

$0.24/1M

Speed

Very Fast

Llama 3.1 70B

Meta

Open-source model with strong capabilities

Chat
Code

Context

128K

Input

$0.59/1M

Output

$0.79/1M

Speed

Fast

Gemini 1.5 Pro

Google

Multimodal model with massive context window

Vision
Chat
Code

Context

2M

Input

$1.25/1M

Output

$5.00/1M

Speed

Fast

Command R+

Cohere

Enterprise-grade model for RAG and tool use

Chat
RAG
Tools

Context

128K

Input

$2.50/1M

Output

$10.00/1M

Speed

Fast

Llama 3 70B (Groq)

Groq

Ultra-fast inference with Groq LPU

Chat
Fast

Context

8K

Input

$0.59/1M

Output

$0.79/1M

Speed

Ultra Fast

12+

Models Available

8

Providers

2M

Max Context

99.9%

Uptime