Skip to content
  • There are no suggestions because the search field is empty.

Model Pricing

This section provides the latest pricing information of models available on the Compass Platform.

Text Tokens

Prices per 1M tokens.

Model Name Model ID/Version Input  Output
GPT-5 gpt-5 $1.25 $10.00
GPT-5 mini gpt-5-mini $0.25 $2.00
GPT-4.1 GPT-4.1 $2.00 $8.00
GPT-4o gpt-4o-2024-08-06 $2.50 $10.00
GPT-4o mini gpt-4o-mini-2024-07-18 $0.15 $0.60
GPT-4o Audio  gpt-4o-audio-preview-2024-12-17  $2.50 $10.00
GPT-4o Transcribe gpt-4o-transcribe $2.50 $10.00
GPT-4o mini Transcribe gpt-4o-mni-transcribe $1.25 $5.00
GPT-4o mini TTS GPT-4o-mini-tts $0.60 N/A
GPT-4o Realtime  gpt-4o-realtime-preview-2024-12-17 $5.00 $20.00
gpt-4o-realtime-preview-2024-10-01 $5.00 $20.00
 gpt-realtime  gpt-realtime  $4.00  $16.00
o1-preview  o1-preview-2024-09-12 $15.00 $60.00
o1  o1-2024-12-17 $15.00 $60.00
o3 o3-2025-04-16 $10  $40
o3-mini o3-mini-2025-01-31 $1.10 $4.40
o4-mini o4-mini $1.1 $4.4
gpt-oss-120b gpt-oss-120b-core42-amd $0.3 $0.75
  gpt-oss-120b-qualcomm $0.15 $0.37
gpt-oss-120b-cerbras $0.25 $0.69
gpt-oss-120b-core42 $0.3 $0.75
gpt-oss-120b-azure $0.4 $0.95
gpt-oss-20b gpt-oss-20b-core42 $0.2 $0.6
K2 Think k2-think-core42 $0.15 $0.50
k2-think-cerebras $0.15 $0.50

Audio Tokens

Price per 1M tokens

Model Model ID/Version Input Output
GPT-4o Audio gpt-4o-audio-preview-2024-12-17 $40.00 $80.00
GPT-4o Realtime  gpt-4o-realtime-preview-2024-12-17 $40.00 $80.00
GPT-4o Realtime  gpt-4o-realtime-preview-2024-10-01 $100.00 $200.00
  gpt-realtime   gpt-realtime  $32.00  $64.00
GPT-4o Transcribe gpt-4o-transcribe $6.00 N/A
GPT-4o mini Transcribe gpt-4o-mini-transcribe $3.00 N/A
GPT-4o mini TTS GPT-4o-mini-tts N/A $12.00

Image Tokens

Prices per 1M tokens. 

Model Model ID/Version Input Output
 GPT Image 1  gpt-image-1 $10.00 $40.00
 gpt-realtime  gpt-realtime $5.00 N/A

Image Generation

Price per 100 images.

Model Name Model ID/Version Quality 1024*1024 1024*1792 1792*1024
Dall·E 3  dall-e-3 Standard $4.00 $8.00 $8.00
HD $8.00 $12.00 $12.00

Transcription

Model Model ID/Version Per Hour
Whisper whisper-1 $0.36 

Embeddings 

Price per 1M tokens.

Model Model ID/Version Input Tokens per Million Output Tokens per Million
Embeddings 3 Large text-embedding-3-large $0.13 $0.13

Batch Processing

Batch is supported with GPT-4o and GPT-4o mini models.

  Tokens Consumption
Batch  50% discount on input and output tokens with the Batch

Fine-Tuning

Price per 1M tokens.

Model  Model ID/Version Training
GPT-4.1 gpt-4.1-2025-04-14 $30.3

Response API Tools Calling

The supported models are:
  • GPT-4.1 
  • GPT-4o
  • GPT-4o mini
  • o4-mini
  • o3-mini 
Tool Cost
File Search Storage $0.11 GB of vector-storage per day (1 GB free)
File Search Tool Call $2.50 /1000 calls

Available Models [Free Until December 2025]

The following models are available at no cost until December 2025. Access and usage terms may be subject to change thereafter.
 

Text Generation

  • Claude Sonnet 4
  • Cohere Command A
  • Command R
  • Cohere Embed 4
  • Jais 30B
  • Llama 3 70B
  • Llama 3.3 70B
  • Mixtral 8x7B
  • Mistral 7B
  • DeepSeek R1 0528
  • Qwen 3 14B

Image Generation

  • Stable Diffusion

Embeddings

  • Qwen 3 Embedding 8B

Reranker

  • Qwen 3 Reranker 8B

Web Search [Retired on Aug 29th, 2025]

Per 1000 transactions.

Web Search API calls $18.00