model brief

OpenAI

flagship tier · 2024

GPT-4.1

Flagship multimodal model with strong reasoning, structured outputs, and tool-use alignment.

Context window

128k tokens

Maximum context length for this model.

Availability

OpenAI API, Assistants API, Azure OpenAI

Where you can run it.

Modalities

Text · Vision · Code

Input/output coverage.

Pricing

$5.00 / 1M input tokens, $15.00 / 1M output tokens

Latency: Balanced; optimized for high-quality responses
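
As a rough illustration of how the per-token rates listed above translate into a per-request cost, here is a minimal sketch. It assumes the two rates in this brief apply as flat per-token prices with no discounts or tiers. The helper name `estimate_cost` and the constants are hypothetical and not part of any OpenAI SDK.

```python
from decimal import Decimal

# Rates copied from the pricing line in this brief (USD per 1M tokens).
# Assumption: flat pricing, no caching or batch discounts.
INPUT_RATE_PER_M = Decimal("5.00")
OUTPUT_RATE_PER_M = Decimal("15.00")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Scale each rate by the fraction of a million tokens consumed.
    input_cost = INPUT_RATE_PER_M * input_tokens / TOKENS_PER_M
    output_cost = OUTPUT_RATE_PER_M * output_tokens / TOKENS_PER_M
    return input_cost + output_cost
```

Under these assumptions, a request with 12,000 input tokens and 800 output tokens comes to $0.06 + $0.012 = $0.072. `Decimal` is used instead of floats so that small per-request amounts add up without rounding drift.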

Strengths

  • Deep reasoning with low hallucination rates and stable system prompt adherence.
  • Multimodal grounding for screenshots, documents, diagrams, and charts.
  • Structured outputs that conform closely to JSON and function-call schemas.

Best for

  • Agent orchestration that mixes planning, tools, and guardrails.
  • Compliance, evaluations, and quality checks that need reliable citations.
  • Product experiences where tone and safety need to stay consistent.

Summary

  • Tier: flagship
  • Release: 2024
  • Latency: Balanced; optimized for high-quality responses