open-weight tier · 2024

Llama 3.2 90B

Open-weight Llama 3.2 model with strong reasoning for an open license footprint.

Context window

128k tokens

Peak context for this model.

Availability

Self-hosted, cloud marketplaces, supported by major GPU providers

Where you can run it.

Modalities

Text · Code

Input/output coverage.

Pricing

Open-weight (no per-token licensing)

Latency: Varies by host; scales across GPU clusters

Strengths

Best for

Summary