About this model
GPT OSS 120B is the larger member of OpenAI's open-weight gpt-oss series, a Transformer using mixture-of-experts to keep only 5.1B of its 117B total parameters active per token. This catalog entry packages that model inside a Trusted Execution Environment (TEE), adding hardware attestation evidence so users can independently verify that inference runs in a confidential, tamper-resistant enclave. It carries forward the base model's permissive Apache 2.0 license, configurable reasoning depth, full chain-of-thought visibility, and native tool use including function calling, browsing, and structured output.
Within this confidential-compute family, the model is the higher-capacity counterpart to GPT OSS 20B. The two share the same architecture and TEE wrapper, but the 120B variant activates 5.1B parameters per token versus the 20B model's 3.6B, and OpenAI positions the larger model for production, general-purpose, high-reasoning workloads while the smaller one targets lower-latency or memory-constrained deployment.
Compared to the standard non-enclave release, OpenAI GPT OSS 120B, the weights and capabilities are identical; the distinguishing feature here is end-to-end encryption and verifiable execution rather than any change to the model itself.
On capability, OpenAI reports that gpt-oss-120b reaches near-parity with its o4-mini model on core reasoning benchmarks while running efficiently on a single 80GB GPU. For broader chat, image, and embedding needs, the provider's same-period lineup also includes models such as GPT-5.5.
This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.
Research & Papers
Primary reference paper for this model family, sourced from the HuggingFace model card.
Data sources: Venice API · HuggingFace · Wikipedia · arXiv — enrichment updated 1d ago