About this model
Hy3 Preview is an open-weight large language model from Tencent's Hy team, released in 2026. Architecturally it is a Mixture-of-Experts model with 295B total parameters, activating only 21B per token, paired with a Multi-Token-Prediction layer that supports speculative decoding. Per the model card and an accompanying Hugging Face technical write-up, it uses a routed-expert design with a shared expert, grouped-query attention, and a 256K context window, distributed in int4 quantization.
According to Venice's catalog description, the model is aimed at complex reasoning, instruction following, in-context learning, coding and agent tasks. The Hugging Face write-up frames the architecture as a "rebuilt" Hunyuan with a new reasoning recipe, routing routine queries quickly while directing harder problems to deeper reasoning chains. Its listed capabilities include reasoning, code optimization, function calling and web search.
As the newest entry in Tencent's Hy family, the team presents Hy3 Preview as an upgrade over earlier Hunyuan releases, with the MoE design intended to deliver larger-model quality at a smaller active-parameter inference cost. Tencent also reports results on STEM and agent evaluations, but these are presented as self-reported figures rather than independently verified numbers.
The weights are published on Hugging Face, with a separate base checkpoint available for further fine-tuning, under Tencent's Hy community license. The release has drawn meaningful download and engagement activity on Hugging Face since launch.
This About section is AI-generated from public sources (Claude Opus 4.8), with no human editing. It may contain inaccuracies — verify critical details against the sources listed above.
Data sources: Venice API · HuggingFace · Wikipedia — enrichment updated 2h ago