Alibaba Open-Sources Multimodal Reasoning Model_English_

Alibaba Open-Sources Multimodal Reasoning Model

Time：2025-07-09 15:59:00 Source： Youth.cn China Youth International

Alibaba Group has launched its latest multimodal large language model, HumanOmniV2, with accuracy soaring to 69.33%. The standout feature of HumanOmniV2 is its mandatory context summarization mechanism, enabling multimodal reasoning based on global context, significantly enhancing the model's understanding of complex scenarios. Simply put, it doesn’t draw conclusions from partial information but considers all relevant data before responding. This avoids the "out-of-context" pitfalls common in traditional AI models, making its answers more accurate and reliable.

[ By Zhang Liyan ]

Editor：Hou Qianqian

CULTURE

Jiangxi’s ancient sta
Yangshuo held Golden D
Intangible heritage "t
Teams gather for tradi

TOP PHOTO

Shenyang Palace Museum
Coastal wetland "drape
Hohhot improves public
Sunset bathes sky in c
High-speed trails & go