Introducing GLM-4.5V: a breakthrough in open-source visual reasoning
GLM-4.5V delivers state-of-the-art performance among open-source models in its size class, dominating across 41 benchmarks.
Built on the GLM-4.5-Air base model, GLM-4.5V inherits proven techniques from GLM-4.1V-Thinking while achieving effective scaling through a powerful 106B-parameter MoE architecture.
Hugging Face: huggingface.co/zai-org/GLM-4.…
GitHub: github.com/zai-org/GLM-V
Z.ai API: docs.z.ai/guides/vlm/glm…
Try it now: chat.z.ai

Through efficient hybrid training, GLM-4.5V handles diverse types of visual content and supports visual reasoning across a wide range of scenarios, including:
- Image Reasoning (scene understanding, complex multi-image analysis, geography recognition)
- Video Understanding (long video storyboard analysis, event recognition)
- GUI Tasks (screen reading, icon recognition, desktop operation assistance)
- Complex Chart and Document Analysis (research report analysis, information extraction)
- Grounding Capability (precise localization of visual elements)
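For the grounding capability above, a common post-processing step is to turn the model's box coordinates into pixel positions. The following is a minimal sketch, not the official tooling. It assumes (this thread does not say so) that the model returns boxes as `[x1, y1, x2, y2]` on a 0 to 1000 scale relative to the image size. The helper names `parse_boxes` and `to_pixels` are hypothetical.

```python
import re

# Assumption: boxes appear in the model's text output as [x1, y1, x2, y2],
# with each value normalized to a 0-1000 range relative to the image size.
BOX_PATTERN = re.compile(r"\[\s*(\d+)\s*,\s*(\d+)\s*,\s*(\d+)\s*,\s*(\d+)\s*\]")


def parse_boxes(text):
    """Return every [x1, y1, x2, y2] integer quadruple found in the text."""
    return [tuple(int(v) for v in match) for match in BOX_PATTERN.findall(text)]


def to_pixels(box, width, height, scale=1000):
    """Convert a normalized box to pixel coordinates for a width x height image."""
    x1, y1, x2, y2 = box
    return (
        round(x1 * width / scale),
        round(y1 * height / scale),
        round(x2 * width / scale),
        round(y2 * height / scale),
    )


if __name__ == "__main__":
    # Hypothetical model reply, used only to exercise the helpers.
    reply = "The blue table is located at [100, 250, 500, 750]."
    for box in parse_boxes(reply):
        print(to_pixels(box, width=1920, height=1080))
```

Check the model card for the actual output format before relying on this; if the scale or delimiters differ, only `BOX_PATTERN` and the `scale` argument need to change.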
Webpage Replication: Please generate a high-quality UI interface using CSS and HTML based on the webpage I provided.
chat.z.ai/s/f4389582-bcd…
Grounding: Identify this blue table, where to buy it, and suggest similar styles.
chat.z.ai/s/18b481b5-837…
Video Understanding: Please analyze the video of this match, point out the key moments, and analyze each team's performance.
chat.z.ai/s/e44ee8a3-64e…
