Tongyi Lab
@Ali_TongyiLab
1/10 🚀 Qwen3.5-Omni is here! Scaling up to a native omni-modal AGI.
Meet the next generation of Qwen, designed for native text, image, audio, and video understanding, with major advances in both intelligence and real-time interaction.
A standout feature:
Audio-Visual Vibe Coding: Describe your vision to the camera, and Qwen3.5-Omni instantly builds a functional website or game for you.
Highlights:
Script-Level Captioning: Generate detailed video scripts with timestamps, scene cuts & speaker mapping.
SOTA Performance: Qwen3.5-Omni has secured 215 SOTA scores across various sub-tasks, matching the top-tier text/vision capabilities of the Qwen3.5 series.
Audio-Visual Understanding: From auto-segmentation to fine-grained script generation, it understands the relationship between characters and their environment like never before.
Seamless Interaction: With native API support for Semantic Interruption, voice conversations feel human-like and background-noise resistant.
Global Multilingual Mastery: Pioneering support for 74 languages in speech recognition and 29 languages in expressive speech generation, breaking down global communication barriers.
Autonomous Intelligence: Native support for WebSearch and complex Function Calling—the model now independently decides when to pull real-time data.
Qwen3.5-Omni is built to be the backbone of next-gen AI applications, empowering developers and users alike with true multimodal reasoning.
[Image]
2/10 Script-Level Captioning
[Video]
3/10 Audio-Visual Vibe Coding
[Video]
4/10 Audio-Visual Vibe Coding
[Video]
5/10 Web Search
[Video]
6/10 Multi-Turn Dialogue and Intelligent Interruption
[Video]
7/10 Voice Style, Emotion and Volume Control
[Video]
8/10 Benchmark
[Image]
9/10 Try it now🚀
Qwenchat: chat.qwen.ai
Blog: qwen.ai/blog?id=qwen3.…
Hugging Face Offline Demo: huggingface.co/spaces/Qwen/Qw…
Hugging Face Online Demo: huggingface.co/spaces/Qwen/Qw…
API: alibabacloud.com/help/en/model-…
10/10 Don't miss out on the discussion. Join the server now!
discord.com/invite/mnPyh8Z…