Qwen3-Omni: first multimodal model with SoTA text, image, audio, and video perf

(arxiv.org)

2 points | by walterbell 11 hours ago

No comments yet.