Alibaba Launches New Multimodal AI Model
On Thursday, Alibaba introduced Qwen2.5-Omni-7B, the latest addition to its Qwen model family, as the company seeks to solidify its position in the generative AI sector.
The multimodal, 7-billion-parameter Qwen2.5-Omni-7B is compact enough to bring cutting-edge AI capabilities closer to everyday users.
According to Alibaba, the model can handle multiple types of inputs and generate real-time responses in either text or audio. The company has also released the model as open source.
The company highlighted several potential use cases, such as providing real-time audio descriptions for visually impaired users and offering step-by-step cooking guidance based on an analysis of available ingredients.
The model's flexibility underscores the growing demand for AI systems that go beyond text generation.
Alibaba’s foundational Qwen models have gained popularity among AI developers, who regard them as one of the few significant alternatives in China to DeepSeek’s V3 and R1 models.
