Connect with us

Alibaba’s Qwen Team Launches AI Models That Can Control PCs and Phones

Alibaba Headquarters

Credit: Pexels

Alibaba’s new AI models redefine smart assistants with groundbreaking features

Hangzhou-based tech giant Alibaba isn’t letting the spotlight stay on its competitors for long. This week, its Qwen team made headlines with the release of Qwen2.5-VL, a groundbreaking family of AI models designed to push boundaries in text and image analysis – and even control your devices.

What Makes Qwen2.5-VL Special?

Qwen2.5-VL isn’t your average AI model. These cutting-edge tools can do more than just chat – they analyze documents, interpret videos, count objects in images, and even recognize products and characters from movies and TV shows. Imagine having an assistant that can comb through hours-long videos, extract data from invoices, or decode complex charts – all in one seamless experience.

But here’s the real showstopper: Qwen2.5-VL can interact directly with PCs and smartphones. Whether it’s launching apps or performing tasks like booking flights, these models bring new meaning to the term “smart assistant.” A demo shared by Philipp Schmid, a technical lead at Hugging Face, even showcased the AI booking a flight on Android – a glimpse of what’s to come in hands-free tech.

A Competitive Edge in the AI Race

Alibaba’s Qwen2.5-VL is making waves for more than its flashy features. According to Alibaba, the flagship model, Qwen2.5-VL-72B, outperformed big names like OpenAI’s GPT-4o, Google’s Gemini 2.0 Flash, and Anthropic’s Claude 3.5 Sonnet in various benchmarks. From video comprehension to mathematical reasoning and document analysis, the Qwen team is proving they’re serious players in the global AI race.

For those eager to try it out, the models are available through Alibaba’s Qwen Chat app or on Hugging Face, an AI developer platform. Smaller models like Qwen2.5-VL-3B and Qwen2.5-VL-7B are open for use under a permissive license, while the more advanced Qwen2.5-VL-72B comes with licensing conditions for larger enterprises.

Censorship and AI in China

Like other Chinese-developed AI systems, Qwen2.5-VL adheres to the country’s strict regulatory environment. Some topics, particularly those sensitive to Chinese authorities, are off-limits. For instance, attempts to query politically sensitive issues, such as criticism of Xi Jinping, result in error messages.

China’s internet regulator enforces guidelines requiring AI responses to align with “core socialist values.” While this limits certain discussions, it doesn’t diminish the impressive capabilities of Qwen’s models in practical applications.

The Path Forward for Qwen2.5-VL

Qwen2.5-VL represents Alibaba’s bold step into the future of AI – one where machines don’t just answer questions but actively engage with the world around them. While there’s still room for growth, particularly in real-world operating system environments, the potential is undeniable.

With competition heating up and Alibaba positioning itself as a leader in AI innovation, the Qwen team is clearly signaling they’re here to stay. Whether you’re a tech enthusiast, developer, or someone curious about what AI can do, the Qwen2.5-VL models are a sign of the exciting – and transformative – road ahead.