On August 25,Emmanuelle – The Sex Lives Of Ghosts (2004) Alibaba Cloud launched an open-source Large Vision Language Model (LVLM) named Qwen-VL. The LVLM is based on Alibaba Cloud’s 7 billion parameter foundational language model Qwen-7B. In addition to capabilities such as image-text recognition, description, and question answering, Qwen-VL introduces new features including visual location recognition and image-text comprehension, the company said in a statement. These functions enable the model to identify locations in pictures and to provide users with guidance based on the information extracted from images, the firm added. The model can be applied in various scenarios including image and document-based question answering, image caption generation, and fine-grained visual recognition. Currently, both Qwen-VL and its visual AI assistant Qwen-VL-Chat are available for free and commercial use on Alibaba’s “Model as a Service” platform ModelScope. [Alibaba Cloud statement, in Chinese]
Related Articles
2025-06-26 03:45
1380 views
The Anatomy of Liberal Melancholy
J.M. Bernays ,April 25, 2017 The Anatomy o
Read More
2025-06-26 03:28
898 views
Facebook’s Oversight Board makes bizarre ruling in its first group of decisions
Facebook’s Oversight Board has officially chimed in on its first five cases — and the ru
Read More
2025-06-26 03:16
963 views
The 13 best tweets of the week, including GameStop jokes, LMFAO, and Thanos
Somehow another week has passed. Days, weeks, months, it's all the same isn't it? Pandemics are weir
Read More