DeepSeek Unveils Advanced Computer Vision Technology

deepseek multimodal

2026-06-18 | Source: HN | Original article

DeepSeek introduces AI vision capability. Chinese AI start-up expands chatbot features.

DeepSeek has introduced a vision feature, enabling its chatbot to process images and video in addition to text. This move brings the Chinese artificial intelligence start-up in line with its rivals, which already offer multimodal capabilities. As reported earlier, DeepSeek has been working on its vision capabilities, with predecessors such as DeepSeek-VL and DeepSeek-VL2 demonstrating competence in image comprehension tasks. The introduction of vision capabilities matters as it expands the potential applications of DeepSeek's technology, allowing users to upload images for analysis and enabling more complex tasks such as image understanding and generation. This development also underscores DeepSeek's efforts to close the gap with its competitors, including those mentioned in our previous reports, such as Claude. What to watch next is how DeepSeek's vision feature will be received by users and how it will impact the company's position in the market. With its vision model claimed to be 10 times cheaper than existing multimodal AI solutions, DeepSeek may be able to gain a competitive edge and attract more customers to its platform.

Sources

Back to AIPULSEN