Google Unveils Gemini Omni, a Multimodal Generative AI Platform
gemini google multimodal
| Source: Shacknews | Original article
Google unveils Gemini Omni, a multimodal AI platform. It combines various generative models into one service.
Google has unveiled Gemini Omni, a multimodal generative AI platform that can create content from any input, starting with video. As we reported on May 20, the company has been rebuilding its enterprise AI stack, and Gemini Omni is a key part of this effort. This new platform rolls many of Google's existing generative AI models into a single service, enabling users to generate and edit videos through simple conversation.
Gemini Omni matters because it has the potential to revolutionize content creation, making it easier for users to produce high-quality videos without extensive editing experience. The platform's ability to reason across text, images, audio, and video also opens up new possibilities for multimedia content generation. With Gemini Omni, Google is poised to take a significant lead in the AI-powered content creation market.
As Gemini Omni Flash begins rolling out to Google AI Plus, Pro, and Ultra subscribers, as well as YouTube Shorts and YouTube Create App users, we can expect to see a surge in innovative content creation. What to watch next is how Gemini Omni will be integrated into Google's existing services, such as Google Search, which recently went "agentic" and doesn't need user input anymore. The potential applications of Gemini Omni are vast, and its impact on the tech landscape will be closely watched in the coming months.
Sources
Back to AIPULSEN