Gemini Embedding 2 brings text, video, and audio together
Google DeepMind released Gemini Embedding 2, a natively multimodal embedding model that can handle text, images, short video clips, and audio in one space. The model is being integrated into Workspace for AI Ultra and Pro users, with features like full-draft generation, design-aligned presentations, automated dashboards, and cross-file Q&A with citations. Google is also piloting a multi-agent planning mode for Gemini Business.
- It can process text, images, video, and audio in a single embedding space.
- Workspace integration adds cross-file question answering with citations.
- Google is also piloting a multi-agent planning mode for Gemini Business.
