While previous embedding models were largely restricted to text, this new model natively integrates text, images, video, audio, and documents into a single numerical space — reducing latency by as muc ...
Google introduces Gemini Embedding 2, its first multimodal embedding model designed to map text, images, audio, and video into a single space.