Gemini
Gemini is Google's AI platform. It encompasses various multimodal models (including Flash, Pro, and further variants; model names change per release) that can natively process text, images, audio, and video.
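As a rough illustration of what multimodal input can look like in practice, the sketch below builds a request body that combines a text prompt with an inline image. It assumes the JSON layout used by the public Gemini REST API (`contents`, `parts`, `inline_data`); the helper name `build_multimodal_request` and the placeholder image bytes are hypothetical, and nothing is sent over the network.

```python
import base64
import json


def build_multimodal_request(prompt: str, image_bytes: bytes,
                             mime_type: str = "image/png") -> dict:
    """Build a JSON-serialisable body mixing a text part and an inline image part.

    Assumption: field names follow the Gemini REST API's generateContent
    request format; check the current API reference before relying on them.
    """
    return {
        "contents": [
            {
                "parts": [
                    # Plain text instruction for the model.
                    {"text": prompt},
                    # Binary media is base64-encoded and tagged with its MIME type.
                    {
                        "inline_data": {
                            "mime_type": mime_type,
                            "data": base64.b64encode(image_bytes).decode("ascii"),
                        }
                    },
                ]
            }
        ]
    }


# Placeholder bytes stand in for a real image file.
body = build_multimodal_request("Describe this image.", b"placeholder image bytes")
print(json.dumps(body, indent=2))
```

Audio and video would be added as further parts in the same way, with a different MIME type. Larger files are usually uploaded separately and referenced rather than inlined.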
Core concept
Gemini's core idea is deep integration into the Google ecosystem. In suitable configurations, it can access data in Google Drive, Gmail, and Docs to automate tasks. Some models also support a very large context window (up to 2 million tokens, depending on the model and release).
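To give a feel for what a context window of this size means, the following sketch estimates whether a set of documents would fit into a 2-million-token budget. It is a back-of-the-envelope check only: the ratio of about four characters per token is an assumed heuristic for English text, not Gemini's actual tokenizer, and the function names and the reserved output budget are hypothetical.

```python
CONTEXT_WINDOW_TOKENS = 2_000_000  # upper bound quoted above; varies by model
CHARS_PER_TOKEN = 4                # assumption: rough heuristic, not a real tokenizer


def estimate_tokens(text: str) -> int:
    """Estimate the token count of a text using ceiling division."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(documents: list[str],
                    reserved_for_output: int = 8_192,
                    limit: int = CONTEXT_WINDOW_TOKENS) -> bool:
    """Return True if all documents plus an output reserve stay within the limit."""
    total = sum(estimate_tokens(doc) for doc in documents)
    return total + reserved_for_output <= limit


small_corpus = ["a" * 4_000]      # roughly 1,000 estimated tokens
large_corpus = ["a" * 8_000_000]  # roughly 2,000,000 estimated tokens

print(fits_in_context(small_corpus))
print(fits_in_context(large_corpus))
```

For real workloads, an exact count from the provider's token-counting facility is preferable to a character-based estimate.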
Assessment
- Use case: Automating office workflows, analysing large volumes of video and data, and serving as an alternative to OpenAI's models.
- Advantage: Direct integration with Google Workspace, high speed with the Flash variant, and multimodal processing of text, images, audio, and video.
- Limitation: Data protection concerns, since user data may be used for training; Enterprise settings are mandatory to address this.
Related topics
- AI Development, the development context for AI-assisted work with Gemini.
- GenAI and RAG, the pipeline in which Gemini is applied.
- OpenAI, the comparable service against which Gemini is measured.
- Language Models, the overview of the model landscape.