Google’s Gemini AI model is integrated into much of the giant’s technology

2024-05-15Last Updated: 2024-05-15

1,858 2 minutes read

Google’s Gemini AI model has been integrated into much of the tech giant’s technology, with AI soon to appear in Gmail, on YouTube, and on the company’s smartphones.

In a keynote speech at the company’s I/O 2024 developer conference on May 14, CEO Sundar Pichai revealed some of the upcoming places its AI model will appear.

Pichai mentioned AI 121 times in his 110-minute keynote, with the topic taking center stage — and Gemini, which launched in December, stealing the spotlight.

Google is integrating the Large Language Model (LLM) into almost all of its offerings, including Android, Search, and Gmail, and here’s what users can expect from now on.

Sundar Pichai at Google I/O 2024. Source: Google

Application interactions

Gemini gets more context in that it will be able to interact with apps. In the upcoming update, users will be able to connect with Gemini to interact with apps such as dragging and dropping the AI-generated image into the message.

YouTube users will also be able to click “Ask this video” to find specific information within the video from the AI.

Gemini in Gmail

Google’s email platform, Gmail, is also getting AI integration where users will be able to search, summarize and draft their emails using Gemini.

The AI assistant will be able to take action on emails for more complex tasks, such as helping process e-commerce returns by searching your inbox, finding your receipt, and filling out online forms.

Gemini Live

Google also unveiled a new experience called Gemini Live where users can have “deep-depth” voice conversations with artificial intelligence on their smartphones.

The chatbot can be interrupted mid-answer to clarify and will adapt to users’ speech patterns in real time. In addition, Gemini can also see and respond to the surrounding physical environment via photos or videos captured on the device.

*Screenshot from Gemini promotional video. Source: Google*

Multimedia progress

Google is developing intelligent AI agents that can reason, plan, and complete complex, multi-step tasks on behalf of the user under supervision. Multimedia means that AI can go beyond text and handle image, audio and video input.

Examples and early use cases include automating shopping returns and exploring a new city.

Related: Google’s Gemini Gemini has been released, here’s how to try it

Other updates in the pipeline for the company’s AI model include replacing Google Assistant on Android with Gemini fully integrated into the mobile operating system.

The new “Ask Photos” feature allows you to search your photo library using natural language queries powered by Gemini. It can understand context, recognize objects and people, and summarize image memories in response to questions.

AI-generated summaries of places and regions will be displayed in Google Maps using insights from the platform’s mapping data.

magazine: “Sic AIs on each other” to prevent an AI apocalypse: David Brin, science fiction author