F5.5G Leap-forward Development of Broadband in Africa The Africa Broadband Forum 2024 (BBAF 2024) was successfully held in Cape Town, South Africa recently, under…
Google Doubles Down on Generative AI Across Search, Android, and Workspace
At its annual I/O developer conference this week, Google unveiled a sweeping set of AI-powered features aimed at transforming how users search, interact with their Android devices, and collaborate in Workspace. The tech giant doubled down on its generative AI efforts, showcasing major advances in language models, image and video generation, and creative tools for music.
The centerpiece is Gemini 1.5 Pro, Google’s new flagship large language model boasting a staggering 1 million token context window. This allows it to ingest and analyse vast datasets like full-length videos or thousands of pages in one go. Gemini 1.5 Pro will underpin upgrades across Google’s product lineup, from enhancing Search with multi-step reasoning to powering advanced Workspace features.
In Search, users will soon be able to ask complex, multi-part queries and get comprehensive AI-synthesised answers. A “Search with video” feature will let people simply record a clip to find relevant information. For Android, Gemini will enable capabilities like “Circle to Search” for exploring anything on-screen, enhanced accessibility via TalkBack’s screen reader, and live scam call detection.
Google is also supercharging its Workspace suite of productivity apps. The company highlighted AI-powered draft email replies in Gmail’s mobile app, automated summaries for long email threads, and a conversational “Live” experience. A new Gemini side panel provides intelligent suggestions leveraging the 1.5 Pro model’s expanded context window.
Not to be outdone, Google’s Imagen and Veo models for text-to-image and text-to-video generation leveled up. Imagen 3 promises exceptional fidelity, with variants tailored for speed or quality. Veo can generate high resolution, nuanced videos up to a minute long based on text prompts, opening new creative frontiers.
On the audio front, Google unveiled a family of Lyria models to generate music from text. Early samples created in collaboration with artists like Wyclef Jean hint at AI’s potential for music production.
Amid the generative AI boom, Google is also focused on responsible development. It demoed SynthID watermarking to identify AI-generated images and videos. The AI Test Kitchen expanded to over 100 countries for user feedback.
“We’re infusing generative AI across our entire product suite to truly reinvent how people search, work, create, and experience the world around them,” said Sundar Pichai, Google’s CEO. The company’s I/O announcements send a clear signal that it’s going all-in on this transformative technology wave.
Read next: AI unlocks $15.7 Trillion opportunity, but hurdles await Africa