Google rolled out the bulk of its 2026 AI stack at I/O on May 19, 2026, anchored by three releases. Gemini 3.5 Flash skipped the preview stage and went to general availability with a price increase that brings it above Gemini 2.5 Flash; Google's framing is that 3.5 Flash is now the default workhorse model for everything agentic at Google scale, with sharply better tool-calling and the ability to run as both a non-reasoning and a reasoning model from the same checkpoint. Artificial Analysis's eval shipped same-day: 3.5 Flash sits at 55.3 on the Intelligence Index, the new leader on the intelligence-versus-speed curve, with 278 output tokens per second. Simon Willison flagged that despite the price bump it is still cheap relative to GPT-5.4 and Claude Sonnet 4.6 and the long-context AA-LCR score of 69 percent matches GPT-5.4 mini at a fraction of the cost.
Gemini Omni is the headline multimodal release — an any-to-video model that takes images, text, and audio as input and produces video with synchronized audio. The internal name during the run-up was NanoBanana-for-Video. Google described it as the foundation for a single end-to-end generative-media surface inside the Gemini app, and demos at I/O showed text-prompted music videos, image-conditioned cinematics, and reference-driven scene edits — all returned with on-model audio rather than the dub-on-top pipeline the prior Veo line used. Coverage from TechCrunch and The Information emphasizes that Omni's intended competitor isn't Sora 2 or Runway Gen-4 directly; it's the bundle — a single model behind Search, the consumer app, the editing workspace, and the developer API, all running at the Flash price tier.
Gemini Spark is the third release: a persistent agent that lives across Gmail, Calendar, Drive, Workspace, and Android, surfaces proactive work, and runs background tool calls 24 hours a day with user-set guardrails. TechCrunch's read is that this is Google's most direct answer to ChatGPT's memory plus agent stack and Claude's Computer Use — the differentiator is the Gmail and Calendar context that Google can hand it natively, which OpenAI and Anthropic both have to bolt on. Spark is paired with refreshed Gemini app builds across mobile and web, a Universal Cart shopping integration, and new voice prompting in Docs and Keep — Google is betting visibly on agents over chatbots as the surface for I/O 2026. The downstream Search story is that Google Search as you know it is over: the AI Overviews surface is being expanded into a full agentic mode where the assistant can act inside results pages, place calls, and execute multi-step purchases without bouncing the user back to a search results page. The Information's read is that Google is finally willing to cannibalize search-ad revenue to defend the funnel.
Daniel will care about the eval implication: at Flash-level prices, 3.5 Flash now beats Mistral Medium 3.5, DeepSeek V3.2 and nearly matches GPT-5.4 mini, which prices the bottom of the frontier-LLM tier sharply down and squeezes the open-weight cohort that was previously the cheapest path to a 50-plus Intelligence Index score.
- Latent Space (swyx & Alessio) framed the day as the most agent-focused I/O ever — emphasis on Spark and the Android CLI, not just Gemini 3.5 Flash.
- Simon Willison called out that Gemini 3.5 Flash is the new default-everything model at Google, despite a noticeable price bump.
- Artificial Analysis ran same-day independent evals: 3.5 Flash sits at Intelligence Index 55.3, 278 tokens/sec — new leader on the intelligence-versus-speed curve.
- TechCrunch's read is that Spark marks Google betting on agents over chatbots for 2026.
- The Information emphasized the commercial play: Google is willing to cannibalize Search ad revenue with the new AI Search surface and Universal Cart.
- Gradient Flow (Ben Lorica) called I/O 2026 the moment the agent layer started to take shape across the major labs.