Guest post

6 min read

25 May 2026

Google I/O 2026 Takeaways: What Actually Matters for SEO Now

Two days after Google I/O 2026, Reddit lit up with satirical "SEO is dead" obituaries. Google confirmed what many of us expected: AI Mode is now the default search interface. Gemini 3.5 Flash answers queries, dispatches background agents, and builds tools on the fly. The traditional ten blue links haven't disappeared, but they've been pushed below the fold – to the so-called 'SEO dead zone', where click-through-rates drift from 1 to 1.5%.

If you work in digital marketing, you already know the announcements. I'm not going to rehash the keynote. Instead, I want to talk about what these changes mean in practice – what the research tells us about how AI systems decide who to cite, and what you can actually do about it.

What Actually Shifted
When Google Builds the Tool for You
The Two Pipelines: How AI Decides Who to Cite
Google Said It Out Loud: AEO and GEO Are Still SEO
What You Can Do in Practice
What We Don't Know

What Actually Shifted

I'm going to skip the feature-by-feature rundown and focus on the five things that carry real strategic weight.

AI Mode as the default interface. This is the one. Not a feature, rather a platform shift. The search box still works the same way, but the answer comes through Gemini before anything else. For informational queries, the AI answer is the search result, and everything below it is supplementary.

Search Agents. Users can now set background monitoring tasks: "track flights to Lisbon under $400" or "alert me when this product drops in price." This creates persistent search loops that never generate a click. The user sets it and forgets it. No SERP, no organic listing, and no CTR.

Personal Intelligence. Gemini can now pull from Gmail, Photos, and account data to personalize answers. When someone asks "where to I eat in Barcelona," the answer comes from their personalized history, not your restaurant review. This collapses an entire category of queries into zero-click personal retrieval.

Universal Cart. Shopping across Google surfaces with one-tap checkout. The conversion funnel from discovery to purchase now lives entirely within Google. If your model depends on being in the middle of that funnel, Google just cut you out.

Generative UI. This is a separate threat, and the one I want to spend time on.

When Google Builds the Tool for You

During the keynote, Google demoed Gemini generating interactive UI elements directly in the search results – not links to tools, but the tool itself, built on the spot.

Need a mortgage calculator? Gemini builds one in the SERP. Want to convert units, run a color palette generator, check a timezone? The model renders a functioning widget right there, with no external page needed.

One r/SEO commenter connected the dots immediately: Google would generate tools on the spot, then use agents to imitate what top tool websites do, making that a permanent fixture on that SERP. The parallel to how Google already absorbed jobs, recipes, and flight results is exact.

This matters specifically because the standard advice for surviving AI answers has been "build tools, not just content." Generative UI complicates that advice. A simple calculator or converter is now trivially replaceable in the SERP. Your tool needs to do something that requires persistent state, proprietary data, or user-specific context that a generated UI can't replicate.

The Two Pipelines: How AI Decides Who to Cite

Here's where the practitioner playbook diverges from the commentary. If you want to show up in AI answers, you need to understand how LLMs acquire and retrieve information. There are two distinct pipelines, and they work differently.

Pipeline 1: Training Data (Common Crawl and Harmonic Centrality)

Most LLMs are built on Common Crawl, a public web archive that's been indexing the internet since 2008. The numbers are stark: 64% of LLMs use Common Crawl data. GPT-3 drew over 80% of its training tokens from it.

But Common Crawl doesn't treat all pages equally. It prioritizes what to crawl using a metric called Harmonic Centrality (HC) – essentially a measure of how central a domain is in the web graph based on its link structure. Higher HC means more frequent crawling and deeper page coverage, which means more of your content ends up in training data.

This has a practical consequence. A dofollow link from a high-HC domain doesn't just help your traditional search rankings – it shifts your position in the web graph that determines how much of your content LLMs will train on.

The top of the list won't surprise you: google.com, youtube.com, wikipedia.org, github.com, wordpress.org. But the operative insight is that these are the domains whose link neighborhoods determine what gets into training data. A dofollow link from wordpress.org (HC #13) doesn't just pass PageRank – it tells Common Crawl's prioritization algorithm to crawl your site more thoroughly.

Pipeline 2: Real-Time Citation (Google-Extended)

Training data is a slow-moving signal – models retrain on cycles, not daily. The real-time pipeline works through Google-Extended, a crawler that's separate from the traditional Googlebot. It reads raw HTML (not rendered JavaScript), maintains its own index, and feeds directly into Gemini's answer generation.

This is the pipeline that determines whether you get cited with a link in an AI answer today, not six months from now when the next model is trained.

Key detail: Google-Extended and Googlebot are independent systems. Being indexed by Googlebot doesn't mean Google-Extended has your content. And because Google-Extended reads raw HTML, content that's hidden behind JavaScript rendering may not be visible to this pipeline at all.

What the Citation Data Shows

Research analyzing over 150,000 LLM citations reveals patterns that should shape strategy:

Reddit accounts for 40.1% of all LLM citations. Wikipedia is second at 26.3%. This is not intuitive – but it makes sense when you consider that LLMs value recently discussed, contextually rich, human-generated content.

Position 1 in Google correlates with roughly a 46-48% probability of LLM citation. Traditional ranking still matters, but it's not the whole picture.

37% of domains cited by AI don't appear in the traditional SERP at all. This is the most underappreciated finding. You can be invisible in classic search and still get cited by Gemini if your content appears on high-authority platforms that LLMs reference.

The practical takeaway: traditional SERP optimization and AI citation optimization overlap but aren't identical. Ranking well helps. But entity presence on high-HC domains – through genuine mentions, UGC contributions, marketplace listings – creates a separate citation pathway.

Google Said It Out Loud: AEO and GEO Are Still SEO

Before anyone spins up a new acronym-based agency, it's worth noting what Google themselves published. Their own guide to AI search optimization explicitly states that Answer Engine Optimization (AEO) and Generative Engine Optimization (GEO) are "still SEO." Not parallel disciplines, but extensions of existing best practices. Platforms like RankLLM help businesses adapt to this evolution by improving AI search visibility across ChatGPT, Gemini, and Perplexity while building on proven SEO strategies rather than replacing them.

This is important because the SEO industry has a habit of rebranding existing work every time the landscape shifts. The fundamentals haven't changed: create content that's accurate, well-structured, and genuinely useful. Make sure it's crawlable, and build real authority in your space. What has changed is the area where that authority needs to exist and the mechanics of how it gets recognized.

What You Can Do in Practice

1. Check your Google-Extended crawler access. Look at your robots.txt. If you're blocking Google-Extended (or using a blanket user-agent block), you're invisible to AI Mode's citation pipeline. Decide whether the tradeoff is worth it. For most sites, it isn't.

2. Look up your domain's HC rank. If you're not in the top 50,000, your content is being under-crawled by Common Crawl, which means LLMs are training on less of it. The fix isn't fast, but it starts with earning dofollow links from high-HC domains – which, conveniently, tends to overlap with traditional high-authority link building.

3. Audit your brand mentions on high-HC domains. This isn't about links alone. Text mentions of your brand – with consistent entity, category, and attribute language – on domains that LLMs train on create the conditions for accurate brand representation in AI answers. Check your marketplace listings, profile pages, and contributed content on high-HC sites. Are they current? Do they describe what you do in clear, factual language?

4. Start tracking AI visibility. The measurement space is still maturing. A dedicated ai visibility tool like Beamtrace can automate brand monitoring across AI platforms like ChatGPT – you can run an audit of what gets cited, and how your brand is described. It's a good way to close the feedback loop right now.

What We Don't Know

In short: a lot. The measurement gap is real and uncomfortable. We don't have reliable attribution for AI-driven traffic. GSC doesn't break out AI Mode citations separately. We're making educated bets based on the best available research, and some of those bets will turn out to be wrong.

We also don't know how Generative UI will scale. The I/O demos were impressive, but demos always are. Whether Google can generate reliable, safe interactive tools across the full breadth of search queries without hallucinating is unclear.

The Real Takeaway

The version of SEO that was purely about ranking pages for informational keywords and monetizing the traffic is under existential pressure. What survives – what has always survived – is being useful to real people in ways that can't be trivially replicated. That used to mean writing better content than the next site. Now it means being the kind of entity that AI systems cite because you're the source with the proprietary data, or the community people come back to, or the tool that can't be generated on the fly.

Google's own language is telling: this is still SEO. The craft is the same, but the terrain feels different now.

Author

Alex Rostovtsev

Alex is an SEO at Elfsight, an AI visibility specialist at Beamtrace, and also the founder of WROITER – a research-based AI slop detector that offers an alternative to the current AI humanizer market, both in terms of transparency and pricing.

7 Data Privacy and Compliance Tools Every E-commerce Store Needs in 2026

Guest post

4 min read

7 Data Privacy and Compliance Tools Every E-commerce Store Needs in 2026

Salman Writer

Read article

Company

Resources

Support

Google I/O 2026 Takeaways: What Actually Matters for SEO Now

Table of Contents

What Actually Shifted

When Google Builds the Tool for You

The Two Pipelines: How AI Decides Who to Cite

Pipeline 1: Training Data (Common Crawl and Harmonic Centrality)

Pipeline 2: Real-Time Citation (Google-Extended)

What the Citation Data Shows

Google Said It Out Loud: AEO and GEO Are Still SEO

What You Can Do in Practice

What We Don't Know

The Real Takeaway

Alex Rostovtsev

Similar posts

7 Data Privacy and Compliance Tools Every E-commerce Store Needs in 2026

Company

Resources

Support

Ready to save $3,000+/year while growing your store faster?

Google I/O 2026 Takeaways: What Actually Matters for SEO Now

Table of Contents

What Actually Shifted

When Google Builds the Tool for You

The Two Pipelines: How AI Decides Who to Cite

Pipeline 1: Training Data (Common Crawl and Harmonic Centrality)

Pipeline 2: Real-Time Citation (Google-Extended)

What the Citation Data Shows

Google Said It Out Loud: AEO and GEO Are Still SEO

What You Can Do in Practice

What We Don't Know

The Real Takeaway

Alex Rostovtsev

Similar posts

7 Data Privacy and Compliance Tools Every E-commerce Store Needs in 2026

How Fintech Companies Can Generate More Leads Through SEO

Agentic AI, Explained: From Chatbots to Autonomous Workflows