Operational definition of GEO
Generative Engine Optimization (GEO) is the set of editorial, technical and semantic practices that maximise the probability that content is selected, cited and reproduced in responses generated by AI answer engines (ChatGPT, Perplexity, Google AI Overviews, Claude, Gemini, Microsoft Copilot).
The term was introduced and formalised by researchers from Princeton, IIT Delhi, Georgia Tech and Allen AI in a paper published on arXiv in November 2023 (Aggarwal et al., arXiv:2311.09735), then accepted at KDD 2024 in Barcelona. It is the foundational academic reference for the field.
GEO, SEO, AEO: three distinct disciplines
For a full comparative, see SEO vs GEO vs AEO. In brief:
| Criterion | SEO | GEO | AEO |
|---|---|---|---|
| Optimised object | The page as a whole | The citable passage within a synthesised response | The passage that directly answers a question |
| User output | A link in a SERP | A written response, sometimes cited | A literal extract (featured snippet, voice assistant) |
| Engines | Google, Bing, Qwant | ChatGPT, Perplexity, AI Overviews, Gemini, Claude | Featured snippets, Alexa, Siri |
| Metric | Position, CTR, impressions | Citation rate, AI share of voice | Snippet appearance |
GEO builds on SEO, not in its place. A page invisible to Google will not be cited by an LLM.
AI engines in 2026
According to the Goodie Wave 2 study (published 21 May 2026, B2B brand panel, March-April 2026 data), the distribution of AI referrals is:
- ChatGPT Search: 62.6% of AI referrals (down from 89.1% in May-August 2025)
- Claude: 18.5% (up from 1.4%)
- Gemini: 10.6%
- Perplexity: 7.3%
- Microsoft Copilot: approximately 4%
Source: Goodie, "2026 AI Search Traffic Report", 21 May 2026 (41 B2B brands, 25.77 billion SimilarWeb visits Jan-April 2026).
ChatGPT concentration remains dominant but is declining: its market share fell from 89% to 63% in 8 months. Claude's rise (+17 percentage points) is particularly notable. Optimising exclusively for ChatGPT or Perplexity in 2026 would be a coverage error.
How AI engines select their sources
Most modern AI answer engines operate on a RAG (Retrieval Augmented Generation) pipeline:
- Crawl and indexing: the engine's bot crawls web pages. ChatGPT Search uses OAI-SearchBot, Perplexity uses PerplexityBot, Google AI Overviews uses Googlebot and Google-Extended.
- Vectorisation and storage: extracted text is chunked, converted to numerical vectors and stored in a vector database.
- Retrieval and generation: when a query arrives, the engine retrieves the most semantically similar chunks and passes them as context to the LLM, which generates a synthesised response.
Key implication: it is the chunk (the passage), not the entire page, that is selected. A 2,000-word article may produce 8 to 12 chunks. Only the most relevant chunk for the query will be retrieved. This is why the structure of each paragraph matters as much as overall article quality.
The reference study: Princeton GEO-bench
The foundational GEO study was conducted by Aggarwal et al. (Princeton, IIT Delhi, Georgia Tech, Allen AI), published on arXiv in November 2023 and accepted at KDD 2024 (ACM DOI: 10.1145/3637528.3671900).
Methodology: GEO-bench, a benchmark of 10,000 queries across 9 datasets and 25 thematic domains. Nine editorial interventions were tested on GPT-3.5-turbo, validated on 200 Perplexity.ai queries. Primary metric: Position-Adjusted Word Count (PAWC).
| Intervention | Impact on visibility (PAWC) |
|---|---|
| Quotation Addition | +41% |
| Statistics Addition | +31% |
| Cite Sources | +28% |
| Fluency Optimization | +17% |
| Simplification | +12% |
| Authoritative Tone | +11% |
| Keyword Stuffing | -8 to -10% |
Caveat: this study was run on GPT-3.5 and Perplexity in 2023-2024. Current models may behave differently. The direction of effects remains relevant, but exact amplitudes should be treated with caution.
The 6 GEO levers
Lever 1: self-contained passages
Each paragraph must be understandable without surrounding context. An LLM extracting a chunk does not see what comes before or after. A passage starting with "As we saw above..." is unusable out of context.
Lever 2: cited statistics and data
Dated, sourced statistics increase visibility by +31% (Princeton GEO). An LLM prefers to cite a passage containing "74% of companies that adopted GEO improved their AI visibility in 6 months (Source: Study X, 2025)" over a generic passage.
Lever 3: quotations and explicit attribution
The most powerful lever according to Princeton (+41%). Directly quoting experts, studies, official definitions, with attribution. This signals to the LLM that the content is verifiable and anchored in external trusted sources.
Lever 4: entity disambiguation
An LLM must be able to associate your content with a clearly defined entity: complete Organization or Person schema, consistent presence on authority platforms (Wikipedia, Wikidata, specialist press), sameAs signals across your properties.
Lever 5: AI crawler accessibility
An LLM can only cite what it has indexed. Verify that robots.txt allows relevant AI bots (GPTBot, OAI-SearchBot, PerplexityBot, ClaudeBot, Google-Extended). Ensure pages are accessible without JavaScript. See Technical optimisation for AI visibility.
Lever 6: schema.org structured data
Structured data improves content understanding by AI parsers. Priority schemas: Article, FAQPage, HowTo, Organization, BreadcrumbList.
Measuring GEO visibility
- Google Search Console AI Overviews report: tracks impressions and clicks from AI Overviews. The most direct source for Google AIO visibility.
- Manual query sampling: test your 10-20 target queries regularly in ChatGPT, Perplexity and Claude. Document whether your site is cited and which URL.
- Server logs: detect AI bot crawls (OAI-SearchBot, PerplexityBot, ClaudeBot, Google-Extended). A site not crawled by these bots cannot be cited.
- Specialised tools: BrandRadar (Ahrefs), Semrush AI Toolkit, and dedicated solutions like Profound are beginning to offer multi-engine coverage.
GEO implementation plan in 8 steps
- Crawlability audit: robots.txt, JavaScript rendering, TTFB, page accessibility for AI bots.
- Entity disambiguation audit: Organization schema, Wikidata, press presence.
- Target query mapping: identify the 20-30 queries you want AI engines to cite you for.
- Content passability audit: for each key page, identify current chunks and assess self-containment.
- Restructure priority passages: rewrite the 3-5 most strategic passages per page in BLUF-first mode.
- Add data and sources: integrate dated, sourced statistics in each restructured passage.
- Deploy schema.org: Article, FAQPage, Organization, BreadcrumbList on all strategic pages.
- Set up measurement: configure GSC AI Overviews report, enable AI bot server logs, establish query sampling baseline.
For a full audit across all these dimensions, see the 40-point checklist and the LOOP methodology.
Frequently asked questions
- What is the difference between GEO and SEO?
- SEO targets a ranking in a list of links (SERP). GEO targets a citation in a response synthesised by a generative engine (ChatGPT, Perplexity, AI Overviews). The metric changes: in SEO it is position, in GEO it is citation rate and share of voice in AI responses. Base technical signals overlap, but GEO adds specific requirements around passage structure, entity authority and machine readability.
- Which AI engines does GEO apply to?
- In 2026, the main AI answer engines are: ChatGPT Search (62.6% of AI referrals in March-April 2026 per Goodie), Claude AI (18.5%), Google AI Overviews (deployed in France since June 2025), Gemini (10.6%), Perplexity (7.3%) and Microsoft Copilot (~4%). Each has its own retrieval signals, but core GEO principles apply to all.
- Does GEO replace classic SEO?
- No. SEO remains the essential foundation: without correct crawling and a baseline domain authority, no page will be cited by generative engines. Both disciplines share common fundamentals (markup, authority, quality content) but diverge at second-level optimisations. A site well-optimised for SEO is almost always better positioned for GEO. The reverse is not guaranteed.
- Which GEO levers are most effective according to available data?
- The reference study (Aggarwal et al., Princeton/IIT Delhi/Georgia Tech/Allen AI, KDD 2024, arXiv:2311.09735) measured 9 interventions across 10,000 queries. Most effective: adding citations and quotations (+41% visibility on Position-Adjusted Word Count), adding statistics (+31%), citing sources (+28%). The only negative intervention: keyword stuffing (-8 to -10%).
- How do AI Overviews affect organic traffic in France?
- Selectively. According to the Seer Interactive study (53 brands, 2.43 billion impressions, Jan 2025-Feb 2026), organic CTR on SERPs with AI Overviews remains lower than without. Pages cited in an AIO achieve 2.1% average CTR, versus 0.9% for pages on the same SERP but not cited (+133% difference). In France, AI Overviews were deployed from late June 2025 (JDN, May 2026) and primarily affect informational queries.