Q&A Last updated: May 2026

What Should My robots.txt Look Like for AI Search?

Your robots.txt needs to allow AI crawlers: GPTBot, OAI-SearchBot, ClaudeBot, PerplexityBot, and Google-Extended. Exact configuration here.

OM
Oliver Mackman
AI Search Analyst

Block AI training crawlers (GPTBot, ClaudeBot, Google-Extended, CCBot). Allow AI search crawlers (OAI-SearchBot, Claude-SearchBot, PerplexityBot) and user-session crawlers (ChatGPT-User, Claude-User). This protects your content from free training while letting AI search engines cite you.

What should robots.txt look like for AI search

The recommended robots.txt configuration

There are three types of AI crawler. Block training bots (they take your content to train models). Allow search bots (these power the AI search results that cite you). Allow user bots (triggered when someone asks an AI about you).

User-agent: *
Allow: /
Sitemap: https://yoursite.com/sitemap.xml

# TRAINING CRAWLERS (block)
User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: Applebot-Extended
Disallow: /

User-agent: CCBot
Disallow: /

# SEARCH CRAWLERS (allow - these power AI search results)
User-agent: OAI-SearchBot
Allow: /

User-agent: Claude-SearchBot
Allow: /

User-agent: PerplexityBot
Allow: /

User-agent: Googlebot
Allow: /

User-agent: Bingbot
Allow: /

User-agent: Applebot
Allow: /

User-agent: GoogleOther
Allow: /

# USER-SESSION CRAWLERS (allow - when users ask AI about you)
User-agent: ChatGPT-User
Allow: /

User-agent: Claude-User
Allow: /

User-agent: Perplexity-User
Allow: /

The three types of AI crawler

TypeCrawlerPlatformWhat it doesAllow?
TrainingGPTBotOpenAIScrapes content to train AI modelsBlock
TrainingClaudeBotAnthropicScrapes content to train ClaudeBlock
TrainingGoogle-ExtendedGoogleTrains Gemini and AI OverviewsBlock
TrainingApplebot-ExtendedAppleTrains Apple Intelligence / SiriBlock
TrainingCCBotCommon CrawlOpen dataset used by many AI modelsBlock
SearchOAI-SearchBotOpenAIIndexes for ChatGPT search resultsAllow
SearchClaude-SearchBotAnthropicIndexes for Claude search resultsAllow
SearchPerplexityBotPerplexityIndexes for Perplexity answersAllow
SearchGooglebotGoogleIndexes for Google Search + AI OverviewsAllow
SearchBingbotMicrosoftIndexes for Bing + CopilotAllow
UserChatGPT-UserOpenAIWhen a user asks ChatGPT to visit your siteAllow
UserClaude-UserAnthropicWhen a user asks Claude to visit your siteAllow
UserPerplexity-UserPerplexityWhen Perplexity fetches your page for a userAllow

How to check your current robots.txt

Go to yoursite.com/robots.txt in a browser. If you see Disallow: / for any of the crawlers above, that platform is blocked from accessing your content.

Common mistakes

  • WordPress default blocks - some WordPress themes and security plugins add AI crawler blocks by default
  • Cloudflare bot protection - Cloudflare's bot management can block AI crawlers without you knowing. Check Security > Bots
  • Blocking GPTBot but allowing OAI-SearchBot - you need both for full ChatGPT visibility
  • Not having a robots.txt at all - this is fine (all bots are allowed by default) but explicit rules are better practice

Related questions

OM

Oliver Mackman

AI Search Analyst, SEOCompare

Oliver leads SEOCompare's editorial and comparison research. With over a decade in digital marketing, he oversees agency evaluation, tool testing, and AI search data analysis.

Last reviewed: 7 April 2026

Need help with AI search visibility?

Get a free AI visibility audit to see how your business appears across ChatGPT, Gemini, Perplexity, and AI Overviews.

Request your free audit