# site-a — permissive AI bot policy. # Marketing flagship: maximize discoverability and AI surface area, including # training corpora. If a bot identifies itself, we let it in. # --- Traditional search engines --- User-agent: Googlebot Allow: / User-agent: Bingbot Allow: / User-agent: DuckDuckBot Allow: / User-agent: Applebot Allow: / # --- Citation / answer-engine bots (link back to source) --- User-agent: PerplexityBot Allow: / User-agent: OAI-SearchBot Allow: / User-agent: ChatGPT-User Allow: / User-agent: Claude-User Allow: / User-agent: Claude-SearchBot Allow: / User-agent: Applebot-Extended Allow: / # --- Training crawlers: ALLOWED on site-a (per-site policy) --- User-agent: GPTBot Allow: / User-agent: ClaudeBot Allow: / User-agent: Google-Extended Allow: / User-agent: Anthropic-AI Allow: / User-agent: CCBot Allow: / User-agent: Bytespider Allow: / User-agent: Amazonbot Allow: / User-agent: FacebookBot Allow: / User-agent: Meta-ExternalAgent Allow: / # --- Default --- User-agent: * Allow: / Sitemap: https://site-a.example.com/sitemap-index.xml Sitemap: https://site-a.example.com/sitemap-image.xml