# Block AI crawlers: model-training and AI search bots

# OpenAI
User-agent: GPTBot
Disallow: /

User-agent: OAI-SearchBot
Disallow: /

# Google (opt out of data use for Gemini)
User-agent: Google-Extended
Disallow: /

# Apple (opt out of data use for Apple Intelligence)
User-agent: Applebot-Extended
Disallow: /

# Anthropic (Claude)
User-agent: ClaudeBot
Disallow: /

User-agent: Claude-User
Disallow: /

User-agent: Claude-SearchBot
Disallow: /

# Perplexity
User-agent: PerplexityBot
Disallow: /

# Common Crawl (many models use this dataset)
User-agent: CCBot
Disallow: /

# Meta (Facebook/Instagram): AI/training bots
User-agent: meta-externalagent
Disallow: /

User-agent: Meta-ExternalFetcher
Disallow: /

# ByteDance / TikTok
User-agent: Bytespider
Disallow: /

# Amazon (Alexa/AI)
User-agent: Amazonbot
Disallow: /

# (Optional) Allow all other crawlers, including regular search bots.
# An empty Disallow value permits everything.
User-agent: *
Disallow: