# 妲己 · Daji 18+ AI Companion · robots.txt # RTA-5042 兼容 · 允许 LLM/搜索引擎索引(成年用户) # Note: 此文件部署到 /daji/robots.txt · 域根 /robots.txt 由 nginx 合并多项目规则 # 如需独立部署,rules 已用 /daji/ 前缀,与域根放置语义一致 # ============ 通用搜索引擎 ============ User-agent: Googlebot Allow: / User-agent: Bingbot Allow: / User-agent: Baiduspider Allow: / User-agent: 360Spider Allow: / User-agent: Sogou web spider Allow: / User-agent: Sogou inst spider Allow: / User-agent: YandexBot Allow: / User-agent: DuckDuckBot Allow: / User-agent: Slurp Allow: / User-agent: Applebot Allow: / User-agent: PetalBot Allow: / # ============ LLM / 生成式引擎 ============ User-agent: GPTBot Allow: / User-agent: ChatGPT-User Allow: / User-agent: OAI-SearchBot Allow: / User-agent: ClaudeBot Allow: / User-agent: Claude-Web Allow: / User-agent: anthropic-ai Allow: / User-agent: PerplexityBot Allow: / User-agent: Perplexity-User Allow: / User-agent: Google-Extended Allow: / User-agent: Applebot-Extended Allow: / User-agent: cohere-ai Allow: / User-agent: Cohere-User Allow: / User-agent: CCBot Allow: / User-agent: FacebookBot Allow: / User-agent: meta-externalagent Allow: / User-agent: Meta-ExternalAgent Allow: / User-agent: Bytespider Allow: / User-agent: DoubaoSpider Allow: / User-agent: KimiBot Allow: / User-agent: Amazonbot Allow: / User-agent: YouBot Allow: / User-agent: Diffbot Allow: / User-agent: ImagesiftBot Allow: / User-agent: Timpibot Allow: / # 神马搜索 mobile · 中国移动搜索 ~10% 份额 User-agent: YisouSpider Allow: / # 字节跳动头条搜索(与 Bytespider 分开记,更合规) User-agent: ToutiaoSpider Allow: / # Mistral AI 助手浏览 User-agent: MistralAI-User Allow: / # Quora Poe AI 助手 User-agent: PoeBot Allow: / # Microsoft Copilot / Edge 浏览 User-agent: SemrushBot Allow: / # ============ 默认放行 + 路径屏蔽 ============ User-agent: * Allow: / # 仅屏蔽"无 noindex meta 的私密/非内容资源"——SPA 应用壳 + 后台 + 接口(避免 Disallow 与 noindex 冲突) Disallow: /admin Disallow: /api/ Disallow: /app/ Disallow: /webapp/ Disallow: /m/ # 排序/筛选参数页防索引膨胀 Disallow: /*?sort= Disallow: /*?filter= # 注意:/chat/ /invite/ /wallet/ /profile/ /gen-studio/ /studio/ 都用 noindex meta(不 Disallow), # 让爬虫能抓到并读取 noindex 指令把它们正确移出索引;Disallow 会让 noindex 永远读不到(逻辑错误)。 # ============ 站点地图 ============ Sitemap: https://dajiai6.com/sitemap-index.xml