# ============================================================================ # LeanBiome Official® — robots.txt (2026 Maximum SEO/AEO/GEO Optimization) # Site: https://leanbiome-weightloss-supplement-healthy-gut.lovable.app # Last Updated: 2026-04-21 # Purpose: Maximum crawl efficiency, full indexation, AEO + AI engine access # ============================================================================ # ---------- Sitemaps (primary discovery for all crawlers) ---------- Sitemap: https://leanbiome-weightloss-supplement-healthy-gut.lovable.app/sitemap.xml Sitemap: https://leanbiome-weightloss-supplement-healthy-gut.lovable.app/sitemap-index.xml # ---------- Canonical Host ---------- Host: https://leanbiome-weightloss-supplement-healthy-gut.lovable.app # ---------- Priority public routes ---------- Allow: / Allow: /reviews Allow: /backlinks Allow: /sitemap.xml Allow: /sitemap-index.xml Allow: /robots.txt # ============================================================================ # TIER 1 — Primary Search Engines (Highest Priority, Zero Crawl Delay) # ============================================================================ User-agent: Googlebot Allow: / Disallow: Crawl-delay: 0 User-agent: Googlebot-Image Allow: / Disallow: User-agent: Googlebot-Video Allow: / Disallow: User-agent: Googlebot-News Allow: / Disallow: User-agent: Googlebot-Mobile Allow: / Disallow: User-agent: AdsBot-Google Allow: / Disallow: User-agent: AdsBot-Google-Mobile Allow: / Disallow: User-agent: Mediapartners-Google Allow: / Disallow: User-agent: Storebot-Google Allow: / Disallow: User-agent: Bingbot Allow: / Disallow: Crawl-delay: 0 User-agent: BingPreview Allow: / Disallow: User-agent: msnbot Allow: / Disallow: User-agent: msnbot-media Allow: / Disallow: User-agent: adidxbot Allow: / Disallow: # ============================================================================ # TIER 2 — Major Global Search Engines # ============================================================================ User-agent: Slurp Allow: / Crawl-delay: 1 User-agent: DuckDuckBot Allow: / User-agent: DuckDuckGo-Favicons-Bot Allow: / User-agent: Baiduspider Allow: / Crawl-delay: 1 User-agent: Baiduspider-image Allow: / User-agent: Baiduspider-news Allow: / User-agent: Baiduspider-video Allow: / User-agent: YandexBot Allow: / Crawl-delay: 1 User-agent: YandexImages Allow: / User-agent: YandexMobileBot Allow: / User-agent: YandexMedia Allow: / User-agent: Sogou Allow: / Crawl-delay: 1 User-agent: Sogou web spider Allow: / Crawl-delay: 1 User-agent: Exabot Allow: / User-agent: Naverbot Allow: / User-agent: Yeti Allow: / User-agent: SeznamBot Allow: / User-agent: MojeekBot Allow: / User-agent: Qwantify Allow: / User-agent: PetalBot Allow: / Crawl-delay: 1 # ============================================================================ # TIER 3 — Apple, Mobile & OS Crawlers # ============================================================================ User-agent: Applebot Allow: / User-agent: Applebot-Extended Allow: / # ============================================================================ # TIER 4 — AI / LLM / Generative Search Engines (AEO + GEO Optimization) # Allow ALL AI crawlers for maximum Answer Engine visibility in 2026 # ============================================================================ User-agent: GPTBot Allow: / User-agent: ChatGPT-User Allow: / User-agent: OAI-SearchBot Allow: / User-agent: ClaudeBot Allow: / User-agent: Claude-Web Allow: / User-agent: anthropic-ai Allow: / User-agent: Google-Extended Allow: / User-agent: PerplexityBot Allow: / User-agent: Perplexity-User Allow: / User-agent: YouBot Allow: / User-agent: Amazonbot Allow: / User-agent: Bytespider Allow: / User-agent: ByteDance Allow: / User-agent: Meta-ExternalAgent Allow: / User-agent: Meta-ExternalFetcher Allow: / User-agent: FacebookBot Allow: / User-agent: cohere-ai Allow: / User-agent: cohere-training-data-crawler Allow: / User-agent: Diffbot Allow: / User-agent: ImagesiftBot Allow: / User-agent: Omgilibot Allow: / User-agent: Omgili Allow: / User-agent: CCBot Allow: / User-agent: Timpibot Allow: / User-agent: Webzio-Extended Allow: / User-agent: Kagibot Allow: / User-agent: Mistralai-User Allow: / User-agent: AwarioRssBot Allow: / User-agent: AwarioSmartBot Allow: / # ============================================================================ # TIER 5 — Social Media Crawlers (rich previews + social SEO) # ============================================================================ User-agent: Twitterbot Allow: / User-agent: facebookexternalhit Allow: / User-agent: Facebot Allow: / User-agent: LinkedInBot Allow: / User-agent: Pinterestbot Allow: / User-agent: Pinterest Allow: / User-agent: WhatsApp Allow: / User-agent: TelegramBot Allow: / User-agent: Discordbot Allow: / User-agent: SkypeUriPreview Allow: / User-agent: redditbot Allow: / User-agent: TikTokSpider Allow: / User-agent: SnapchatBot Allow: / User-agent: Slackbot Allow: / User-agent: Slackbot-LinkExpanding Allow: / # ============================================================================ # TIER 6 — SEO / Marketing Analytics Crawlers (rate-limited) # ============================================================================ User-agent: AhrefsBot Allow: / Crawl-delay: 2 User-agent: AhrefsSiteAudit Allow: / Crawl-delay: 2 User-agent: SemrushBot Allow: / Crawl-delay: 2 User-agent: SemrushBot-SA Allow: / Crawl-delay: 2 User-agent: MJ12bot Allow: / Crawl-delay: 2 User-agent: DotBot Allow: / Crawl-delay: 2 User-agent: rogerbot Allow: / Crawl-delay: 2 User-agent: SiteAuditBot Allow: / Crawl-delay: 2 User-agent: Screaming Frog SEO Spider Allow: / Crawl-delay: 2 User-agent: BLEXBot Allow: / Crawl-delay: 2 User-agent: SerpstatBot Allow: / Crawl-delay: 2 User-agent: DataForSeoBot Allow: / Crawl-delay: 2 # ============================================================================ # TIER 7 — Archive & Research Crawlers # ============================================================================ User-agent: ia_archiver Allow: / User-agent: archive.org_bot Allow: / User-agent: Wayback Allow: / # ============================================================================ # Universal / Catch-All # ============================================================================ User-agent: * Allow: / Disallow: Crawl-delay: 1 # ============================================================================ # End of robots.txt — Maximum 2026 SEO + AEO + GEO crawl coverage enabled # ============================================================================