Robots.txt

Use this page to compare robots.txt evidence across captured sites.

Robots.txt is a crawler policy surface: a file that tells crawlers which paths are allowed and which user agents the rules apply to.
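As a rough illustration of how path and user-agent rules are evaluated, the sketch below feeds a small robots.txt body to Python's standard-library `urllib.robotparser`. The sample rules, the `ExampleBot` and `OtherBot` names, and the `example.com` URLs are made up for the example and are not taken from any captured site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt body, invented for illustration only.
SAMPLE = """\
User-agent: ExampleBot
Disallow: /private/

User-agent: *
Allow: /
"""

parser = RobotFileParser()
# parse() accepts an iterable of lines, so no network request is made.
parser.parse(SAMPLE.splitlines())

# ExampleBot is blocked from /private/ but allowed elsewhere.
print(parser.can_fetch("ExampleBot", "https://example.com/private/page"))  # False
print(parser.can_fetch("ExampleBot", "https://example.com/docs/"))         # True

# Agents without their own group fall back to the "*" group.
print(parser.can_fetch("OtherBot", "https://example.com/private/page"))    # True
```

Parsing from a string keeps the example offline; against a live site, the same parser can be pointed at a URL with `set_url()` and `read()`.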

A matching status is a lead, not proof; content type, body shape, redirects, and truncation still matter.
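One way to act on this is to run each capture through a few extra checks before trusting the status code. The following is a minimal sketch under stated assumptions: the `Capture` record, its field names, and the `robots_concerns` helper are hypothetical and do not describe how this page collects its evidence.

```python
import re
from dataclasses import dataclass
from typing import List


@dataclass
class Capture:
    """Hypothetical record of one captured response (field names are assumptions)."""
    requested_url: str
    final_url: str
    status: int
    content_type: str
    body: str
    truncated: bool = False


# Lines that look like robots.txt directives.
DIRECTIVE = re.compile(
    r"^\s*(user-agent|allow|disallow|sitemap|crawl-delay)\s*:", re.IGNORECASE
)


def robots_concerns(capture: Capture) -> List[str]:
    """Return reasons to doubt that a capture is a real robots.txt file."""
    concerns = []
    if capture.status != 200:
        concerns.append(f"status {capture.status}")
    # Content type: robots.txt is normally served as text/plain.
    if not capture.content_type.lower().startswith("text/plain"):
        concerns.append(f"unexpected content type {capture.content_type!r}")
    # Body shape: expect at least one directive line.
    if not any(DIRECTIVE.match(line) for line in capture.body.splitlines()):
        concerns.append("no robots.txt directives in body")
    # Redirects: the final URL should match the one requested.
    if capture.final_url != capture.requested_url:
        concerns.append(f"redirected to {capture.final_url}")
    # Truncation: a cut-off body may be missing rules.
    if capture.truncated:
        concerns.append("body truncated")
    return concerns
```

An empty list means none of these checks raised a concern. A 200 response that returns an HTML page would be flagged for both content type and body shape, which is the case the status code alone cannot distinguish.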

| Site | Host | Matching evidence |
| --- | --- | --- |
| Cloudflare Developers | developers.cloudflare.com | robots.txt 200 |
| Perplexity Docs | docs.perplexity.ai | robots.txt 200 |
| Model Context Protocol | modelcontextprotocol.io | robots.txt 200 |
| Claude Platform | platform.claude.com | robots.txt 200 |
| Vercel | vercel.com | docs robots 200 |
| OpenAI API Docs | developers.openai.com | docs robots.txt 404 |
| GitHub Docs | docs.github.com | robots.txt 200 |
| Stripe Docs | docs.stripe.com | robots.txt 200 |
| LangChain Docs | docs.langchain.com | robots.txt 200 |
| Cloudflare Root | www.cloudflare.com | robots.txt 200 |
| Google Developers | developers.google.com | robots.txt 200 |
| Google AI | ai.google.dev | robots.txt 200 |
| OpenAI Root | openai.com | robots.txt 200 |
| Anthropic Root | www.anthropic.com | robots.txt 200 |
| Perplexity Root | www.perplexity.ai | robots.txt 200 |
| Supabase Docs | supabase.com | docs robots.txt 200 |
| LlamaIndex Docs | docs.llamaindex.ai | robots.txt 200 |
| Cursor Docs | docs.cursor.com | robots.txt 200 |