Question 1

Where should robots.txt live?

Accepted Answer

Always at the root of your domain: https://example.com/robots.txt. Subdomains need their own file. A robots.txt at a subdirectory path is ignored by crawlers.

Question 2

What does Disallow: / mean?

Accepted Answer

It blocks crawlers from accessing every URL on your site. This is a critical misconfiguration that can prevent Google from indexing any of your pages. Check for it immediately if you see unexpected drops in Search Console.

Question 3

Does robots.txt block indexing?

Accepted Answer

No — it blocks crawling. A URL can still appear in search results if it has inbound links, even if crawlers cannot fetch it. Use a noindex meta tag if you want to prevent indexing.

Question 4

How do I declare a sitemap in robots.txt?

Accepted Answer

Add Sitemap: https://example.com/sitemap.xml on its own line anywhere in the file. You can list multiple sitemaps. This is the most reliable way to ensure crawlers find your sitemap.

Question 5

Can I have multiple User-agent blocks?

Accepted Answer

Yes. Each block starts with one or more User-agent: lines followed by directives. Use User-agent: * for rules that apply to all crawlers, then add specific blocks like User-agent: Googlebot for search-engine-specific rules.

Question 6

What is Crawl-delay?

Accepted Answer

Crawl-delay: N asks crawlers to wait N seconds between requests. Google ignores it — use the crawl rate setting in Google Search Console instead. Bing and some others do respect it.

Robots.txt Tester

Why test robots.txt?

See what crawlers are blocked

Find declared sitemaps

Catch misconfigurations

What is robots.txt and why does it matter?

How does robots.txt work?

robots.txt vs. noindex

Common robots.txt mistakes

How does this tester work?

Robots.txt FAQ