Question 1

What is a robots.txt file?

Accepted Answer

A robots.txt file tells search engine crawlers which pages or directories on your website they are allowed or not allowed to crawl. It sits at the root of your domain (e.g. https://example.com/robots.txt) and is checked by crawlers before they index your site.

Question 2

What does Disallow: (empty) mean?

Accepted Answer

An empty Disallow: directive — written as "Disallow:" with nothing after the colon — means "allow everything". It is commonly added after a User-agent line as a way to explicitly allow all bots. Omitting Disallow entirely has the same effect.

Question 3

How do I block all search engines from my site?

Accepted Answer

Add a User-agent block for * (all bots) with "Disallow: /". This tells every crawler not to crawl any page. Note: robots.txt is an advisory standard — malicious bots may ignore it.

Question 4

What is Crawl-delay?

Accepted Answer

Crawl-delay tells a bot how many seconds to wait between requests to your server. For example, "Crawl-delay: 10" means the bot should wait 10 seconds between each request. Note: Googlebot ignores Crawl-delay — use Google Search Console to control Googlebot's crawl rate instead.

Question 5

Does robots.txt prevent pages from being indexed?

Accepted Answer

No. Blocking a page in robots.txt prevents it from being crawled, but if another site links to that page, Google may still index the URL without visiting it. To prevent indexing, use a "noindex" meta robots tag on the page itself.

Question 6

Can I have multiple User-agent blocks?

Accepted Answer

Yes. You can have as many User-agent blocks as you need. Each block applies only to the specified bot. Start with a "*" block for all bots, then add specific blocks for bots that need different rules.

Directive	Example	Meaning
`User-agent`	`User-agent: *`	Which bot this block applies to
`Disallow`	`Disallow: /admin/`	Paths the bot must not crawl
`Allow`	`Allow: /admin/public/`	Exceptions within a Disallow
`Crawl-delay`	`Crawl-delay: 10`	Seconds to wait between requests
`Sitemap`	`Sitemap: https://…/sitemap.xml`	Location of your XML sitemap

robots.txt Generator

What is a robots.txt Generator?

How to Use This Tool

robots.txt Syntax

Common Mistakes

Empty Disallow

Paths Without a Leading Slash

Expecting robots.txt to Hide Pages

Googlebot Ignoring Crawl-delay

Example: Standard Website

Example: Block All Bots

Example: Block AI Training Bots

Privacy

FAQ