What is a robots.txt Generator?
A robots.txt generator creates the robots.txt file that tells search engine crawlers which pages on your website they can and cannot access. The file lives at the root of your domain — https://example.com/robots.txt — and is one of the first things any crawler checks before indexing your site.
How to Use This Tool
- Add a User-agent block — click “Add Block” and choose a bot from the dropdown (or type a custom name). Start with
*to set rules for all bots. - Add Disallow paths — enter paths you want to block (e.g.
/admin/,/private/). Paths must start with/. - Add Allow paths — use these to explicitly allow subpaths within a blocked directory (e.g. allow
/private/public/inside a blocked/private/). - Set Crawl-delay (optional) — enter the number of seconds the bot should wait between requests. Note: Googlebot ignores this; use Google Search Console to control its crawl rate.
- Enter your Sitemap URL — helps crawlers find your XML sitemap (e.g.
https://example.com/sitemap.xml). - Copy or Download — click “Copy” to copy the generated file, or “Download” to save it as
robots.txt.
robots.txt Syntax
| Directive | Example | Meaning |
|---|---|---|
User-agent | User-agent: * | Which bot this block applies to |
Disallow | Disallow: /admin/ | Paths the bot must not crawl |
Allow | Allow: /admin/public/ | Exceptions within a Disallow |
Crawl-delay | Crawl-delay: 10 | Seconds to wait between requests |
Sitemap | Sitemap: https://…/sitemap.xml | Location of your XML sitemap |
Common Mistakes
Empty Disallow
Disallow: (with no value) means allow everything — the opposite of what most people intend. To block all crawling, use Disallow: /.
Paths Without a Leading Slash
All paths in Disallow and Allow must start with /. Disallow: admin/ is invalid; the correct form is Disallow: /admin/.
Expecting robots.txt to Hide Pages
robots.txt prevents crawling, not indexing. If another site links to a blocked URL, Google may still show that URL in search results without visiting it. Use <meta name="robots" content="noindex"> on the page itself to prevent indexing.
Googlebot Ignoring Crawl-delay
Googlebot does not respect Crawl-delay. To slow down Google’s crawl rate, use the crawl rate setting in Google Search Console.
Example: Standard Website
User-agent: *
Disallow: /admin/
Disallow: /checkout/
Allow: /admin/public/
Sitemap: https://example.com/sitemap.xml
Example: Block All Bots
User-agent: *
Disallow: /
Example: Block AI Training Bots
User-agent: *
Disallow: /
User-agent: Googlebot
Disallow: /admin/
User-agent: GPTBot
Disallow: /
User-agent: Claude-Web
Disallow: /
Privacy
Everything runs in your browser. No paths, URLs, or configuration data you enter are ever sent to a server.