Robots.txt: advanced configuration for search engines

December 20, 2025

What is robots.txt?

robots.txt is a file that tells crawlers which URLs they can or cannot crawl on your website.

User-agent specifies which bot a group of rules applies to. * applies to all bots; use Googlebot for Google and Bingbot for Bing.
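As a minimal sketch of how User-agent groups are selected, the snippet below feeds a hypothetical robots.txt to Python's standard `urllib.robotparser`. The file contents, the `/drafts/` path, the `example.com` URLs, and the bot name `SomeOtherBot` are illustrative assumptions, not rules from a real site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: one group for Googlebot, one for every other bot.
ROBOTS_TXT = """\
User-agent: Googlebot
Disallow: /drafts/

User-agent: *
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Googlebot matches its own group, so only /drafts/ is off limits for it.
googlebot_blog = parser.can_fetch("Googlebot", "https://example.com/blog/")
googlebot_drafts = parser.can_fetch("Googlebot", "https://example.com/drafts/post")

# Any other bot falls back to the * group, which blocks the whole site.
other_blog = parser.can_fetch("SomeOtherBot", "https://example.com/blog/")

print(googlebot_blog, googlebot_drafts, other_blog)  # True False False
```

A bot that finds a group naming it uses that group and ignores the * group, which is why Googlebot can still reach `/blog/` here.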

Disallow indicates which paths should not be crawled. Use / to block the whole site, or a specific path such as /admin/.

Allow permits crawling of a specific path inside a blocked one. It is useful for CSS or images under /admin/.
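The interaction between Disallow and Allow can be checked with a short sketch using Python's standard `urllib.robotparser`. The `/admin/assets/` path, the `example.com` URLs, and the bot name `AnyBot` are assumptions made for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: block /admin/ but keep its static assets crawlable.
# Allow is listed first because Python's parser applies the first matching
# rule, while Google applies the most specific (longest) matching rule.
# With this ordering both interpretations give the same answer.
ROBOTS_TXT = """\
User-agent: *
Allow: /admin/assets/
Disallow: /admin/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

css_allowed = parser.can_fetch("AnyBot", "https://example.com/admin/assets/site.css")
users_allowed = parser.can_fetch("AnyBot", "https://example.com/admin/users")
shop_allowed = parser.can_fetch("AnyBot", "https://example.com/products/")

print(css_allowed, users_allowed, shop_allowed)  # True False True
```

Paths that match no rule at all, such as `/products/` here, are crawlable by default.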

Sitemap indicates the location of your XML sitemap as an absolute URL. A robots.txt file can include multiple Sitemap lines.
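To illustrate how Sitemap lines are read, here is a small sketch that lists them with `RobotFileParser.site_maps()` from Python's standard library (available since Python 3.8). The two sitemap URLs are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt declaring two sitemaps. Sitemap lines sit outside
# the User-agent groups and apply to the file as a whole.
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin/

Sitemap: https://example.com/sitemap-pages.xml
Sitemap: https://example.com/sitemap-posts.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns a list of URLs, or None when no Sitemap line exists.
sitemaps = parser.site_maps() or []

for url in sitemaps:
    print(url)
```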

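Crawl-delay recommends an interval between requests, typically expressed in seconds. It is useful for servers with limited resources, but it is not a standard directive: Googlebot ignores it, while crawlers such as Bingbot honor it.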
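From a crawler's point of view, Crawl-delay might be read as in the sketch below, which uses `RobotFileParser.crawl_delay()` from Python's standard library. The 10-second value, the `delay_for` helper, and its 1-second fallback are assumptions chosen for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: ask Bingbot to slow down, leave other bots unthrottled.
ROBOTS_TXT = """\
User-agent: Bingbot
Crawl-delay: 10

User-agent: *
Disallow: /private/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def delay_for(agent: str, default: float = 1.0) -> float:
    """Return the delay requested for `agent`, or `default` if none is set."""
    delay = parser.crawl_delay(agent)  # None when the group has no Crawl-delay
    return float(delay) if delay is not None else default


bing_delay = delay_for("Bingbot")
other_delay = delay_for("SomeOtherBot")

print(bing_delay, other_delay)  # 10.0 1.0
```

A polite crawler would then pause for that many seconds between requests, for example with `time.sleep(delay_for(agent))`.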

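Wildcards: $ marks the end of a URL and * matches any sequence of characters. Example: Disallow: /*?print=true$ blocks URLs that end in ?print=true, such as print versions of pages.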
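Python's standard `urllib.robotparser` does not interpret the * and $ wildcards, so the sketch below translates a robots.txt path pattern into a regular expression to show how the matching works. The helper names `robots_pattern_to_regex` and `matches` and the sample paths are hypothetical; this is a simplified illustration, not any search engine's actual matcher.

```python
import re


def robots_pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with ".*" so that
    # every * in the pattern matches any sequence of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    # A trailing $ pins the match to the end of the URL; without it the
    # pattern only has to match a prefix.
    return re.compile(body + (r"\Z" if anchored else ""))


def matches(pattern: str, url_path: str) -> bool:
    """Return True if the pattern matches from the start of url_path."""
    return robots_pattern_to_regex(pattern).match(url_path) is not None


print(matches("/*?print=true$", "/article/42?print=true"))         # True
print(matches("/*?print=true$", "/article/42?print=true&page=2"))  # False
print(matches("/*?print=true", "/article/42?print=true&page=2"))   # True
```

The second and third calls show what the $ changes: with it, the URL must end right after `?print=true`; without it, anything may follow.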

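Common mistakes: blocking CSS or JS (which hurts Google's rendering of the page), using Disallow when you mean noindex (Disallow stops crawling, not indexing, and Google does not support a noindex rule in robots.txt), and having contradictory rules.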

robots.txt is a powerful tool for managing crawl budget. At Vynta we audit and optimize robots.txt to maximize Google crawling efficiency.
