Robots meta tags are used to control how search engines and other web crawlers interact with your web pages. Here’s a detailed overview of the specifications for these meta tags:
The meta tag is placed within the <head>
section of an HTML document. The basic syntax is:
HTML
<meta name="robots" content="value">
or for specific bots:
<meta name="googlebot" content="value">
Index / Noindex:
- index: Allows the page to be indexed by search engines. This is the default behavior if not specified.
- noindex: Prevents the page from being indexed by search engines.
Follow / Nofollow:
- follow: Allows search engines to follow the links on the page. This is the default behavior if not specified.
- nofollow: Prevents search engines from following links on the page.
Archive / Noarchive:
- archive: Allows search engines to store a cached copy of the page.
- noarchive: Prevents search engines from storing a cached copy of the page.
Snippet / Nosnippet:
- snippet: Allows search engines to show snippets (e.g., meta descriptions, titles) in search results.
- nosnippet: Prevents search engines from showing snippets.
Images / Noimages:
- images: Allows search engines to index images on the page.
- noimages: Prevents search engines from indexing images.
<meta name="robots" content="noindex, nofollow">
<meta name="robots" content="index, nofollow">
<meta name="robots" content="index, follow, noarchive">
<meta name="robots" content="nosnippet">
Meta tags can also be applied to specific search engines or bots. For example:
Googlebot:
<meta name="googlebot" content="noindex, follow">
<meta name="bingbot" content="noarchive">
robots.txt: A file located in the root directory of a site that provides directives to web crawlers about which parts of the site can or cannot be accessed. It affects the entire site or specific directories, but not individual pages within a directory.
robots meta tag: Provides more granular control and can be applied to individual pages or sections within a site.
By using these meta tags appropriately, you can better control how your site appears in search engine results and how search engines interact with your content.