![]() |
What is robot.txt
|
The robots exclusion protocol (REP), or robots.txt is a text file webmasters create to instruct robots (typically search engine robots) how to crawl and index pages on their website.
Buy Flavored condoms online Online shop for condoms |
if you want disallow your pages, who does not cache or index in google then you use robots.txt . Google web masters create robots how to crawl and index pages on their website.
|
Robots.txt is a text file that contain instructions for search engine crawlers. They list webpages that need to be crawled and disallowed webpages for search engine bots.
|
Robots.TXT is a text file which you place on your site to tell Google bot which pages you don't want to crawl. Example:
User-Agent: * Disallow: /s/ Disallow: /cgi-bin/ Disallow: /myadmin/ Disallow: /admincp/ |
Using robots.txt you can restrict Google and other search engine spiders from crawling your website.
|
Thanks to all
|
Robots.txt is a text (not html) file you put on your site to tell search robots which pages you would like them not to visit. Robots.txt is by no means mandatory for search engines but generally search engines obey what they are asked not to do. It is important to clarify that robots.txt is not a way from preventing search engines from crawling your site (i.e. it is not a firewall, or a kind of password protection) and the fact that you put a robots.txt file is something like putting a note “Please, do not enter” on an unlocked door – e.g. you cannot prevent thieves from coming in but the good guys will not open to door and enter. That is why we say that if you have really sen sitive data, it is too naïve to rely on robots.txt to protect it from being indexed and displayed in search results.
|
Great Article.
|
All times are GMT -7. The time now is 09:24 PM. |
Powered by vBulletin Copyright © 2020 vBulletin Solutions, Inc.