Site Owners Forums - Webmaster Forums

- - What is robots.txt? (http://siteownersforums.com/showthread.php?t=61511)

seoinheritx

11-26-2012 12:01 AM

What is robots.txt?

Robots.txt is a text file used to give search engine crawlers instructions about which pages, directories or files of a website they may crawl.

titly555

11-26-2012 05:26 AM

The robots exclusion protocol (REP), or robots.txt, is a text file webmasters create to instruct robots (typically search engine robots) on how to crawl and index pages on their website.

mytimeandmoneys

11-26-2012 07:42 AM

Robots.txt is a text file you put on your site to tell search robots which pages you would like them not to visit. Robots.txt is by no means mandatory for search engines but generally search engines obey what they are asked not to do.
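
As a rough illustration of what a well-behaved robot does with those instructions, here is a minimal Python sketch using the standard library's urllib.robotparser. The rules, the /private/ path, the example.com URLs and the ExampleBot name are all made up for the example.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules for illustration; the /private/ path is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network access needed)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# A cooperating robot asks before it fetches each URL.
blocked = parser.can_fetch("ExampleBot", "http://example.com/private/report.html")
allowed = parser.can_fetch("ExampleBot", "http://example.com/index.html")
print(blocked, allowed)  # False True
```

The check happens entirely on the robot's side, which is why the file only works for robots that choose to obey it.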

IAMalfin

11-26-2012 10:02 PM

Robots.txt is a convention to prevent cooperating web crawlers and other web robots from accessing all or part of a website which is otherwise publicly viewable.

sunitisood16

11-27-2012 02:58 AM

Robots.txt files tell search engine spiders how to interact with your content when crawling it. If you do not have a robots.txt file, your server logs will record a 404 error whenever a bot requests it.
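
For what that 404 means to the robot, here is a small sketch of how a crawler might map the HTTP status of its robots.txt request to a crawl policy. It follows my reading of the convention in RFC 9309 (a 4xx answer is treated as "no restrictions", a 5xx answer as "stay out for now"). The function name and the policy labels are hypothetical, and individual crawlers may behave differently.

```python
def crawl_policy(status_code: int) -> str:
    """Map the status of a robots.txt request to a policy (hypothetical helper)."""
    if 200 <= status_code < 300:
        return "parse-rules"    # the file exists: read and obey its rules
    if 400 <= status_code < 500:
        return "allow-all"      # no file (e.g. 404): nothing is restricted
    if 500 <= status_code < 600:
        return "disallow-all"   # server trouble: assume everything is off limits
    return "undefined"          # redirects etc. are left out of this sketch


print(crawl_policy(404))  # allow-all
```

So a missing file does not block anything; it just leaves 404 entries in the log.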

outure11

11-27-2012 04:56 AM

This is a very useful post. Thank you.

john mathew

11-28-2012 05:09 AM

Thanks for sharing this information.

abhirampathak3

11-28-2012 10:53 PM

The robots.txt is a simple text file on your web site that informs search engine bots how to crawl and index the website or its pages.

arindamdutta16

11-30-2012 03:56 AM

The robots.txt is a simple text file on your web site that informs search engine bots how to crawl and index the website or its pages. It is great when search engines frequently visit your site and index your content, but often there are cases when indexing parts of your online content is not what you want.

likit

12-03-2012 05:46 AM

Robots.txt is a text file that holds instructions for search engine crawlers about crawling and indexing a webpage, domain, etc.

laceywilliams12

12-04-2012 11:07 PM

Robots.txt is a text file holding allow and disallow rules that tell crawlers which pages of a website to crawl. (Dofollow and nofollow are set on links and in meta tags, not in robots.txt.)

Eminenz Cw

12-05-2012 04:28 AM

Robots.txt is a text (not HTML) file you put on your site to tell search robots which pages you would like them not to visit. Robots.txt is by no means mandatory for search engines, but generally search engines obey what they are asked not to do. It is important to clarify that robots.txt is not a way of preventing search engines from crawling your site (i.e. it is not a firewall or a kind of password protection). Putting up a robots.txt file is something like putting a note "Please, do not enter" on an unlocked door: you cannot prevent thieves from coming in, but the good guys will not open the door and enter.
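
The unlocked-door point can be sketched in Python. Both crawlers below are hypothetical, as are the /admin/ path and the example.com URL. The only difference between them is whether they bother to consult the rules.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules; /admin/ is an assumed path, not taken from any real site.
parser = RobotFileParser()
parser.parse(["User-agent: *", "Disallow: /admin/"])


def polite_crawler_visits(url: str) -> bool:
    """A cooperating robot checks the rules and stays out when asked."""
    return parser.can_fetch("PoliteBot", url)


def rude_crawler_visits(url: str) -> bool:
    """A non-cooperating robot never reads robots.txt, so nothing stops it."""
    return True


target = "http://example.com/admin/login.html"
print(polite_crawler_visits(target))  # False
print(rude_crawler_visits(target))    # True
```

Anything that really must stay private needs protection on the server itself, such as a password, rather than a line in robots.txt.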

muzz

12-05-2012 06:24 AM

Robots.txt tells the search engine which data you want crawled and which you do not.

stephan07

12-11-2012 03:43 AM

Robots.txt is a simple plain-text file (you can write it in Notepad) that crawlers read before they crawl your website.

ronaldotson45

12-11-2012 11:53 PM

The robots.txt file tells search crawlers which information on the website is off limits and should not be crawled. If the owner of the site does not want certain files to be indexed or crawled by a robot or spider, then a robots.txt file is used.

zitoitalahold

12-13-2012 10:05 PM

Robots.txt is a text (not HTML) file that tells search robots which pages on your site should not be visited. It is important to clarify that robots.txt is not a way of preventing search engines from crawling your site.

johnstamos00000

12-21-2012 03:51 AM

The robots.txt is a file whose instructions search engines follow.

martinsherman

12-26-2012 11:48 PM

The robots.txt file is a text file that tells search engine crawlers which portions of your website they should NOT index. If you don't want to restrict search engine crawlers, you should simply create an empty robots.txt file (e.g., touch robots.txt) or one that looks like this:

User-agent: *
Disallow:

Once you have created a robots.txt file, you store it in the root directory of your Web server.
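
As a quick sanity check of both points, here is a minimal Python sketch using only the standard library. The robots_url helper, the AnyBot name and the www.example.com URLs are assumptions made for illustration. It shows that an empty file and the two-line file above both allow everything, and that the file is looked up at the root of the host no matter which page a robot starts from.

```python
from urllib.parse import urlsplit, urlunsplit
from urllib.robotparser import RobotFileParser


def robots_url(page_url: str) -> str:
    """Build the robots.txt address for the host that serves page_url."""
    parts = urlsplit(page_url)
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


def allows_everything(lines: list) -> bool:
    """Parse the given robots.txt lines and test one arbitrary URL."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser.can_fetch("AnyBot", "http://www.example.com/any/page.html")


location = robots_url("http://www.example.com/blog/post.html?id=7")
empty_file_ok = allows_everything([])                               # empty robots.txt
open_file_ok = allows_everything(["User-agent: *", "Disallow:"])    # file shown above

print(location)                     # http://www.example.com/robots.txt
print(empty_file_ok, open_file_ok)  # True True
```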

Hope this helps you!!

johnstamos00000

01-21-2013 01:50 AM

Robots.txt is a text file you put on your site to tell search robots which pages you would like them not to visit.

tionnasmith

07-24-2017 03:55 AM

The robots exclusion standard, also known as the robots exclusion protocol or simply robots.txt, is a standard used by websites to communicate with web crawlers and other web robots. The standard specifies how to inform the web robot about which areas of the website should not be processed or scanned.

manalisoni

07-24-2017 05:26 AM

Jacklincy

07-24-2017 06:13 AM

Robots.txt is a convention to prevent cooperating web crawlers and other web robots from accessing all or part of a website which is otherwise publicly viewable.

wiliamjamesh

07-25-2017 02:44 AM

Jessyk

07-26-2017 04:44 AM

Robots.txt is a simple plain-text file (you can write it in Notepad) that crawlers read before they crawl your website.

leahwilmot

07-26-2017 05:15 AM

Robots.txt is a text file that tells the search engine crawler which pages may or may not be crawled, using Allow and Disallow rules.
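
Here is a small sketch of allowing and disallowing together, again with Python's urllib.robotparser. The /images/ paths, the ExampleBot name and the example.com host are invented for the example. The Allow line is placed before the broader Disallow line on purpose: Python's parser applies the first rule that matches, while some search engines pick the most specific rule, and this ordering gives the same answer either way.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: open one subfolder, close the rest of /images/.
RULES = [
    "User-agent: *",
    "Allow: /images/public/",
    "Disallow: /images/",
]

parser = RobotFileParser()
parser.parse(RULES)


def is_allowed(path: str) -> bool:
    """Ask whether a cooperating robot may crawl the given path."""
    return parser.can_fetch("ExampleBot", "http://example.com" + path)


paths = ("/images/public/logo.png", "/images/secret.png", "/about.html")
results = {path: is_allowed(path) for path in paths}
print(results)
```

Paths that no rule mentions, such as /about.html here, stay crawlable by default.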

quickyadspro

07-26-2017 05:19 AM

vijayasinterior

07-26-2017 09:09 AM

We use this file to block crawlers from unwanted files.

Chrisdiary

07-26-2017 01:47 PM

The importance of adding a custom robots.txt to Blogger

Blogger comes with a default robots.txt file. However, Blogger allows us to customize this file to suit our needs. This is called a custom robots.txt file, and I will show you why customizing this file matters and how to add it to the Blogger server.

What is a robots.txt file?
A robots.txt file is a set of directives that tells web crawlers how to go about crawling specific pages for search engines. Before a web crawler crawls a site's pages, it first looks at this file to find out what instructions it is expected to follow.
So basically, a robots.txt file does two things:
1. It allows search engines to discover content on the web
2. It allows specific content to be indexed by search engines and served to those looking for information

A robots.txt file is usually present by default on website or blog hosts like Blogger. In this tutorial, I will explain what each line of a robots.txt file represents and how to customize it. I will use Blogger's default robots file to illustrate the working principles of robots.txt.
The code is as shown below:

http://www.chrisdiary.com/2017/07/th...ng-custom.html

nathanleo

09-19-2022 06:29 AM

The robots exclusion standard, also known as the robots exclusion protocol or simply robots.txt, is a standard used by websites to communicate with web crawlers and other web robots. Robots are often used by search engines to categorize websites.

AzharSEO

09-21-2022 11:57 PM

Robots.txt is the file that tells the crawler which pages of the website to crawl and which pages not to crawl.

All times are GMT -7.