Site Owners Forums - Webmaster Forums

Site Owners Forums - Webmaster Forums (http://siteownersforums.com/index.php)
-   Social Networks (http://siteownersforums.com/forumdisplay.php?f=43)
-   -   What is robot.txt file and what are the benefits of using it? (http://siteownersforums.com/showthread.php?t=165867)

Martinricky 03-05-2016 01:39 AM

What is robot.txt file and what are the benefits of using it?
 
Hello,

What is robot.txt file and what are the benefits of using it?

Thank you

aditya12 03-05-2016 04:23 AM

Robot.txt file is used to provide instruction to the search engine crawler about the webpages that which page to be crawled and which not.

Best schools in Chandigarh Best schools in Mohali Best schools in Panchkula Best schools in Tricity Colleges in Chandigarh Colleges in Mohali Colleges in Tricity Colleges in Panchkula

tenso500 03-05-2016 04:52 AM

The file tells the search engine crawlers which parts to web and which parts to leave alone in a website, differing between what is viewable to the public and what is viewable to the creators of the website alone.

Admyrin 03-05-2016 05:17 AM

Robot.txt file is used to provide instruction to the search engine crawler about the webpages that which page to be crawled and which not.

langtu1292 03-24-2016 06:46 AM

you can search on google..many thing for you

josechukkiri 03-24-2016 07:14 AM

The explanation about robot.txt files are satisfactory to me and thanks to the concerned.

sudeepkhana 03-25-2016 05:57 AM

Robot.txt is a text file kept on the web servers. The robot.txt allows the site admin to permit/allow which Search engines(like google, yahoo,bing) can crawl the site. Some times the admins won't allow any of the search engines to crawl at all.

Devin Mataka 03-25-2016 06:06 AM

The robots.txt file is a simple text file (no html) that is placed in your website’s root directory in order to tell the search engines which pages to index and which to skip. Many webmasters utilize this file to help the search engines index the content of their websites.
If webmasters can tell the search engine spiders to skip pages that they do not consider important enough to be crawled (eg. printable versions of pages, .pdf files etc.), then they have a better opportunity to have their most valuable pages featured in the search engine results pages.The robots.txt file is a simple method of essentially easing the process for the spiders to return the most relevant search results.
Another benefit of having a robots.txt is that you can specify the location of the Google .xml or Yahoo sitemap. This also increases spiderability for the search engines.

rahul3214 04-07-2016 09:42 PM

Robots.txt file is important for seo, search engine spiders first see that where your robots.txt file because they do not know which file crawl or which not. In robots.txt file google know which file crawl.

karanjain 04-08-2016 12:27 PM

nice and awesome
IPL 9 - Indian Premier League 2016
IPL Auction 2016 Live Results Shane Watson And Yuvraj Sold On Highest Bidding

albertocosta 04-09-2016 04:57 AM

The robots.txt is a text file webmasters create to instruct robots (typically search engine robots) how to crawl and index pages on their website.

brknny 04-10-2016 11:27 PM

Robots.txt file is a file through which we give instructions to the search engines whether to crawl or not the particular web page.

Anubhav-soin 04-11-2016 12:02 AM

Its a file that is used by search engines to find out that which pages in your website you want to crawled.

Note : do not edit the robot.txt yourself if you do not know how to modify it otherwise you good pages might get blocked from search engine.

You can also use site:www.yousite.com in google to find what are the pages that are currently been crawled from your website.

williamstinner 04-11-2016 10:51 PM

Robots.txt file is used to give instruction to the search engine crawler to which page has to crawled or not by allowing or disallowing.

jasonroy21 11-24-2016 11:27 PM

The file tells the search engine crawlers which parts to web and which parts to leave alone in a website, differing between what is viewable to the public and what is viewable to the creators of the website alone.

henrysmithh71 11-28-2016 11:26 PM

Robots.txt file is used to give instruction to the search engine crawler to which page has to crawled or not by allowing or disallowing.

georgewills 11-28-2016 11:40 PM

The file tells the search engine crawlers which parts to web and which parts to leave alone in a website, differing between what is viewable to the public and what is viewable to the creators of the website alone.

stuartsmithh5 11-28-2016 11:45 PM

Robots.txt file is used to give instruction to the search engine crawler to which page has to crawled or not by allowing or disallowing.

pattroderick 11-28-2016 11:51 PM

Robots.txt file is for giving instruction to the crawlers indicates which one to crawl and which are don't crawl.

Dentcare 11-29-2016 04:10 AM

Robot.txt used to provide guidance to the search engine crawler about the web pages that which page to be crawled and which is not.

luffy268 12-14-2016 12:43 AM

The location of robots.txt is very important. It must be in the main directory because otherwise user agents (search engines) will not be able to find it – they do not search the whole site for a file named robots.txt. Instead, they look first in the main directory and if they don't find it there, they simply assume that this site does not have a robots.txt file and therefore they index everything they find along the way. So, if you don't put robots.txt in the right place, do not be surprised that search engines index your whole site.

ChrisRogers123 12-14-2016 02:44 AM

It is a type of text file which allows search engine to what to crawl or what to not

Pandith Raghave 12-14-2016 03:13 AM

Robots.txt is document you put on your site to tell look robots which pages you might want them Do and Don't to visit.

jeffmccoy 12-14-2016 04:14 AM

Robots.txt file is a file through which we give instructions to the search engines whether to crawl or not the particular web page.

stuartsmithh5 12-15-2016 12:54 AM

Robot.txt file is used to provide instruction to the search engine crawler about the webpages that which page to be crawled and which are not to be crawled.

Markco92 12-15-2016 03:35 AM

Robot.txt file is used to provide instruction to the search engine crawler about the webpages that which page to be crawled and which not.

FerinKings 12-16-2016 12:23 AM

robots.txt is a text file webmasters create to instruct robots (typically search engine robots) how to crawl and index pages on their website.

wiliamjamesh 12-17-2016 01:49 AM

The purpose of a robots.txt file is to tell search engines not to crawl the contents of specific directories or even individual pages within a website. ... However there are a couple of reasons why all websites would benefit from having one. The first thing search engines look for is the robots.txt file.

georgewills 12-17-2016 05:22 AM

Robots.txt file is used to give instruction to the search engine crawler to which page has to crawled or not by allowing or disallowing.

LukeBarter 12-19-2016 01:44 AM

It is a text file webmasters create to instruct robots (typically search engine robots) how to crawl and index pages on their website.

luffy268 12-19-2016 03:11 AM

Robots.txt is a text file which has few lines and this file interacts with all kind of crawlers or spiders like Googlebot, which is Google search engine's spider.

Search Engine always want to index good and fresh contain on the web and these robot.txt instruct the web crawlers to how to index and crawl your blog in the search results.

Remember,Crawler or spider will firstly look at your robots.txt files to obey the rules you have instructed It's mean you can restrict any web page on your blog from web crawlers so that it can’t get indexed in search engines like your blog your demo page, labels page, Achieve or any other pages that are not as important to get indexed.

Nandu41 12-21-2016 02:46 AM

Robots.txt is a text file which has few lines and this file interacts with all kind of crawlers or spiders like Googlebot, which is Google search engine's spider.

Search Engine always want to index good and fresh contain on the web and these robot.txt instruct the web crawlers to how to index and crawl your blog in the search results.


All times are GMT -7. The time now is 09:38 PM.


Powered by vBulletin Copyright © 2020 vBulletin Solutions, Inc.