![]() |
#16 |
Registered User
Join Date: Feb 2012
Posts: 225
|
hi,I am reading this article and thanks for sharing this information for about forum posting,
|
![]() |
![]() |
![]() |
#17 | |
Registered User
Join Date: Oct 2011
Location: Ahmedabad
Posts: 93
|
Quote:
|
|
![]() |
![]() |
![]() |
#18 |
Registered User
Join Date: Jul 2012
Posts: 55
|
Thanks , Great post information!
|
![]() |
![]() |
![]() |
#19 |
Registered User
Join Date: Aug 2012
Posts: 69
|
Hello ,
It's a text file which instructs search engine spiders or crawlers on what to do. It tells specific web spiders on which specific web pages to index. |
![]() |
![]() |
![]() |
#20 |
Registered User
Join Date: Feb 2012
Posts: 92
|
A robots.txt file is a simple txt file. robots file on a website wills utility as a appeal that specified robots discount specified files or directories when crawling a site.
|
![]() |
![]() |
![]() |
#21 |
Registered User
Join Date: Aug 2012
Posts: 7
|
Robots.txt is a very useful text file to be uploaded on root directory of your site so as to disallow crawling our mentioned url's in robots.txt as not to be displayed to users out there.
Thanks |
![]() |
![]() |
![]() |
#22 |
Registered User
Join Date: Sep 2012
Posts: 10
|
The Software Exemption Conventional, also known as the Spiders Exemption Method or robots.txt protocol, is a meeting to avoid participating web spiders and other web robots from opening all or part of a web page which is otherwise openly readable. Spiders are often used by google to classify and store web websites, or by web page owners to check resource value.
__________________
iPhone application development |
![]() |
![]() |
![]() |
#23 |
Registered User
Join Date: Aug 2012
Posts: 117
|
Robots.txt is a text file that you can put on your site to tell search robots which page you like them not to visit. Robots.txt is by no means mandatory for search engines but search engines obey what they are asked not to do. The location of robots.txt is very important as it must to be in main directory.
|
![]() |
![]() |
![]() |
#24 |
Registered User
Join Date: Sep 2012
Location: Mumbai, India
Posts: 10
|
A robots.txt is a permissions file that can be used to control which webpages of a website a search engine indexes. The file must be located in the root directory of the website for a search engine website-indexing program (spider) to reference
|
![]() |
![]() |
![]() |
#25 |
Registered User
Join Date: Aug 2012
Posts: 13
|
Robot.txt means to tell search engine of which pages you want to crawl or Not.
__________________
Quality Data Scraping from 3iDataScraping.com |
![]() |
![]() |
![]() |
#26 |
Registered User
Join Date: Sep 2012
Posts: 13
|
Robots.txt is a text (not html) file you put on your site to tell search robots which pages you would like them not to visit. Robots.txt is by no means mandatory for search engines but generally search engines obey what they are asked not to do.
Structure of a Robots.txt File : The structure of a robots.txt is pretty simple (and barely flexible) � it is an endless list of user agents and disallowed files and directories. Basically, the syntax is as follows: User-agent: Disallow: �User-agent� are search engines' crawlers and disallow: lists the files and directories to be excluded from indexing. In addition to �user-agent:� and �disallow:� entries, you can include comment lines � just put the # sign at the beginning of the line: # All user agents are disallowed to see the /temp directory. User-agent: * Disallow: /temp/ |
![]() |
![]() |
![]() |
#27 |
Registered User
Join Date: Feb 2012
Posts: 225
|
Robot.txt tells to Google that which page should be crawl in the website.
|
![]() |
![]() |
![]() |
Currently Active Users Viewing This Thread: 1 (0 members and 1 guests) | |
|
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
issue in robots.txt file | davikerkrish | Search Engine Optimization | 0 | 07-20-2012 12:40 AM |
Unable to delete robots.txt file | notfake | Yahoo | 0 | 04-29-2012 02:19 AM |
Don�t exceed maximum file size for robots.txt file | ClaudiaSchayffe | Search Engine Optimization | 3 | 04-02-2012 04:28 AM |
What Is Robots.txt? | samlko | Search Engine Optimization | 13 | 03-09-2012 02:37 PM |
sitemap.xml and robots.txt? | jamesranatte | Search Engine Optimization | 5 | 01-31-2012 11:16 PM |