Site Owners Forums - Webmaster Forums

asad111720 02-24-2013 02:18 AM

What is robot text
 
Do you know about robots.txt? Please explain and discuss it.

champion 02-24-2013 10:20 PM

Robots.txt is a file through which you can guide search engines to crawl or not to crawl certain sections of your website. It is used to keep crawlers away from certain pages.

danish00 02-25-2013 12:19 AM

With the help of a robots.txt file, you tell search engines which of your webpages to crawl and which not to crawl.

jaysh4922 02-25-2013 03:18 AM

Robots.txt is an on-page SEO technique, and it is basically used to give instructions to web robots, also known as web wanderers, crawlers or spiders. A robot is a program that traverses websites automatically, and this helps popular search engines like Google to index a website and its content.

Just create a file named robots.txt, put it in the root directory of your website (where index.htm/index.php is), edit it with a text editor and add these lines:

User-agent: *
Disallow: /something/
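If you would rather generate the file than type it by hand, here is a minimal Python sketch. The helper names render_robots_txt and write_robots_txt are made up for this example, /something/ is only a placeholder path, and the temporary directory stands in for your real site root.

```python
import tempfile
from pathlib import Path
from typing import Mapping, Sequence


def render_robots_txt(rules: Mapping[str, Sequence[str]]) -> str:
    """Build robots.txt text from {user agent: [disallowed paths]}."""
    records = []
    for agent, paths in rules.items():
        lines = [f"User-agent: {agent}"]
        # An empty path list becomes a bare "Disallow:", which blocks nothing.
        lines += [f"Disallow: {path}".rstrip() for path in (paths or [""])]
        records.append("\n".join(lines))
    # Records are separated by one blank line.
    return "\n\n".join(records) + "\n"


def write_robots_txt(site_root: Path, rules: Mapping[str, Sequence[str]]) -> Path:
    """Write robots.txt directly into the given site root directory."""
    target = site_root / "robots.txt"
    target.write_text(render_robots_txt(rules), encoding="utf-8")
    return target


# Hypothetical usage: a temporary directory plays the role of the site root.
with tempfile.TemporaryDirectory() as tmp:
    written = write_robots_txt(Path(tmp), {"*": ["/something/"]})
    print(written.read_text(encoding="utf-8"))
```

The file has to end up next to index.htm/index.php, so on a real site you would pass the actual document root instead of the temporary directory.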

roger4321 02-25-2013 05:31 AM

Robots.txt is a text file which instructs search engines which pages should be crawled.

Zora2012 02-25-2013 05:33 AM

Robots.txt is the text file that is mostly used to instruct search engines which pages should be crawled and which shouldn't be crawled.

SamMarley 02-25-2013 06:10 AM

Robots.txt is a file that you can add to your website which instructs the search engine crawler which pages to crawl. You can also instruct the crawlers not to crawl some pages through robots.txt.

david0019 02-25-2013 06:23 AM

Robots.txt is a text (not html) file you put on your site to tell search robots which pages you would like them not to visit.

The structure of a robots.txt is pretty simple (and barely flexible) – it is an endless list of user agents and disallowed files and directories. Basically, the syntax is as follows:

User-agent:
Disallow:

“User-agent:” names the search engine crawlers the rules apply to, and “Disallow:” lists the files and directories to be excluded from crawling. In addition to “User-agent:” and “Disallow:” entries, you can include comment lines: just put the # sign at the beginning of the line:

# All user agents are disallowed to see the /temp directory.
User-agent: *
Disallow: /temp/
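To see how a crawler reads those three lines, here is a small sketch using Python's standard urllib.robotparser module. The bot name AnyBot and the www.example.com URLs are assumptions for illustration only.

```python
from urllib.robotparser import RobotFileParser

# The example rules, with User-agent and Disallow kept in one record
# (no blank line between them).
RULES = """\
# All user agents are disallowed to see the /temp directory.
User-agent: *
Disallow: /temp/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = build_parser(RULES)
# Anything under /temp/ is off limits; the rest of the site is not.
print(parser.can_fetch("AnyBot", "http://www.example.com/temp/page.html"))  # False
print(parser.can_fetch("AnyBot", "http://www.example.com/index.html"))      # True
```

The comment line is ignored by the parser, and the Disallow value works as a path prefix, so every URL whose path starts with /temp/ is excluded.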

spaculus 02-25-2013 07:08 AM

The robots.txt file sits in the root directory of a website and was created to direct the activity of search engine crawlers or spiders.

jayanta1 02-25-2013 11:00 PM

The concept and structure of robots.txt were developed more than a decade ago. Robots.txt is a text (not html) file you put on your site to tell search robots which pages you would like them not to visit. Robots.txt is by no means mandatory for search engines, but generally search engines obey what they are asked not to do. The location of robots.txt is very important. It must be in the main directory, because otherwise user agents (search engines) will not be able to find it; they do not search the whole site for a file named robots.txt.
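To illustrate the point about location: a crawler works out the robots.txt address from the scheme and host name alone, whatever page it actually wants. A rough Python sketch follows; the function name robots_url and the example URL are made up for this post.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_url(page_url: str) -> str:
    """Return the only place a crawler looks for robots.txt for this page."""
    parts = urlsplit(page_url)
    # Keep scheme and host; replace the path and drop query and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


print(robots_url("http://www.example.com/shop/items/index.php?id=7"))
# http://www.example.com/robots.txt
```

A file saved as /shop/robots.txt would never be requested, which is why it has to live in the main directory.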

asad111720 02-26-2013 03:07 PM

Go on......Thanks all..

halenmartinez 02-27-2013 03:30 AM

As you can easily judge by its name, robots.txt means a message or text for robots (search engine spiders). In it, the webmaster tells bots which pages to crawl and which not to.

spaculus 02-28-2013 04:12 AM

I no longer see any effect from the robots.txt file. Search engines also index pages that are marked nofollow. I tried it, and the file has no effect now.

James Andy 02-28-2013 05:16 AM

Robots.txt is a text (not html) file you put on your site to tell search robots which pages you would like them not to visit. Robots.txt is by no means mandatory for search engines but generally search engines obey what they are asked not to do.

delhicourses 02-28-2013 09:12 AM

Robots.txt is a normal text file in which you can give instructions to all search engine robots about which pages to crawl and which not to crawl.

executiveedge01 02-28-2013 05:26 PM

Robots.txt is sometimes used with security in mind, to keep crawlers away from parts of the website. It only asks robots to stay out, though; it does not stop anonymous users from visiting the website.

synchronous 02-28-2013 11:02 PM

Robots.txt is a text file present in the root directory of a website. The robots.txt file is a convention created to direct the activity of search engine crawlers or web spiders. The file tells search engine crawlers which parts of a website to crawl and which parts to leave alone, distinguishing between what is viewable to the public and what is meant for the creators of the website alone.

sijugk 03-04-2013 08:29 AM

The robots.txt file is the place where you can write rules for web spiders about how to crawl the site and which pages should be excluded. You can write rules specifically for each web spider.
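A quick sketch of per-spider rules, checked with Python's standard urllib.robotparser. ExampleBot and OtherBot are hypothetical spider names, and the paths are invented for the example.

```python
from urllib.robotparser import RobotFileParser

# One record for a specific spider, one default record for everyone else.
RULES = """\
User-agent: ExampleBot
Disallow: /private/

User-agent: *
Disallow: /temp/
"""

parser = RobotFileParser()
parser.parse(RULES.splitlines())

site = "http://www.example.com"
# ExampleBot follows only its own record.
print(parser.can_fetch("ExampleBot", site + "/private/a.html"))  # False
print(parser.can_fetch("ExampleBot", site + "/temp/a.html"))     # True
# Any other spider falls back to the "*" record.
print(parser.can_fetch("OtherBot", site + "/private/a.html"))    # True
print(parser.can_fetch("OtherBot", site + "/temp/a.html"))       # False
```

Note that a spider with its own record does not also inherit the "*" rules in this parser, so anything you want blocked for everyone has to be repeated in each record.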

aartikrypton 03-05-2013 04:03 AM

Hi,

With the help of a robots.txt file, search engines know which of your webpages to crawl.

Thanks
Team from
kryptonsoft

jesse12 03-05-2013 04:18 AM

The robots file is a simple text file which is uploaded to the root directory, and it is the very first file the crawler reads. In this file we write rules saying which files we want to block from the crawler and which ones we want crawled.

The main use of this file is to block the admin section, which we generally do not want search engines to crawl.

Hope you understood.

tech.biztech 03-05-2013 04:43 AM

It is a text file which tells search engine robots which web pages you would not like them to visit. Search engines then avoid the files and folders listed in the robots.txt file.

JACKADEN 03-06-2013 12:22 AM

Here is some information about robots.txt.

Web site owners use the /robots.txt file to give instructions about their site to web robots; this is called The Robots Exclusion Protocol.

It works like this: a robot wants to visit a Web site URL, say http://www.example.com/welcome.html. Before it does so, it first checks for http://www.example.com/robots.txt, and finds:

User-agent: *
Disallow: /

The "User-agent: *" means this section applies to all robots. The "Disallow: /" tells the robot that it should not visit any pages on the site.
There are two important considerations when using /robots.txt:

• Robots can ignore your /robots.txt. Especially malware robots that scan the web for security vulnerabilities, and email address harvesters used by spammers will pay no attention.
• The /robots.txt file is a publicly available file. Anyone can see what sections of your server you don't want robots to use.

So don't try to use /robots.txt to hide information.
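The check described above happens inside the robot itself, which is why a badly behaved robot can simply skip it. Here is a minimal Python sketch of the well-behaved case using the standard urllib.robotparser module; ExampleBot, may_visit and crawl are names invented for this sketch, and the rules are passed in as text so nothing is fetched over the network.

```python
from urllib.robotparser import RobotFileParser

# What the robot finds at http://www.example.com/robots.txt in the example.
ROBOTS_TXT = "User-agent: *\nDisallow: /\n"

URLS = [
    "http://www.example.com/welcome.html",
    "http://www.example.com/about/contact.html",
]


def may_visit(robots_txt: str, user_agent: str, url: str) -> bool:
    """Ask the parsed rules whether this robot may request the URL."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


def crawl(urls, robots_txt, user_agent="ExampleBot"):
    """Return only the URLs a well-behaved robot would go on to request."""
    return [url for url in urls if may_visit(robots_txt, user_agent, url)]


print(crawl(URLS, ROBOTS_TXT))  # [] because "Disallow: /" covers every path
print(crawl(URLS, ""))          # both URLs, since an empty file blocks nothing
```

Nothing on the server enforces the result. A robot that never calls a check like may_visit will fetch the pages anyway, so the file is a request and not a lock.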

spaculus 03-06-2013 06:25 AM

Robots.txt is a text (not html) file you put on your site to tell search robots which pages you would like them not to visit.

thejkamath 03-07-2013 06:31 AM

Robots.txt is a file that tells crawlers and bots whether they may crawl a site and, if so, which pages.

You can allow or disallow bots from crawling your pages.

sher.winchan58 03-07-2013 07:00 AM

Robots.txt is a file through which you can guide search engines to crawl or not to crawl certain sections of your website. It is used to keep crawlers away from certain pages.

jonnysmith 03-07-2013 11:56 PM

The robots exclusion standard, more commonly known as robots.txt, is a convention created to direct the activity of search engine crawlers or web spiders. It takes the form of a text file present in the root directory of a website. The file tells search engine crawlers which parts of a website to crawl and which parts to leave alone, distinguishing between what is viewable to the public and what is meant for the creators of the website alone. Robots are frequently used by search engines to categorize and archive web pages, or by webmasters to proofread source code.

champion 03-10-2013 10:21 PM

It's simply a text file which is used to instruct search engines which files to crawl and which not to.

smithoptimizer 03-11-2013 12:30 AM

The robots exclusion standard, more commonly known as robots.txt, is a convention created to direct the activity of search engine crawlers or web spiders. It takes the form of a text file present in the root directory of a website. The file tells search engine crawlers which parts of a website to crawl and which parts to leave alone, distinguishing between what is viewable to the public and what is meant for the creators of the website alone. Robots are frequently used by search engines to categorize and archive web pages, or by webmasters to proofread source code.

geeks9 03-11-2013 02:54 AM

nice dude.....thank 4 all.......

jeffzeilenga 03-13-2013 04:05 AM

Robots.txt is a file with which we can guide search engines to crawl or not to crawl certain sections of a website.

outure11 03-25-2013 10:17 PM

Actually the search engines handle nofollow differently. Google will not follow the link, assign PR to it, or give it weight in its algorithm. Yahoo will follow the link but give it no weight in their algorithm. MSN says they won't give the link any value but it is hard to confirm exactly what they're doing.

johnmartin9990 03-26-2013 01:46 AM

Well ... robots.txt is a text file in which you define instructions for all search engine robots about which pages you want crawled and which not.

hostsingh 03-27-2013 01:20 AM

Robots.txt is a text file you put on your web site to tell search robots which pages you would like them not to visit. It is important to clarify that robots.txt is not a way of preventing search engines from crawling your site; it is a request that robots may or may not honor.


All times are GMT -7. The time now is 08:39 AM.

