# The format and semantics of the "/robots.txt" file are as follows
# (this is an edited abstract of
# http://info.webcrawler.com/mak/projects/robots/robots.html):
#
# The file consists of one or more records separated by one or more
# blank lines. Each record contains lines of the form
#
#   <field-name>: <value>
#
# The field name is case insensitive. Text after the '#' character on a
# line is ignored during parsing. This is used for comments. The
# following <field-names> can be used:
#
# =over 3
#
# =item User-Agent
#
# The value of this field is the name of the robot the record is
# describing access policy for. If more than one I<User-Agent> field is
# present, the record describes an identical access policy for more than
# one robot. At least one field needs to be present per record. If the
# value is '*', the record describes the default access policy for any
# robot that has not matched any of the other records.
#
# =item Disallow
#
# The value of this field specifies a partial URL that is not to be
# visited. This can be a full path or a partial path; any URL that
# starts with this value will not be retrieved.
#
# =back
#
# =head1 ROBOTS.TXT EXAMPLES
#
# The following example "/robots.txt" file specifies that no robots
# should visit any URL starting with "/cyberworld/map/" or "/tmp/":
#
#   User-agent: *
#   Disallow: /cyberworld/map/ # This is an infinite virtual URL space
#   Disallow: /tmp/ # these will soon disappear
#
# This example "/robots.txt" file specifies that no robots should visit
# any URL starting with "/cyberworld/map/", except the robot called
# "cybermapper":
#
#   User-agent: *
#   Disallow: /cyberworld/map/ # This is an infinite virtual URL space
#
#   # Cybermapper knows where to go.
#   User-agent: cybermapper
#   Disallow:
#
# This example indicates that no robots should visit this site further:
#
#   # go away
#   User-agent: *
#   Disallow: /
#
# =head1 SEE ALSO
#
# L, L
#
# =cut

User-agent: *
Disallow: /webstat/
Disallow: /errordocs/
Disallow: /ads/
Disallow: /forms/
Disallow: /calendar/
Disallow: /home/
Disallow: /inc/
Disallow: /admin/
Disallow: /images/
Disallow: /include/
Disallow: /phpMyAdmin/
Disallow: /domains/
Disallow: /wwwboard/
Disallow: /spammers/