This module parses /robots.txt files as specified in "A Standard for
Robot Exclusion", at <http://www.robotstxt.org/wc/norobots.html>.
Webmasters can use the /robots.txt file to forbid conforming robots
from accessing parts of their web site.
The parsed files are kept in a WWW::RobotRules object, and this
object provides methods to check if access to a given URL is
prohibited. The same WWW::RobotRules object can be used for one
or more parsed /robots.txt files on any number of hosts.
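
As an illustration of the description above, here is a minimal sketch of
parsing /robots.txt files for two hosts into one WWW::RobotRules object
and then checking URLs against it. The robot name "ExampleBot/1.0", the
example.com and example.org hosts, and the inline /robots.txt contents
are assumptions made for the example. A real robot would normally fetch
each /robots.txt file over HTTP before parsing it.

```perl
use strict;
use warnings;
use WWW::RobotRules;

# Hypothetical robot name; use your own robot's User-Agent string.
my $rules = WWW::RobotRules->new('ExampleBot/1.0');

# Inline /robots.txt contents (assumed for illustration only).
my $com_txt = <<'EOT';
User-agent: *
Disallow: /private/
EOT

my $org_txt = <<'EOT';
User-agent: *
Disallow: /tmp/
EOT

# Each parse() call takes the URL the file came from and its contents.
# The same object keeps the rules for both hosts.
$rules->parse('http://www.example.com/robots.txt', $com_txt);
$rules->parse('http://www.example.org/robots.txt', $org_txt);

my @urls = (
    'http://www.example.com/index.html',
    'http://www.example.com/private/data.html',
    'http://www.example.org/tmp/scratch.html',
);

for my $url (@urls) {
    # allowed() is false when a parsed rule prohibits access to the URL.
    my $verdict = $rules->allowed($url) ? 'allowed' : 'prohibited';
    print "$url: $verdict\n";
}
```

Rules are stored per host, so the /private/ rule parsed for
www.example.com does not affect URLs on www.example.org.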