08-05-2008, 14:18   #6057
rryles
Re: Virgin Media Phorm Webwise Adverts [Updated: See Post No. 1, 77, 102 & 797]

Quote:
Originally Posted by Dephormation
Robots.txt doesn't allow (check the spec), it disallows.

It's a denial mechanism.

What's required is a mechanism of consent, where the absence of explicit consent means no consent.

Pete.

The original robots.txt spec only defines Disallow. However, the benchmark they have set is Google, and Google's bots support an Allow extension. Google's bots also check for meta tags in the documents. Checking those would require intercepting the page first, though.
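
To make the allow/disallow distinction concrete, here is a rough sketch using Python's standard urllib.robotparser. The rules, the bot name "ExampleBot" and the example.com URLs are all made up for illustration. Python's parser applies the first matching rule, so the Allow line is listed first. Note what happens with an empty file: silence means everything is permitted, which is exactly the "denial mechanism" point in the quote.

```python
from urllib.robotparser import RobotFileParser


def parse_rules(text):
    """Build a parser from robots.txt text held in a string."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


# Hypothetical site that opts in to one directory and denies the rest.
opt_in = parse_rules(
    "User-agent: *\n"
    "Allow: /public/\n"
    "Disallow: /\n"
)

# A robots.txt with no rules at all: nothing is denied.
no_rules = parse_rules("")

public_ok = opt_in.can_fetch("ExampleBot", "http://example.com/public/page.html")
private_ok = opt_in.can_fetch("ExampleBot", "http://example.com/private/page.html")
silent_ok = no_rules.can_fetch("ExampleBot", "http://example.com/private/page.html")

print(public_ok, private_ok, silent_ok)  # True False True
```

So a site can approximate opt-in with "Allow" plus a blanket "Disallow: /", but only if it publishes the file in the first place.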

Interestingly, if they obey robots.txt at all, they won't be able to use searches done on Google! What a shame: http://www.google.com/robots.txt disallows all the actual search pages for all user agents.
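
A quick way to see the effect, again as a sketch with Python's urllib.robotparser. The two-line rule set below is my simplified stand-in for the relevant part of Google's file, not a copy of it, and the bot name is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed, simplified rule: search result pages are off limits to every agent.
ASSUMED_RULES = """\
User-agent: *
Disallow: /search
"""

parser = RobotFileParser()
parser.parse(ASSUMED_RULES.splitlines())


def may_fetch(url, agent="HypotheticalProfilerBot"):
    """Return True if the assumed rules let this agent fetch the URL."""
    return parser.can_fetch(agent, url)


search_allowed = may_fetch("http://www.google.com/search?q=phorm")
home_allowed = may_fetch("http://www.google.com/")

print(search_allowed, home_allowed)  # False True
```

Under that assumption, any bot that honours robots.txt could look at the home page but not at a results page, which is where the search terms actually are.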