Quote:
Originally Posted by rryles
snip
Also on the subject of robots.txt - Googlebot's full user agent string is something like:
Code:
Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)
The interesting part is the url pointing to information about google bot and what it does. If phorm fake that then they are certainly committing some offense IMO.
|
AFAIK Phorm/Webwise doesn't leave or use ANY user-agent string at all.
It assumes consent to profile if Google is allowed to spider, but I don't think it is using a googlebot useragent string. So far, there has been no information at all about what information a website owner might find in their logs to indicate that Webwise has been accompanying a site visitor.
Remember Webwise doesn't crawl the site in the way a spider does, it simply profiles/copies/ the browsing done by a site visitor with Webwise switched on.
To detect their visit a site has to detect the phorged cookies it sets, and also the Phorm UID cookie. (Which is what the dephormation tools for webmasters are attempting to do).
If anyone knows otherwise, I'd love to hear about it.