![]() |
Re: Virgin Media Phorm Webwise Adverts [Updated: See Post No. 1, 77, 102 & 797]
Do you think he gives a ****
---------- Post added at 02:31 ---------- Previous post was at 02:25 ---------- Do you think he gives a **** The man can get more $1000 whores than you can shake a stick at. And that's what it's all about.[/QUOTE] Yes, I think he does - this is a battle of hearts and minds and it's being fought on our terms on forums like this one. We are thorns in his side, there are plenty of laws he's yet to break - and don't worry we'll complain and make a song and dance about every single one until in the end he'll ***** off and dream up some other scheme in some country where there arn't so many laws to break... :D |
Re: Virgin Media Phorm Webwise Adverts [Updated: See Post No. 1, 77, 102 & 797]
Haven't read all the thread as it's too long but this may have been mentioned before. If you use Firefox as your browser then download the dephormation add on available. Just Google for it.
|
Re: Virgin Media Phorm Webwise Adverts [Updated: See Post No. 1, 77, 102 & 797]
Quote:
|
Re: Virgin Media Phorm Webwise Adverts [Updated: See Post No. 1, 77, 102 & 797]
Quote:
|
Re: Virgin Media Phorm Webwise Adverts [Updated: See Post No. 1, 77, 102 & 797]
Quote:
---------- Post added at 07:33 ---------- Previous post was at 07:27 ---------- Quote:
It's possible that the spokesperson was simply confused, as they don't get on well with very specific questions about how Phorm/Webwise works - but the answer seemed clear enough even if it was wrong. |
Re: Virgin Media Phorm Webwise Adverts [Updated: See Post No. 1, 77, 102 & 797]
Quote:
Quote:
|
Re: Virgin Media Phorm Webwise Adverts [Updated: See Post No. 1, 77, 102 & 797]
Quote:
Also: If they by some miracle actually followed the robots.txt standard then the user-agent they match against and the one they send in the http headers must match: "The name token a robot chooses for itself should be sent as part of the HTTP User-agent header, and must be well documented."** Sources: * From http://www.cl.cam.ac.uk/~rnc1/080404phorm.pdf "40. Once the robots.txt file (if any) has been fetched, it will be cached. The cache retention period will be value set by the website using standard HTTP cache-control mechanisms, or for one month if no period is specified. The minimum period that the file will be cached for is two hours." ** From http://www.robotstxt.org/norobots-rfc.txt |
Re: Virgin Media Phorm Webwise Adverts [Updated: See Post No. 1, 77, 102 & 797]
Quote:
Then the second stage is them actually forwarding your original request (yes there are some redirects and stuff going on in between but lets try and keep it simple) where we can only assume your real user-agent will be used. Certainly there has been no indication from Phorm that they will be using a different user-agent for these requests (and realistically they wouldn't want to as they could then be easily identified and blocked). Alexander Hanff |
Re: Virgin Media Phorm Webwise Adverts [Updated: See Post No. 1, 77, 102 & 797]
As Robert has already pointed out, using robots.txt as permission to profile is hopeless. Many sites do not use robots.txt and others, like we with ISP hosted sites, are unable to set one up anyway. If you are quite happy for Google to scan our sites and you therefore have no specific reason to include a robots.txt, then you won't bother. I imagine there are countless sites out there, including some significant ones, who have never bothered with it, or in some cases don't even know what it is.
It would be nice to suggest that if robots.txt were missing then Phorm would not profile that site, but I guess that would be asking too much..... On a similar theme, Phorm have never made clear if they look at and use the <meta> tags in sites. I would feel tempted to include loads of words in those to totally upset the meaning of any profiling information, after all you only need to ensure they are the ten most used words on the site.... |
Re: Virgin Media Phorm Webwise Adverts [Updated: See Post No. 1, 77, 102 & 797]
Quote:
If/When the BT trial happens, it's not going to be that difficult to work out what that user agent will be, just visit your own website (assuming you are phormed) then check your web logs. Now assuming they don't forge a googlebot user agent and do use their own unique user agent, then it should be fairly simple to configure a web server to parse robots.txt as a script (I am sure I could set this up easily with apache/php) and serve different content based on the user agent. If it's a phorm user agent then deny the entire site, if not then serve your usual robots.txt. Although this still doesn't get round the implied consent/default opt-in issue for webmasters/content authors, it's something to think about. Regards... T |
Re: Virgin Media Phorm Webwise Adverts [Updated: See Post No. 1, 77, 102 & 797]
I've posted the following on Google Webmaster tools forum
http://groups.google.com/group/Googl...90386b9ad852d0 and await the response. If anyone has a more direct route to Google management they are allowed to give me, I would be grateful - but the forum pigoens seemed the best ones to let the cat loose amongst. Quote As a Webmaster I am concerned about what my UK ISP (BTYahoo!/BT Broadband) have told me about their plans to implement Webwise, a technology patented by former spyware company 121Media, now known as Phorm.Inc. 121 Media were formerly responsible for PeopleonPage, and for placing difficult to remove rootkits on people's computers. This technology uses Layer 7 interception of a users complete http traffic to profile/mirror their browsing behaviour, and then use the information to serve up targeted ads, based on an anonymised, cookie- based UID placed on the users computer. It also involves the forging of a cookie, purporting to come from the website visited, even if that website has a privacy policy that says it does not set cookies. It is similar but not identical to the US company NebuAd technology. There has been relatively little debate about the isses this technology raises for webmasters. BT have stated that Webwise/Phorm will assume implied consent of webmasters, to profile copyrighted web content, copy it, and exploit it for commercial gain IF THE WEBMASTER CONSENTS TO A GOOGLE SPIDER visiting their site. They are equating their deep level Layer 7 intrusive interception technology with that of the Google search engine. This may even lead to confusion in people's minds between Webwise and Google, and they may think that Google is in some way linked to Phorm/Webwise. They are refusing to give webmasters a way of excluding Webwise specifically using robots.txt - instead they are saying if we let Google in, we let Webwise in. They have specifically repeatedly named Google as the search engine robots.txt directive they will be looking for in order to establish what they claim will be implied consent for Webwise on the part of Webmasters. Of course the major search engines allow and even assist webmasters to exclude their robot from spidering sites by the use of specific user agent strings. Webwise will neither set a user agent string, nor permit itself to be specifically excluded via robots.txt. Google need to be aware of this as it means that one possible step a webmaster might take is to allow all other search engines to crawl their site, but exclude Google. That way, they can exclude Webwise, because they have excluded Google. I would imagine Google would not be happy about this as it has the potential to adversely impact their business model, by linking access to Google robots with Layer 7 interception by Phorm/Webwise. I am happy to provide further information to Google on this, including details of communications received from my ISP, if someone from google contacts me by email. A good starting point for information is here http://www.inphormationdesk.org/ followed by a text version of my disposable email address for google End Quote |
Re: Virgin Media Phorm Webwise Adverts [Updated: See Post No. 1, 77, 102 & 797]
Quote:
I think you're probably right about them spoofing the user agent for the "second stage". Otherwise they couldn't be sure they were being served the same content. Many sites tailor content based on user-agent strings. As for the robots.txt fetch they have a dilema. Either: They completely emulate googlebot's behaviour which may risk litigation from Google. or: They do something that differentiates them from googlebot and allows them to be denied. (e.g. the user-agent string they send in the http headers is different so we serve a different robots.txt - not the easiest solution for a webmaster to implement and impossible unless you've got a proper hosting solution) If they really wanted to create a new "gold standard" for user privacy then they would be a lot more open on these details. "The name token a robot chooses for itself should be sent as part of the HTTP User-agent header, and must be well documented." |
Re: Virgin Media Phorm Webwise Adverts [Updated: See Post No. 1, 77, 102 & 797]
Quote:
Let's not forget that robots.txt is not an access control mechanism, it is an honour based system which robots can either adhere to or ignore, it doesn't physically stop them accessing pages. If their user-agent ever does get discovered, it would be useful to just add a script to your site which checks user-agent and if the Phorm user-agent is detected it builds a page which says something like "Get your hands of me you dirty ape!" or "Phorm is not welcome here, please go away." etc etc etc. Alexander Hanff |
Re: Virgin Media Phorm Webwise Adverts [Updated: See Post No. 1, 77, 102 & 797]
Quote:
|
Re: Virgin Media Phorm Webwise Adverts [Updated: See Post No. 1, 77, 102 & 797]
Quote:
My suggestion was just for the request for robots.txt, any other page (eg stage two as you put it) I believe they just pass on the end users user agent which is useless in this situation. Regards... T ---------- Post added at 10:13 ---------- Previous post was at 10:08 ---------- Quote:
|
All times are GMT +1. The time now is 21:36. |
Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2025, vBulletin Solutions Inc.
All Posts and Content are Cable Forum