03 August 2010

Some cautionary words re consumer-tracking software

The largest U.S. websites are installing new and intrusive consumer-tracking technologies on the computers of people visiting their sites—in some cases, more than 100 tracking tools at a time—a Wall Street Journal investigation has found.

The tracking files represent the leading edge of a lightly regulated, emerging industry of data-gatherers who are in effect establishing a new business model for the Internet: one based on intensive surveillance of people to sell data about, and predictions of, their interests and activities, in real time...

In an effort to quantify the reach and sophistication of the tracking industry, the Journal examined the 50 most popular websites in the U.S. to measure the quantity and capabilities of the "cookies," "beacons" and other trackers installed on a visitor's computer by each site. Together, the 50 sites account for roughly 40% of U.S. page-views.

The 50 sites installed a total of 3,180 tracking files on a test computer used to conduct the study. Only one site, the encyclopedia Wikipedia.org, installed none. Twelve sites, including IAC/InterActive Corp.'s Dictionary.com, Comcast Corp.'s Comcast.net and Microsoft Corp.'s MSN.com, installed more than 100 tracking tools apiece in the course of the Journal's test...

The state of the art is growing increasingly intrusive, the Journal found. Some tracking files can record a person's keystrokes online and then transmit the text to a data-gathering company that analyzes it for content, tone and clues to a person's social connections. Other tracking files can re-spawn trackers that a person may have deleted...

Some of the tracking files identified by the Journal were so detailed that they verged on being anonymous in name only. They enabled data-gathering companies to build personal profiles that could include age, gender, race, zip code, income, marital status and health concerns, along with recent purchases and favorite TV shows and movies.

The ad industry says tracking doesn't violate anyone's privacy because the data sold doesn't identify people by name, and the tracking activity is disclosed in privacy policies. And while many companies are involved in collecting, analyzing and selling the data, they provide a useful service by raising the chance Internet users see ads and information relevant to them personally...
More at the link, including suggestions on how to protect yourself against trackers. This comment from the sidebar nicely sums up the situation:
It's rarely a coincidence when you see Web ads for products that match your interests.

1 comment:

  1. I do work for a company that operates in that area. Most real time bids (where that gathered information is used a lot) have to happen within like 50 milliseconds.

    This means we constantly have to keep all known users in our servers' memory, ie. we can't store them on hard drives and thus what we collect is fairly limited, plus we do expire anything that is older than 4 weeks.

    Compare that with the unlimited storage of - lets say - your Amazon purchases.

    Here is a nice writeup looking at it from a different perspective:

    http://www.adotas.com/2010/05/offline-targeting-the-real-punch-in-the-nose/

    ReplyDelete