[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Intelligent Enterprise Magazine - Decision Support: From the Lab



    [ The following text is in the "ISO-8859-1" character set. ]
    [ Your display is set for the "US-ASCII" character set.  ]
    [ Some characters may be displayed incorrectly. ]

http://www.intelligententerprise.com/992610/decision.shtml
   
   
                    October 26, 1999, Volume 2 Number 15
                                      
                            The Privacy Backlash
                                      
          Know your customers well... but don't make them paranoid
                                      
   I make my living as a data miner, and I can't help noticing that it
   makes a lot of people really nervous.
   
   An old college friend I hadn't seen in years, an artist and
   anthropologist, had only a fuzzy idea of what I do from seeing the Web
   site of Data Miners, the consulting company I founded, and was afraid
   I might be into Orwellian "1984 stuff."
   
   But later in the same conversation, she expressed the hope, only
   partly tongue-in-cheek, that my work will save her time at the video
   store; as soon as she enters the store, someone can simply hand her
   the tape that data mining predicts she will want, and she can go home
   happy.
   
   My friend's reaction was a typical mix of fear, misunderstanding, and
   willingness to see a bright side. I wish that clearing up the
   misunderstandings could alleviate the fears. Unfortunately, a clearer
   understanding of the power inherent in the com- bination of powerful
   data-mining tools and widely available personal data from many sources
   is likely to make people (and the people's representatives in
   Congress) more worried rather than less.
   
   Data mining can modestly improve our lives by helping our vendors and
   service providers better understand our needs. It can also be abused
   to invade our privacy. It won't take many examples of such abuse to
   raise an uproar leading to restrictive legislation likely to treat
   "good data mining" and "bad data mining" alike.
   
   The only way to avoid such a backlash is to educate industry and
   consumers alike how to determine what data mining applications are
   legitimate, and most important, to give people some control over how
   their personal data is used. After all, most people are happy to trade
   away some of their privacy for some other benefit. We accept having
   our picture taken by the ATM, because it makes us feel safer knowing
   that a robber would also be on camera. We accept having our luggage
   X-rayed, bomb-scanned, and ransacked at the airport, because we are
   glad a terrorist's luggage would get the same treatment.
   
   The public's acceptance of data mining requires that two conditions be
   met:
   
   o An explicit and well-understood covenant exists between those mining
   the data and the subjects of their analysis
   
   o Individuals feel they control the data they provide and how data
   miners can use it.
   
Benign Data Mining

   Most data-mining applications in marketing are benign. Despite all the
   rhetoric about "relationship marketing," no marketers are interested
   in you as a human being; they are interested in you as a potential
   customer. Information about you that sheds no light on your propensity
   to buy a certain product is simply not interesting to the vendor. Your
   age, income, sexual orientation, political affiliations, number of
   credit cards, and fondness for lima beans may all be things you choose
   not to share with your neighbors. But even personal information that
   would set the neighbors' tongues wagging for weeks on end does not
   interest the commercial data miner unless it can help predict whether
   you will order from a catalog or default on a loan, for example.
   
   Anyone really out to get you -- an ex-spouse or a collection agency,
   perhaps -- doesn't need data mining. The information that finds its
   way into marketing databases has always been available to those
   willing to look hard enough. In Massachusetts, where I live, I am free
   to check the registry of motor vehicles for an inconsiderately parked
   car's owner name and address. I can then walk over to the registry of
   deeds and see what his house is worth. And I can look up his number in
   the phone book if I want to call him to give him a piece of my mind.
   What is new with data mining is not the ability to determine who owns
   a particular car, but the ability to scan thousands of automobile
   registrations looking for patterns.
   
   Informed Consent. As consumers begin to realize the value of
   information about themselves and their habits, they will start
   charging for it. Many supermarkets already pay for this information.
   The ability to tie purchases to an individual is valuable enough that
   the store is willing to pay for it in the form of additional discounts
   offered to people who identify themselves each time they make a
   purchase. As long as the data is used only to figure out what coupons
   to offer which customers, and not to figure out who is eating too much
   fat, most people don't mind supplying the information and taking the
   discount. As consumers become more savvy, they will start expecting to
   be paid for their information in other situations as well.
   
   The Customer Rules. As long as companies are collecting data on you
   only because they want to sell you things, you are in control. If they
   misuse that information in ways that disturb you, you will be less
   likely to buy their products and services. That explains why MCI asks
   you who your friends and family are instead of just figuring it out
   for themselves and calling them up. Marketers' fear of upsetting you
   is the strongest protection you have against their misuse of your
   personal data.
   
Malign Data Mining

   Fear of offending consumers may stop some abuses, but not where real
   money is at stake. Recently, a supermarket defendant in a
   slip-and-fall suit used loyalty-card data to show that the plaintiff
   was a heavy drinker (or at least purchased a lot of alcohol). The suit
   was dropped. All of a sudden, many people who never gave much thought
   to what use their loyalty-card data might be put have started worrying
   about it.
   
   Similar worrying news stories are reported nearly every day. There was
   the wife who opened a telephone company's discount offer for calls to
   a frequently dialed number that her (now former) husband shouldn't
   have been calling. There were the drugstore customers alarmed to
   discover the reason they were getting direct-mail solicitations
   relevant to their illnesses was that a drugstore chain had sold its
   prescription data. The stories are numerous.
   
   It is easy to imagine worse scenarios. The analysis techniques that
   transform catalog order data into mailing-campaign targets could
   easily serve nefarious purposes. For instance, a "big brother"
   government might find data-mining techniques handy for compiling an
   enemy list. If we classify data-mining applications on a continuum on
   which a direct mailer deciding not to send you a sweepstakes entry is
   at one end, and a repressive state identifying you as a target for
   special persecution is at the other, many of them fall somewhere in
   the middle. How can we draw a line between the applications that ought
   to be tolerated or even welcomed and those that should be feared and
   outlawed? My answer is to evaluate each application on two scales:
   
   1. How close is the alignment between the people doing the data mining
   and the people whose data is being mined?
   
   2. What is the balance of power between the miners and the mined?
   
   Let's look at a few potential applications of data mining, keeping
   these two scales in mind.
   
   In the case of a consumer direct-marketing organization trying to
   reach the right customers, the interests of the miners and their
   targets are actually very closely aligned. Consumers do not want to
   get junk mail advertising products and services in which they have no
   interest. Similarly, the mailer has no interest in wasting postage on
   people who are unlikely to respond. Conversely, if the offer is one
   the consumer considers valuable, both the vendor and the consumer are
   pleased. As for power, it is all in the hands of the consumer, who is
   free to decide whether to respond to the offer.
   
   A more troubling prospect is the mining of medical records, credit
   card transactions, supermarket purchase records, or lifestyle data in
   order to assign risks for various ailments to individuals or
   subpopulations. How well the miner's interests align with those of the
   mined depends greatly on the nature of the healthcare system. Most of
   the developed world accepts that society as a whole benefits from, and
   is responsible for, maintaining a healthy population.
   
   In most wealthy countries, this understanding has led to the creation
   of single-payer healthcare systems in which every citizen is
   automatically covered. The interests of such a system and of the
   individual are reasonably compatible. The healthcare system saves
   money by preventing people from becoming ill and by getting them early
   treatment when they are in need. Because people tend to prefer being
   healthy to being sick, they have no particular reason to withhold
   information that may help in their diagnosis or treatment. Power is
   balanced between the miner and the mined. The healthcare system has
   the power to decide which treatments to pursue, but it does not have
   the power to refuse coverage.
   
   In the United States, the situation is quite different. Healthcare is
   usually financed through myriad for-profit insurance companies. These
   companies can save money and increase their profitability by refusing
   to cover people who are at greater risk of becoming ill. Here, the
   interests of the individual and the provider are at odds. The sicker I
   am, the more I want healthcare and the less inclined the insurer is to
   provide it. Furthermore, in the U.S. system, the power resides totally
   with the insurer, which can approve or deny coverage. Thus, while I
   might look with indifference on a project to use data mining for
   medical risk assessment in Canada or Europe, I would regard a similar
   United States program with alarm.
   
   In fact, medical records are already accorded a higher level of
   protection than most data. But what if non-medical data were used for
   the same purpose? Although I do not object to the supermarket using my
   purchasing patterns to determine which coupons to issue me, I would
   feel very differently about the supermarket data being used by an
   insurance company to determine my risk for heart disease. And yet,
   premiums are already higher for cigarette smokers, so why not for
   people who purchase a lot of beef and sour cream?
   
   Similar questions about data misuse come up with automatic
   toll-payment systems (who is interested in where you are and when?),
   telephone records (why do they want to know who your friends and
   family are?), and even magazine subscriptions, catalog orders, or Web
   site visits.
   
   Data mining is a powerful tool. Like any tool, it can be used for ill.
   As our information society matures, we will have to develop new laws
   and conventions to cope with the new methods of manipulating
   information. We should not rush to regulate harmless data mining, but
   the best regulations, where necessary, will be those that best balance
   power and support mutuality of goals between data miners and
   individuals.
   
   Michael J. A. Berry, founder and principal of Data Miners, is
   co-creator of the Decision-Support Systems Laboratory (www.dsslab.com)
   in Cambridge, Mass. You can reach him at mjab@dsslab.com.
   
   
   
   
   
   Copyright © 1999 CMP Media Inc. ALL RIGHTS RESERVED
   No Reproduction without permission
   
   | HOME |SEARCH | SUBSCRIBE |
   
                       Whitelight analytical app IQ