Posts Tagged ‘Data Mining’

Fighting fire with fire, data with data

Tuesday, October 28th, 2008

We all know that companies buy and sell databases of information about us.  They use it to make marketing decisions, including how to identify people who are looking to refinance their homes.  There are certainly interesting questions these practices raise about personal information and privacy, but more immediately, as this article makes clear, data mining can have real-life, adverse and sometimes devastating consequences for ordinary people.

For example, when Mercurion Suladdin, a county librarian in Utah, filled out an application to refinance her home at Ameriquest, “she quickly got a call from a salesman at Beneficial, a division of HSBC bank where she had taken out a previous loan.”

The salesman said he desperately wanted to keep her business. To get the deal, he drove to her house from nearby Salt Lake City and offered her a free Ford Taurus at signing.

What she thought was a fixed-interest rate mortgage soon adjusted upward, and Ms. Suladdin fell behind on her payments and came close to foreclosure before Utah’s attorney general and the activist group Acorn interceded on behalf of her and other homeowners in the state.

“I was being bombarded by so many offers that, after a while, it just got more and more confusing,” she says of her ill-fated decision not to carefully read the fine print on her loan documents.

HSBC knew she was a good target because of products offered by data vendors, like “mortgage triggers” and “TargetPoint Predictive Triggers” from Equifax, which “advertises ‘advanced profiling techniques’ to identify people who show a ’statistical propensity to acquire new credit’ within 90 days.”

Data brokers and lenders defend themselves by saying they are offering a service similar to a second opinion from a doctor.  It’s a good point, especially when you consider that a good doctor would never recommend an alternative treatment without going over all the risks and possible side-effects.  The question isn’t whether it was right or not to give her options; the question is whether she had enough information to weigh those options, especially in comparison to all the information HSBC had about her.

The Internet Age allows all of us to have so much information all the time, but the kind of power that comes with data mining and data analysis tools belongs overwhelmingly to banks, search engine companies, and insurance companies, and not at all to to individuals and consumer protection organizations.  Sure, Ms. Suladdin could have tried to Google “HSBC Beneficial risks?”, but that pales in comparison to what HBSC Beneficial was able to do with her data.

Imagine if she had had data analytic tools similar to her bank’s.  I’m not talking about the ability to go to some online forum to ask questions about mortgages.  What if she had been able to access a database that would tell her what kind of mortgages others like her had received?  Or what was statistically likely to happen to interest rates?  Or a combination of these factors?

Or think about all these nonprofit organizations working hard to deal with the subprime mortgage crisis and the rapidly increasing rate of foreclosures.  Like many nonprofits that offer a service, the first step is to identify the people who need their help and notify them these services exist–not very different from what a business has to do in its marketing strategy.  Why shouldn’t nonprofits have access to the same data as the companies that offered those mortgages to these people in the first place?

Anyone who has read this blog knows, we don’t believe in fighting bad uses of data by shutting down data collection altogether.  Let’s fight fire with fire, data with data.

Politics and Privacy, Part II

Thursday, October 2nd, 2008

Last week, I wrote about how political data collection has shown that data collection doesn’t have to be a completely one-way street, but rather, can involve individuals’ active and sometimes almost enthusiastic participation.  Part of the enthusiasm comes from a belief that this is what democracy is about—we have the right to try to persuade our fellow citizens, whether from a soap box in the town square or by calling a voter list through a phone bank.  But the data collection by political campaigns encompasses a lot more than name, occupation, and email address.  Karl Rove revolutionized it, with his famous use of consumer preferences to identify and target likely Republican voters, but the Democrats have worked hard to catch up, Catalist being one of the big players in this effort. It’s one thing to compile donor lists; another to cross-reference “beer versus wine” preferences to voter lists.  How is democracy affected by intense, data-based voter profiling?

As Solon Barocas pointed out during his talk on voter profiling at the recent DIMACS workshop, researchers have found that micro-targeting voters can increase polarization and divisiveness.  As candidates are able to air one radio ad for the Latino voters in one state and a different one for the white voters in another, they’re able to espouse more extreme positions than they would if forced to appeal to a more general audience.

If true, this is a serious problem.  But I like to believe that in the long run, and done right, political data collection and analysis could actually enable new kinds of consensus and coalition-building.  For one, in an era where blogs monitor political campaigns hour-by-hour, a local radio ad can be made available to a national audience no matter which micro-audience was originally targeted.  (Update: we can even find out about “telephone” calls to the deaf community!)

But more importantly, I can imagine that if voters and not just campaigns were able to see who else felt the way they did on major issues, many might be surprised.  Solon mentioned that despite the headlines, the algorithms by which likely Democratic or Republican voters are identified is not as simple as beer = conservative, wine = liberal.  Yes, campaigns believe they can figure out who in a community might lean in their direction, but it’s a much more complicated calculation.

So if people chose to share and know who else felt similarly, in ways that were more fine-grained than national polls, really interesting things could happen to our political discourse.  The Left Coast environmentalist might learn the hunter in South Dakota shares a commitment to conservation.  The pro-choice atheist and the pro-life Catholic might learn they both oppose the death penalty.  I’m not advocating that we throw open the curtains on the voting booth.  But knowing how our fellow citizens feel about the issues facing all of us—it almost sounds like that old-fashioned American democratic institution, the town hall meeting.

After all, democracy is the ultimate social activity.  We’re supposed to be making decisions together.