Analytics
Data Mining Epic Fail
Submitted by Dirty D on Wed, 10/08/2008 - 17:08The Ever Amazing Cory Doctorow managed to unearth this little beauty on how government usage of data mining is best described as a catastrophic failure:
They admit that far more Americans live their lives online, using everything from VoIP phones to Facebook to RFID tags in automobiles, than a decade ago, and the databases created by those activities are tempting targets for federal agencies. And they draw a distinction between subject-based data mining (starting with one individual and looking for connections) compared with pattern-based data mining (looking for anomalous activities that could show illegal activities).
But the authors conclude the type of data mining that government bureaucrats would like to do--perhaps inspired by watching too many episodes of the Fox series 24--can't work. "If it were possible to automatically find the digital tracks of terrorists and automatically monitor only the communications of terrorists, public policy choices in this domain would be much simpler. But it is not possible to do so."
There are several points in this to discuss. Let's start by conceding that the massive intrusion of the government into our daily lives by means of data collection and profiling is absolutely terrifying, etc. We all agree to this, and it's non-controversial. Of course, it's also non-trivial, but I think that we can take it as axiomatic that all of us agree that it's pretty scary. The question, though, is how scary it can be when they can't even do it correctly, and the answer is, even more terrifying. The fact that an aggressive lunatic with a rifle has terrible aim doesn't do anything to reassure the people around his target that they won't get hit. The phrase "false positive", while used correctly, somewhat euphemises what's actually happening. What this means is that the government are arresting and charging the wrong people.
There's more...














Recent comments
1 min 28 sec ago
3 min 56 sec ago
11 min 59 sec ago
22 min 3 sec ago
23 min 52 sec ago
25 min ago
26 min ago
29 min 23 sec ago
33 min 25 sec ago
37 min 25 sec ago