How Data Analysis Is Driving Policing Police are getting better tools for mining data. They're supposed to make law enforcement more surgical, but some say it's a high-tech justification for targeting certain places and people.
NPR logo

How Data Analysis Is Driving Policing

  • Download
  • <iframe src="https://www.npr.org/player/embed/622715984/623318978" width="100%" height="290" frameborder="0" scrolling="no" title="NPR embedded audio player">
  • Transcript
How Data Analysis Is Driving Policing

How Data Analysis Is Driving Policing

  • Download
  • <iframe src="https://www.npr.org/player/embed/622715984/623318978" width="100%" height="290" frameborder="0" scrolling="no" title="NPR embedded audio player">
  • Transcript

AUDIE CORNISH, HOST:

We live in an age of big data as our daily decisions are increasingly steered by computers. This week on All Tech Considered, we learn how the police are catching up with the rest of us.

(SOUNDBITE OF ULRICH SCHNAUSS' "NOTHING HAPPENS IN JUNE")

CORNISH: Big police departments are relying more and more on data to decide where to patrol and even whom to keep a close eye on. But as NPR's Martin Kaste reports from Los Angeles, some people worry that data-driven policing is just a high-tech cover for old-fashioned profiling.

UNIDENTIFIED PERSON #1: So we have the name Joseph.

MARTIN KASTE, BYLINE: You probably already assumed that the police could do this kind of search - type in someone's name and see what pops up.

UNIDENTIFIED PERSON #1: It's linking it to a residence, you know, user of this phone number, associated with this vehicle, works in that address.

KASTE: But actually, this is kind of revolutionary. TV cops do this all the time of course. But in real life, police information systems are often clunky with different kinds of information kept in separate silos. Cops spend hours logging into the different systems, printing stuff out, collating it all by hand. But lately, some bigger departments have finally set up internal search systems that can combine all this in one place, on one screen. And the Los Angeles Police Department is one of them.

UNIDENTIFIED PERSON #1: Hollywood.

UNIDENTIFIED PERSON #2: Hollywood's on.

UNIDENTIFIED PERSON #1: Thank you. Wilshire.

UNIDENTIFIED PERSON #3: Wilshire's on.

KASTE: When you listen to the daily crime conference call here, the effect of these new data tools is obvious. The crime stats are up-to-the-minute, tracked and mapped in granular fashion.

UNIDENTIFIED PERSON #4: So Hollywood - so I see yesterday we have one robbery, three burglary for motor vehicles. So what's going on?

KASTE: You probably remember CompStat. That was the crime tracking system pioneered in New York in the 1990s. But this is much faster. Instead of looking at last week's statistics, LAPD is analyzing crime trends within hours and responding almost as quickly. It's an ever-shrinking feedback loop that directly affects the assignments of cops on the street like Jennifer Ramirez.

JENNIFER RAMIREZ: So basically this is what we get on a daily basis. And we take out a mission sheet.

KASTE: Ramirez patrols in the department's Olympic division, which includes Koreatown. The mission sheet she mentioned is generated by her captain and lieutenant using a variety of software tools. And it includes detailed maps of problem areas that the LAPD calls LASER zones.

RAMIREZ: I want to make sure that I try and spend as much time in this area of my free time because these are the hot spots. These are where the crimes will tend to happen.

KASTE: And this system isn't just putting officers at certain places. It's also pointing them at certain people. Ramirez's mission sheet also comes with names and photos.

RAMIREZ: These are people that we are going to be looking out for in the area who have been our chronic offenders.

KASTE: Chronic offenders - this may be the most sensitive aspect of the LAPD's recent data analytics push. Each of these LASER zones comes with a list of the top people that the cops are supposed to watch for. Membership on this list is determined by a formula that draws on data about past interactions with the justice system. The cops are supposed to try to interact with these people as much as possible sort of as a warning to stay out of trouble. But to Anthony Robles, it all sounds way too familiar.

ANTHONY ROBLES: They're just reinventing their surveillance techniques and machinery.

KASTE: Robles is an organizer with the Youth Justice Coalition. It's an activist group run by people who've been incarcerated. He was one of the plaintiffs in a lawsuit against the LAPD's long-standing gang injunction system which relied on lists of people that the police categorized as gang affiliates. He recalls what it was like for him as a teenager when he was on that gang list.

ROBLES: Every time I drove by to that block or drove anywhere, I'd get pulled over. A lot of times they would search my car. They wouldn't find anything, and then they would just give me, like, a moving violation ticket. It led to a lot of anger. It made me want to go do something bad.

KASTE: This gang list system now seems to be on its way out, but Robles thinks it's being replaced by the Chronic Offenders Bulletin and the LASER zones and the other data analysis tools. And Jamie Garcia agrees. She's with an activist group called the Stop LAPD Spying Coalition, which sued to get more details about how the department is using data.

JAMIE GARCIA: These programs are nothing new in the history of policing. What they're trying to call science is really pseudoscience. The bias is still very much inherent in the data that's being used, and the same communities are being impacted.

KASTE: For instance, she says the formula for determining whether someone's on the Chronic Offenders Bulletin is based partly on how often someone is interviewed by the police. But that's something that's simply more likely to happen in those places with heavier police presence. To her, it's just a vicious cycle with a high-tech veneer. But inside the LAPD, the brass are pleased with the new data tools.

DENNIS KATO: It works.

KASTE: Dennis Kato is a deputy chief who's been tasked with helping to roll out these new analytics systems to all of this sprawling department's divisions by 2020. He says what people have to remember is that compared to a generation ago, crime in this city is much lower.

KATO: But you know what - so is our arrest numbers. So that's a good thing - right? - because that means we're arresting the right people. We're not out there saturating. We're not just picking up people for everything.

KASTE: At the same time, Kato says police do need to be careful as they get more, as he puts it, creative with the data.

KATO: We've got to figure out, what is the boundaries? How much is good data? And what is the input mechanisms? And all of that's happening. We question this stuff all the time.

CORNISH: And Martin joins us now to talk a little more about how police are using data. And, Martin, just start with a clarification here. Is the LAPD getting new kinds of data?

KASTE: It's not really new data, Audie. What we're talking about here is a new way to access the data they already have in about 19 databases. They say this is all just information in government-created databases like licenses and arrest records, that kind of thing. What they're doing, though, is - you know, these things are in silos. They're separate. They have some software by a company called Palantir that lets them sort of search across it and look for relationships. Now, this is not unique.

Palantir gets a lot of press because there's a lot of sort of suspicions about their contracts with the federal government, with national security contracts. But really other software can do the same kind of thing. It's called sort of relational databases. But really what we're talking about here is just speed and ease of searching across a lot of different kinds of data.

CORNISH: Besides the concerns we heard in this story about profiling certain people or neighborhoods, are there broader concerns just about privacy?

KASTE: Yeah, there may be. I mean, as I said, this is easier now to look at different kinds of data, a lot less time logging into different things. And where there's ease there's temptation to abuse it. In California, there's been some discussion about incidences of police officers inappropriately getting into something called the C-L-E-T-S or CLETS database, which is DMV information. There's been some pressure on them to police that better, so to speak. And so you can imagine if an officer has access to a computer where he can sit down and just search a lot of stuff all at once with one screen, yeah, there might be some temptations.

CORNISH: And is there a limit at least on how long they can hold onto all this information?

KASTE: Well, there are some practical considerations involving storage. One example here is the license plate reader data. You probably know about license plate readers in cars. As the squad cars drive around, they're just gathering up all the license plates that they spot. It gets stored. You can recreate on a map where a car has been.

It turns out they're keeping that indefinitely. You know, if it bugs you that someone in the LAPD could reconstruct where you've driven over the last few years, I just talked to Deputy Chief Kato, who's in this story, about that. He says it looks like they have yet to delete any of that data. So going back to when they first started doing that, they could probably reconstruct where a car in LA has been.

CORNISH: That's NPR's Martin Kaste. Martin, thanks for your reporting.

KASTE: You're welcome.

(SOUNDBITE OF EVIDENCE'S "DARKER CLOUDS (INSTRUMENTAL)")

Copyright © 2018 NPR. All rights reserved. Visit our website terms of use and permissions pages at www.npr.org for further information.

NPR transcripts are created on a rush deadline by Verb8tm, Inc., an NPR contractor, and produced using a proprietary transcription process developed with NPR. This text may not be in its final form and may be updated or revised in the future. Accuracy and availability may vary. The authoritative record of NPR’s programming is the audio record.