TED- 盲目信仰大数据的时代必须结束
提示:点击↑上方"小芳老师"免费关注哦
演说者:CathyONeil
演说题目:盲目信仰大数据的时代必须结束!
算法决定谁会得到贷款,谁会得到工作面试,谁会得到保险等等—— 但它们不会自动使事情变得公平。身为 数学家兼数据科学家的凯西·奥尼尔为算法创造了一个术语,它们是秘密的、重要的和有害的:“杀伤性数学 武器”。https://v.qq.com/txp/iframe/player.html?vid=q05402c4aga&width=500&height=375&auto=0
中英对照演讲稿
Algorithms are everywhere. They sort and separate the winners from the losers. The winners get the job or a good credit card offer. The losers don't even get an interview or they pay more for insurance. We're being scored with secret formulas that we don't understand that often don't have systems of appeal. That begs the question: What if the algorithms are wrong?
算法无处不在。 他们把成功者和失败者区分开来。 成功者得到工 37 39771 37 14942 0 0 3895 0 0:00:10 0:00:03 0:00:07 3895作 或是一个很好的信用卡优惠计划。 失败者甚至连面试机会都没有, 或者要为保险付更多的钱。 我们被不理解的秘密公式打分, 却并没有上诉的渠道。 这引出了一个问题: 如果算法是错误的怎么办?
To build an algorithm you need two things: you need data, what happened in the past, and a definition of success, the thing you're looking for and often hoping for. You train an algorithm by looking, figuring out. The algorithm figures out what is associated with success. What situation leads to success?
构建一个算法需要两个要素: 需要数据,如过去发生的事情, 和成功的定义, 你正在寻找的,通常希望得到的东西。 你可以通过观察,理解来训练算法。 这种算法能找出与成功相关的因素。 什么情况意味着成功?
Actually, everyone uses algorithms. They just don't formalize them in written code. Let me give you an example. I use an algorithm every day to make a meal for my family. The data I use is the ingredients in my kitchen, the time I have, the ambition I have, and I curate that data. I don't count those little packages of ramen noodles as food.
其实,每个人都使用算法。 他们只是没有把它们写成书面代码。 举个例子。 我每天都用一种算法来 为我的家人做饭。 我使用的数据 就是我厨房里的原料, 我拥有的时间, 我的热情, 然后我整理了这些数据。 我不把那种小包拉面算作食物。
My definition of success is: a meal is successful if my kids eat vegetables. It's very different from if my youngest son were in charge. He'd say success is if he gets to eat lots of Nutella. But I get to choose success. I am in charge. My opinion matters. That's the first rule of algorithms.
我对成功的定义是: 如果我的孩子们肯吃蔬菜, 这顿饭就是成功的。 这和我最小的儿子 负责做饭时的情况有所不同。 他说,如果他能吃很多 Nutella巧克力榛子酱就是成功。 但我可以选择成功。 我负责。我的意见就很重要。 这就是算法的第一个规则。
Algorithms are opinions embedded in code. It's really different from what you think most people think of algorithms. They think algorithms are objective and true and scientific. That's a marketing trick. It's also a marketing trick to intimidate you with algorithms, to make you trust and fear algorithms because you trust and fear mathematics. A lot can go wrong when we put blind faith in big data.
算法是嵌入在代码中的观点。 这和你认为大多数人对 算法的看法是不同的。 他们认为算法是客观、真实和科学的。 那是一种营销技巧。 这也是一种用算法来 恐吓你的营销手段, 为了让你信任和恐惧算法 因为你信任并害怕数学。 当我们盲目信任大数据时, 很多人都可能犯错。
This is Kiri Soares. She's a high school principal in Brooklyn. In 2011, she told me her teachers were being scored with a complex, secret algorithm called the "value-added model." I told her, "Well, figure out what the formula is, show it to me. I'm going to explain it to you." She said, "Well, I tried to get the formula, but my Department of Education contact told me it was math and I wouldn't understand it."
这是凯丽·索尔斯。 她是布鲁克林的一名高中校长。 2011年,她告诉我, 她学校的老师们正在被一个复杂 并且隐秘的算法进行打分, 这个算法被称为“增值模型"。 我告诉她,“先弄清楚这个 公式是什么,然后给我看看。 我来给你解释一下。” 她说,“我寻求过这个公式, 但是教育部的负责人告诉我这是数学, 给我我也看不懂。”
It gets worse. The New York Post filed a Freedom of Information Act request, got all the teachers' names and all their scores and they published them as an act of teacher-shaming. When I tried to get the formulas, the source code, through the same means, I was told I couldn't. I was denied. I later found out that nobody in New York City had access to that formula. No one understood it. Then someone really smart got involved, Gary Rubinstein. He found 665 teachers from that New York Post data that actually had two scores. That could happen if they were teaching seventh grade math and eighth grade math. He decided to plot them. Each dot represents a teacher.
更糟的还在后面。 纽约邮报提出了“信息自由法”的要求, 来得到所有老师的名字与他们的分数, 并且他们以羞辱教师的方式 发表了这些数据。 当我试图用同样的方法来获取公式, 源代码的时候, 我被告知我没有权力这么做。 我被拒绝了。 后来我发现, 纽约市压根儿没有人能接触到这个公式。 没有人能看懂。 然后,一个非常聪明的人参与了, 加里·鲁宾斯坦。 他从纽约邮报的数据中 找到了665名教师, 实际上他们有两个分数。 如果他们同时教七年级与八年级的数学, 就会得到两个评分。 他决定把这些数据绘成图表。 每个点代表一个教师。
What is that?That should never have been used for individual assessment. It's almost a random number generator.But it was. This is Sarah Wysocki. She got fired, along with 205 other teachers, from the Washington, DC school district, even though she had great recommendations from her principal and the parents of her kids.
那是什么?它永远不应该被用于个人评估。 它几乎是一个随机数生成器。但它确实被使用了。 这是莎拉·维索斯基。 她连同另外205名教师被解雇了, 都是来自华盛顿特区的学区, 尽管她的校长还有学生的 父母都非常推荐她。
I know what a lot of you guys are thinking, especially the data scientists, the AI experts here. You're thinking, "Well, I would never make an algorithm that inconsistent." But algorithms can go wrong, even have deeply destructive effects with good intentions. And whereas an airplane that's designed badly crashes to the earth and everyone sees it, an algorithm designed badly can go on for a long time, silently wreaking havoc.This is Roger Ailes.
我知道你们很多人在想什么, 尤其是这里的数据科学家, 人工智能专家。 你在想,“我可永远不会做出 这样前后矛盾的算法。” 但是算法可能会出错, 即使有良好的意图, 也会产生毁灭性的影响。 每个人都能看到一架设计的 很糟糕的飞机会坠毁在地, 而一个设计糟糕的算法 可以持续很长一段时间, 并无声地造成破坏。这是罗杰·艾尔斯。
He founded Fox News in 1996. More than 20 women complained about sexual harassment. They said they weren't allowed to succeed at Fox News. He was ousted last year, but we've seen recently that the problems have persisted. That begs the question: What should Fox News do to turn over another leaf?
他在1996年创办了福克斯新闻。 公司有超过20多名女性曾抱怨过性骚扰。 她们说她们不被允许在 福克斯新闻有所成就。 他去年被赶下台,但我们最近看到 问题依然存在。 这引出了一个问题: 福克斯新闻应该做些什么改变?
Well, what if they replaced their hiring process with a machine-learning algorithm? That sounds good, right? Think about it. The data, what would the data be? A reasonable choice would be the last 21 years of applications to Fox News. Reasonable. What about the definition of success? Reasonable choice would be, well, who is successful at Fox News? I guess someone who, say, stayed there for four years and was promoted at least once. Sounds reasonable. And then the algorithm would be trained. It would be trained to look for people to learn what led to success, what kind of applications historically led to success by that definition. Now think about what would happen if we applied that to a current pool of applicants. It would filter out women because they do not look like people who were successful in the past.
如果他们用机器学习算法 取代传统的招聘流程呢? 听起来不错,对吧? 想想看。 数据,这些数据到底是什么? 福克斯新闻在过去21年的申请函 是一个合理的选择。 很合理。 那么成功的定义呢? 合理的选择将是, 谁在福克斯新闻取得了成功? 我猜的是,比如在那里呆了四年, 至少得到过一次晋升的人。 听起来很合理。 然后这个算法将会被训练。 它会被训练去向人们 学习是什么造就了成功, 什么样的申请函在过去拥有 这种成功的定义。 现在想想如果我们把它 应用到目前的申请者中会发生什么。 它会过滤掉女性, 因为她们看起来不像 在过去取得成功的人。
Algorithms don't make things fair if you just blithely, blindly apply algorithms. They don't make things fair. They repeat our past practices, our patterns. They automate the status quo. That would be great if we had a perfect world, but we don't. And I'll add that most companies don't have embarrassing lawsuits, but the data scientists in those companies are told to follow the data, to focus on accuracy. Think about what that means. Because we all have bias, it means they could be codifying sexism or any other kind of bigotry.
算法不会让事情变得公平, 如果你只是轻率地, 盲目地应用算法。 它们不会让事情变得公平。 它们只是重复我们过去的做法, 我们的规律。 它们使现状自动化。 如果我们有一个 完美的世界那就太好了, 但是我们没有。 我还要补充一点, 大多数公司都没有令人尴尬的诉讼, 但是这些公司的数据科学家 被告知要跟随数据, 关注它的准确性。 想想这意味着什么。 因为我们都有偏见, 这意味着他们可以编纂性别歧视 或者任何其他的偏见。
Thought experiment, because I like them: an entirely segregated society -- racially segregated, all towns, all neighborhoods and where we send the police only to the minority neighborhoods to look for crime. The arrest data would be very biased. What if, on top of that, we found the data scientists and paid the data scientists to predict where the next crime would occur? Minority neighborhood. Or to predict who the next criminal would be? A minority. The data scientists would brag about how great and how accurate their model would be, and they'd be right.
思维实验, 因为我喜欢它们: 一个完全隔离的社会—— 种族隔离存在于所有的城镇, 所有的社区, 我们把警察只送到少数族裔的社区 去寻找犯罪。 逮捕数据将会是十分有偏见的。 除此之外,我们还会寻找数据科学家 并付钱给他们来预测 下一起犯罪会发生在哪里? 少数族裔的社区。 或者预测下一个罪犯会是谁? 少数族裔。 这些数据科学家们 会吹嘘他们的模型有多好, 多精确, 当然他们是对的。
Now, reality isn't that drastic, but we do have severe segregations in many cities and towns, and we have plenty of evidence of biased policing and justice system data. And we actually do predict hotspots, places where crimes will occur. And we do predict, in fact, the individual criminality, the criminality of individuals. The news organization ProPublica recently looked into one of those "recidivism risk" algorithms, as they're called, being used in Florida during sentencing by judges. Bernard, on the left, the black man, was scored a 10 out of 10. Dylan, on the right, 3 out of 10. 10 out of 10, high risk. 3 out of 10, low risk. They were both brought in for drug possession. They both had records, but Dylan had a felony but Bernard didn't. This matters, because the higher score you are, the more likely you're being given a longer sentence.
不过现实并没有那么极端, 但我们确实在许多城市里 有严重的种族隔离, 并且我们有大量的证据表明 警察和司法系统的数据存有偏见。 而且我们确实预测过热点, 那些犯罪会发生的地方。 我们确实会预测个人犯罪, 个人的犯罪行为。 新闻机构“人民 (ProPublica)”最近调查了, 其中一个称为 “累犯风险”的算法。 并在佛罗里达州的 宣判期间被法官采用。 伯纳德,左边的那个黑人, 10分中得了满分。 在右边的迪伦, 10分中得了3分。 10分代表高风险。 3分代表低风险。 他们都因为持有毒品 而被带进了监狱。 他们都有犯罪记录, 但是迪伦有一个重罪 但伯纳德没有。 这很重要,因为你的分数越高, 你被判长期服刑的可能性就越大。
What's going on? Data laundering. It's a process by which technologists hide ugly truths inside black box algorithms and call them objective; call them meritocratic. When they're secret, important and destructive, I've coined a term for these algorithms: "weapons of math destruction."
到底发生了什么? 数据洗钱。 这是一个技术人员 把丑陋真相隐藏在 算法黑盒子中的过程, 并称之为客观; 称之为精英模式。 当它们是秘密的, 重要的并具有破坏性的, 我为这些算法创造了一个术语: “杀伤性数学武器”。
They're everywhere, and it's not a mistake. These are private companies building private algorithms for private ends. Even the ones I talked about for teachers and the public police, those were built by private companies and sold to the government institutions. They call it their "secret sauce" -- that's why they can't tell us about it. It's also private power. They are profiting for wielding the authority of the inscrutable. Now you might think, since all this stuff is private and there's competition, maybe the free market will solve this problem. It won't. There's a lot of money to be made in unfairness.
它们无处不在,也不是一个错误。 这些是私有公司为了私人目的 建立的私有算法。 甚至是我谈到的老师 与公共警察使用的(算法), 也都是由私人公司所打造的, 然后卖给政府机构。 他们称之为“秘密配方(来源)”—— 这就是他们不能告诉我们的原因。 这也是私人权力。 他们利用神秘莫测的权威来获利。 你可能会想,既然所有这些都是私有的 而且会有竞争, 也许自由市场会解决这个问题。 然而并不会。 在不公平的情况下, 有很多钱可以赚。
Also, we're not economic rational agents. We all are biased. We're all racist and bigoted in ways that we wish we weren't, in ways that we don't even know. We know this, though, in aggregate, because sociologists have consistently demonstrated this with these experiments they build, where they send a bunch of applications to jobs out, equally qualified but some have white-sounding names and some have black-sounding names, and it's always disappointing, the results -- always.
而且,我们不是经济理性的代理人。 我们都是有偏见的。 我们都是固执的种族主义者, 虽然我们希望我们不是, 虽然我们甚至没有意识到。 总的来说,我们知道这一点, 因为社会学家会一直通过这些实验 来证明这一点, 他们发送了大量的工作申请, 都是有同样资格的候选人, 有些用白人人名, 有些用黑人人名, 然而结果总是令人失望的。
So we are the ones that are biased, and we are injecting those biases into the algorithms by choosing what data to collect, like I chose not to think about ramen noodles -- I decided it was irrelevant. But by trusting the data that's actually picking up on past practices and by choosing the definition of success, how can we expect the algorithms to emerge unscathed? We can't. We have to check them. We have to check them for fairness.
所以我们是有偏见的, 我们还通过选择收集到的数据 来把偏见注入到算法中, 就像我不选择去想拉面一样—— 我自认为这无关紧要。 但是,通过信任那些 在过去的实践中获得的数据 以及通过选择成功的定义, 我们怎么能指望算法 会是毫无瑕疵的呢? 我们不能。我们必须检查。 我们必须检查它们是否公平。
The good news is, we can check them for fairness. Algorithms can be interrogated, and they will tell us the truth every time. And we can fix them. We can make them better. I call this an algorithmic audit, and I'll walk you through it.
好消息是,我们可以做到这一点。 算法是可以被审问的, 而且每次都能告诉我们真相。 然后我们可以修复它们。 我们可以让他们变得更好。 我把它叫做算法审计, 接下来我会为你们解释。
First, data integrity check. For the recidivism risk algorithm I talked about, a data integrity check would mean we'd have to come to terms with the fact that in the US, whites and blacks smoke pot at the same rate but blacks are far more likely to be arrested -- four or five times more likely, depending on the area. What is that bias looking like in other crime categories, and how do we account for it?
首先,数据的完整性检查。 对于刚才提到过的累犯风险算法, 数据的完整性检查将意味着 我们不得不接受这个事实, 在美国,白人和黑人 吸毒的比例是一样的, 但是黑人更有可能被逮捕—— 取决于区域,可能性是白人的4到5倍。 这种偏见在其他犯罪类别中 是什么样子的, 我们又该如何解释呢?
Second, we should think about the definition of success, audit that. Remember -- with the hiring algorithm? We talked about it. Someone who stays for four years and is promoted once? Well, that is a successful employee, but it's also an employee that is supported by their culture. That said, also it can be quite biased. We need to separate those two things. We should look to the blind orchestra audition as an example. That's where the people auditioning are behind a sheet. What I want to think about there is the people who are listening have decided what's important and they've decided what's not important, and they're not getting distracted by that. When the blind orchestra auditions started, the number of women in orchestras went up by a factor of five.
其次,我们应该考虑成功的定义, 审计它。 还记得我们谈论的雇佣算法吗? 那个呆了四年的人, 然后被提升了一次? 这的确是一个成功的员工, 但这也是一名受到公司文化支持的员工。 也就是说, 这可能会有很大的偏差。 我们需要把这两件事分开。 我们应该去看一下乐团盲选试奏, 举个例子。 这就是人们在幕后选拔乐手的地方。 我想要考虑的是 倾听的人已经 决定了什么是重要的, 同时他们已经决定了 什么是不重要的, 他们也不会因此而分心。 当乐团盲选开始时, 在管弦乐队中, 女性的数量上升了5倍。
Next, we have to consider accuracy. This is where the value-added model for teachers would fail immediately. No algorithm is perfect, of course, so we have to consider the errors of every algorithm. How often are there errors, and for whom does this model fail? What is the cost of that failure?
其次,我们必须考虑准确性。 这就是针对教师的增值模型 立刻失效的地方。 当然,没有一个算法是完美的, 所以我们要考虑每一个算法的误差。 出现错误的频率有多高, 让这个模型失败的对象是谁? 失败的代价是什么?
And finally, we have to consider the long-term effects of algorithms, the feedback loops that are engendering. That sounds abstract, but imagine if Facebook engineers had considered that before they decided to show us only things that our friends had posted.
最后,我们必须考虑 这个算法的长期效果, 与正在产生的反馈循环。 这听起来很抽象, 但是想象一下 如果脸书的工程师们之前考虑过, 并决定只向我们展示 我们朋友所发布的东西。
I have two more messages, one for the data scientists out there. Data scientists: we should not be the arbiters of truth. We should be translators of ethical discussions that happen in larger society.And the rest of you, the non-data scientists: this is not a math test. This is a political fight. We need to demand accountability for our algorithmic overlords.The era of blind faith in big data must end.Thank you very much.
我还有两条建议, 一条是给数据科学家的。 数据科学家们:我们不应该 成为真相的仲裁者。 我们应该成为大社会中 所发生的道德讨论的 翻译者。然后剩下的人, 非数据科学家们: 这不是一个数学测试。 这是一场政治斗争。 我们应该要求我们的 算法霸主承担问责。盲目信仰大数据的时代必须结束。非常感谢。
学习型公众号:值得你关注
点击关键词获取福利
动画 | 10部美剧 | 专八全科 | 新概念 | 老友记 | 动画2
89奥斯卡 | 专四 | 英语智力竞赛 | 美语发音| BEC | PPT模板
关注小芳老师
听哈利波特,看TED视频
转发和点赞是我持续更新的动力
资源来自网络,如果有侵权,即刻删除!