
外文书中译对照5分钟机器翻译
基于DeepL
支持多语互译:英德法中俄日西等
支持多种格式:Epub、Mobi、PDF等
支持支付:国际信用卡、PayPal、MixPay
上传


Subscribe Telegram(@libmind), and receive new bilingual books: https://t.me/libmind
关注Telegram(@libmind),接收最新双语书资讯:https://t.me/libmind
Thinking, Fast and Slow
思考,快与慢
Daniel Kahneman
丹尼尔-卡尼曼
THINKING, FAST AND SLOW
思维,快速和缓慢
Contents
内容
INTRODUCTION
简介
PART I. TWO SYSTEMS
第一部分:两个系统
1. THE CHARACTERS OF THE STORY
1.故事中的人物
2. ATTENTION AND EFFORT
2. 注意力和努力
3. THE LAZY CONTROLLER
3.懒惰的控制器
4. THE ASSOCIATIVE MACHINE
4.关联机
5. COGNITIVE EASE
5.认知上的轻松
6. NORMS, SURPRISES, AND CAUSES
6.规范、意外和原因
7. A MACHINE FOR JUMPING TO CONCLUSIONS
7. 妄下结论的机器
8. HOW JUDGMENTS HAPPEN
8. 判决是如何发生 的
9. ANSWERING AN EASIER QUESTION
9. 回答一个较容易的问题
PART II. HEURISTICS AND BIASES
第二部分。启发式方法和偏见
10. THE LAW OF SMALL NUMBERS
10.小数法则
11. ANCHORS
11.锚定器
12. THE SCIENCE OF AVAILABILITY
12. 可用性的科学
13. AVAILABILITY, EMOTION, AND RISK
13.可用性、情感和风险
14. TOM W’S SPECIALTY
14.汤姆-W的专长
15. LINDA: LESS IS MORE
15.林达:少即是多
16. CAUSES TRUMP STATISTICS
16.原因胜于统计
17. REGRESSION TO THE MEAN
17.向平均数回归
18. TAMING INTUITIVE PREDICTIONS
18. 驯服直观的预测
PART III. OVERCONFIDENCE
第三部分。过渡性
19. THE ILLUSION OF UNDERSTANDING
19.理解的幻觉
20. THE ILLUSION OF VALIDITY
20. 有效性的幻觉
21. INTUITIONS VS. FORMULAS
21.直觉与公式
22. EXPERT INTUITION: WHEN CAN WE TRUST IT?
22.专家的直觉:我们什么时候可以相信它?
23. THE OUTSIDE VIEW
23.外面的风景
24. THE ENGINE OF CAPITALISM
24.资本主义的发动机
PART IV. CHOICES
第四部分:选择
25. BERNOULLI’S ERRORS
25. 伯努利的错误
26. PROSPECT THEORY
26. 前景理论
27. THE ENDOWMENT EFFECT
27. 禀赋效应
28. BAD EVENTS
28.糟糕的事件
29. THE FOURFOLD PATTERN
29. 四个方面的模式
30. RARE EVENTS
30.罕见事件
31. RISK POLICIES
31. 风险政策
32. KEEPING SCORE
32.保持分数
33. REVERSALS
33.逆转
34. FRAMES AND REALITY
34. 框架与现实
PART V. TWO SELVES
第五部分:两个自我
35. TWO SELVES
35.两个自己
36. LIFE AS A STORY
36.生活是一个故事
37. EXPERIENCED WELL-BEING
37.经历过的幸福感
38. THINKING ABOUT LIFE
38.对生活的思考
CONCLUSIONS
结论
APPENDIX A: JUDGMENT UNDER UNCERTAINTY
附录A:不确定情况下的判断
APPENDIX B: CHOICES, VALUES, AND FRAMES
附录B:选择、价值和框架
NOTES
注意事项
ACKNOWLEDGMENTS
鸣谢
FOLLOW PENGUIN
跟随鹏程
PENGUIN BOOKS
鹏程书院
THINKING, FAST AND SLOW
思维,快速和缓慢
Daniel Kahneman is a Senior Scholar at Princeton University, and Emeritus Professor of Public Affairs, Woodrow Wilson School of Public and International Affairs. He was awarded the Nobel Prize in Economics in 2002.
Daniel Kahneman是普林斯顿大学的高级学者,也是伍德罗-威尔逊公共和国际事务学院的公共事务荣誉教授。他于2002年获得诺贝尔经济学奖。
In memory of Amos Tversky
纪念 ,阿莫斯-特维尔斯基 。
Introduction
简介
Every author, I suppose, has in mind a setting in which readers of his or her work could benefit from having read it. Mine is the proverbial office water-cooler, where opinions are shared and gossip is exchanged. I hope to enrich the vocabulary that people use when they talk about the judgments and choices of others, the company’s new policies, or a colleague’s investment decisions. Why be concerned with gossip? Because it is much easier, as well as far more enjoyable, to identify and label the mistakes of others than to recognize our own. Questioning what we believe and want is difficult at the best of times, and especially difficult when we most need to do it, but we can benefit from the informed opinions of others. Many of us spontaneously anticipate how friends and colleagues will evaluate our choices; the quality and content of these anticipated judgments therefore matters. The expectation of intelligent gossip is a powerful motive for serious self-criticism, more powerful than New Year resolutions to improve one’s decision making at work and at home.
我想,每一位 的作者,心中都有一个环境,让他或她的作品的读者能够从阅读中受益。我希望能丰富人们在谈论他人的判断和选择、公司的新政策或同事的投资决定时使用的词汇。 ,为什么要关注八卦?因为识别和标示别人的错误比认识自己的错误要容易得多,也愉快得多。 在最好的时候,质疑我们所相信和想要的东西是很困难的,在我们最需要这样做的时候尤其困难,但我们可以从别人的知情意见中受益。我们中的许多人自发地预期朋友和同事 ,将如何评价我们的选择;因此这些预期判断的质量和内容很重要。对智能八卦的期待是严肃自我批评的强大动机,比改善自己在工作和家庭中的决策的新年决议更有力。
To be a good diagnostician, a physician needs to acquire a large set of labels for diseases, each of which binds an idea of the illness and its symptoms, possible antecedents and causes, possible developments and consequences, and possible interventions to cure or mitigate the illness. Learning medicine consists in part of learning the language of medicine. A deeper understanding of judgments and choices also requires a richer vocabulary than is available in everyday language. The hope for informed gossip is that there are distinctive patterns in the errors people make. Systematic errors are known as biases, and they recur predictably in particular circumstances. When the handsome and confident speaker bounds onto the stage, for example, you can anticipate that the audience will judge his comments more favorably than he deserves. The availability of a diagnostic label for this bias—the halo effect—makes it easier to anticipate, recognize, and understand.
为了成为一名优秀的诊断师,医生需要掌握一大套疾病的标签,每一个标签都绑定了 ,对疾病及其症状、可能的前因和原因、可能的发展和后果以及可能的干预措施来治愈或减轻疾病的想法。 学习医学部分包括学习医学的语言。对判断和选择的深入理解也需要比日常语言更丰富的词汇。对知情的流言蜚语的希望是: ,在人们所犯的错误中存在着独特的模式。系统性错误 被称为偏见,它们在特定情况下可预测地重复出现。例如,当英俊而自信的演讲者跳上舞台时,你可以预见到观众会对他的评论作出比他应得的更有利的评价。有了这种偏见的诊断标签--光环效应, ,就更容易预测、识别和理解。
When you are asked what you are thinking about, you can normally answer. You believe you know what goes on in your mind, which often consists of one conscious thought leading in an orderly way to another. But that is not the only way the mind works, nor indeed is that the typical way. Most impressions and thoughts arise in your conscious experience without your knowing how they got there. You cannot trace how you came to the belief that there is a lamp on the desk in front of you, or how you detected a hint of irritation in your spouse’s voice on the telephone, or how you managed to avoid a threat on the road before you became consciously aware of it. The mental work that produces impressions, intuitions, and many decisions goes on in silence in our mind.
当你被问及你在想什么时,你通常可以回答。 你相信你知道你的头脑中发生了什么,这往往是由一个有意识的想法以一种有序的方式引导到另一个。但这并不是心灵工作的唯一方式,实际上也不是典型的方式。大多数印象和想法都出现在你的意识经验中, ,而你并不知道它们是如何到达的。你无法追踪你是如何相信你面前的桌子上有一盏灯的,或者你是如何在电话中察觉到你配偶声音中的一丝烦躁的,或者在你有意识地意识到之前,你是如何设法避开路上的威胁的。 产生印象、直觉和许多决定的心理工作在我们的头脑中 沉默地进行。
Much of the discussion in this book is about biases of intuition. However, the focus on error does not denigrate human intelligence, any more than the attention to diseases in medical texts denies good health. Most of us are healthy most of the time, and most of our judgments and actions are appropriate most of the time. As we navigate our lives, we normally allow ourselves to be guided by impressions and feelings, and the confidence we have in our intuitive beliefs and preferences is usually justified. But not always. We are often confident even when we are wrong, and an objective observer is more likely to detect our errors than we are.
本书中的大部分讨论都是关于直觉的偏见。 然而,对错误的关注并没有诋毁人类的智慧,就像医学著作中对疾病的关注否认了健康一样。 我们大多数人在大多数时候都是健康的,我们的大多数判断和行动在大多数时候都是合适的。在我们的生活中,我们通常允许自己 ,以印象和感觉为指导,我们对自己的直觉信念和偏好的信心通常是合理的。 但并不总是如此。我们往往在错误的时候也很自信,而客观的观察者比我们更容易发现我们的错误。
So this is my aim for watercooler conversations: improve the ability to identify and understand errors of judgment and choice, in others and eventually in ourselves, by providing a richer and more precise language to discuss them. In at least some cases, an accurate diagnosis may suggest an intervention to limit the damage that bad judgments and choices often cause.
因此,这是我在水杯对话中的目标:通过提供更丰富、更精确的语言来讨论,提高识别和理解判断和选择错误的能力, ,最终在我们自己身上。 至少在某些情况下,准确的诊断可能会建议进行干预,以限制错误判断和选择经常造成的损害。
ORIGINS
创始者
This book presents my current understanding of judgment and decision making, which has been shaped by psychological discoveries of recent decades. However, I trace the central ideas to the lucky day in 1969 when I asked a colleague to speak as a guest to a seminar I was teaching in the Department of Psychology at the Hebrew University of Jerusalem. Amos Tversky was considered a rising star in the field of decision research—indeed, in anything he did—so I knew we would have an interesting time. Many people who knew Amos thought he was the most intelligent person they had ever met. He was brilliant, voluble, and charismatic. He was also blessed with a perfect memory for jokes and an exceptional ability to use them to make a point. There was never a dull moment when Amos was around. He was then thirty-two; I was thirty-five.
本书介绍了我目前对判断和决策的理解,这种理解是由近几十年来的心理学发现形成的。 然而,我把中心思想追溯到1969年的幸运日,当时我请一位同事作为嘉宾在我在耶路撒冷希伯来大学心理学系 的研讨会上发言。Amos Tversky被认为是决策研究领域的一颗新星--事实上,在他所做的任何事情上--所以我知道我们会有一段有趣的时间。 许多认识Amos的人认为他是他们所见过的最聪明的人 。他才华横溢,侃侃而谈,魅力十足。他还得天独厚地拥有完美的笑话记忆力,以及利用笑话来表达观点的非凡能力。当阿莫斯在身边的时候,从来没有一个沉闷的时刻。 他当时是32岁,我是35岁。
Amos told the class about an ongoing program of research at the University of Michigan that sought to answer this question: Are people good intuitive statisticians? We already knew that people are good intuitive grammarians: at age four a child effortlessly conforms to the rules of grammar as she speaks, although she has no idea that such rules exist. Do people have a similar intuitive feel for the basic principles of statistics? Amos reported that the answer was a qualified yes. We had a lively debate in the seminar and ultimately concluded that a qualified no was a better answer.
阿莫斯在课堂上讲述了密歇根大学正在进行的一项研究计划,该计划试图回答 这个问题:人们是好的直觉统计学家吗?我们已经知道人们是好的直觉语法学家:四岁的孩子在说话时毫不费力地符合语法规则,尽管她不知道这种规则存在。人们对统计学的基本原理有类似的直观感受吗?阿莫斯报告说,答案是有条件的肯定。我们在研讨会上进行了热烈的辩论 ,最终得出结论,有条件的否定是更好的答案。
Amos and I enjoyed the exchange and concluded that intuitive statistics was an interesting topic and that it would be fun to explore it together. That Friday we met for lunch at Café Rimon, the favorite hangout of bohemians and professors in Jerusalem, and planned a study of the statistical intuitions of sophisticated researchers. We had concluded in the seminar that our own intuitions were deficient. In spite of years of teaching and using statistics, we had not developed an intuitive sense of the reliability of statistical results observed in small samples. Our subjective judgments were biased: we were far too willing to believe research findings based on inadequate evidence and prone to collect too few observations in our own research. The goal of our study was to examine whether other researchers suffered from the same affliction.
阿莫斯和我很喜欢这种交流,并得出结论,直觉统计是一个有趣的话题,一起探讨它将会很有趣。那个星期五,我们在里蒙咖啡馆(Café Rimon)共进午餐,这是耶路撒冷的波希米亚人和教授们最喜欢去的地方,并计划对复杂的 研究人员的统计直觉进行研究。我们在研讨会上得出结论,我们自己的直觉是有缺陷的。尽管多年来一直在教授和使用统计学,但我们对在小样本中观察到的统计结果的可靠性并没有形成一种直观的感觉。我们的主观判断是有偏见的:我们太愿意相信基于不充分证据的研究结果,并 容易 在自己的研究中 收集太少的观察 。我们研究的目的是检查其他研究人员是否遭受同样的困扰。
We prepared a survey that included realistic scenarios of statistical issues that arise in research. Amos collected the responses of a group of expert participants in a meeting of the Society of Mathematical Psychology, including the authors of two statistical textbooks. As expected, we found that our expert colleagues, like us, greatly exaggerated the likelihood that the original result of an experiment would be successfully replicated even with a small sample. They also gave very poor advice to a fictitious graduate student about the number of observations she needed to collect. Even statisticians were not good intuitive statisticians.
我们准备了一份调查,其中包括研究中出现的统计问题的现实情景。 阿莫斯收集了一组参加数学心理学会会议的专家的答复,其中包括两本统计学教科书的作者。正如预期的那样, ,我们发现我们的专家同事和我们一样,大大夸大了实验的原始结果被成功复制的可能性,即使是小样本。他们也给一个虚构的研究生提供了非常糟糕的建议,告诉她需要收集的观测数据的数量。 即使是统计学家也不是好的直觉统计学家。
While writing the article that reported these findings, Amos and I discovered that we enjoyed working together. Amos was always very funny, and in his presence I became funny as well, so we spent hours of solid work in continuous amusement. The pleasure we found in working together made us exceptionally patient; it is much easier to strive for perfection when you are never bored. Perhaps most important, we checked our critical weapons at the door. Both Amos and I were critical and argumentative, he even more than I, but during the years of our collaboration neither of us ever rejected out of hand anything the other said. Indeed, one of the great joys I found in the collaboration was that Amos frequently saw the point of my vague ideas much more clearly than I did. Amos was the more logical thinker, with an orientation to theory and an unfailing sense of direction. I was more intuitive and rooted in the psychology of perception, from which we borrowed many ideas. We were sufficiently similar to understand each other easily, and sufficiently different to surprise each other. We developed a routine in which we spent much of our working days together, often on long walks. For the next fourteen years our collaboration was the focus of our lives, and the work we did together during those years was the best either of us ever did.
在撰写报道 这些发现的文章时,阿莫斯和我发现,我们很喜欢一起工作。阿莫斯总是非常有趣, ,在他面前,我也变得很有趣,所以我们在不断的娱乐中度过了几个小时的坚实工作。我们在一起工作时发现的乐趣使我们特别有耐心;当你从不感到厌烦时,就更容易争取完美。也许最重要的是,我们在门口检查了我们的重要武器 。阿摩司和我都很挑剔,也很爱争论,他甚至比我更爱争论,但在我们合作的这些年里,我们谁也没有断然拒绝对方说的任何话。 事实上,我在合作中发现的最大乐趣之一是,阿摩司经常比我更清楚地看到我模糊想法的重点。阿莫斯是更有逻辑性的思考者,他对理论 ,并有坚定的方向感。我更有直觉,扎根于感知心理学,我们从那里借用了许多想法。 我们有足够的相似性,容易理解对方,也有足够的不同,使对方感到惊讶。 我们形成了一种惯例,我们的大部分工作日都在一起,经常长途跋涉。在接下来的十四年里,我们的合作是 ,是我们生活的重点,我们在这些年里一起做的工作是我们中任何一个人做得最好的。
We quickly adopted a practice that we maintained for many years. Our research was a conversation, in which we invented questions and jointly examined our intuitive answers. Each question was a small experiment, and we carried out many experiments in a single day. We were not seriously looking for the correct answer to the statistical questions we posed. Our aim was to identify and analyze the intuitive answer, the first one that came to mind, the one we were tempted to make even when we knew it to be wrong. We believed—correctly, as it happened—that any intuition that the two of us shared would be shared by many other people as well, and that it would be easy to demonstrate its effects on judgments.
我们很快就采用了一种我们保持了多年的做法。 我们的研究是一种对话,在对话中我们发明了一些问题,并共同研究了我们的直觉答案。 每个问题都是一个小实验,我们在一天内进行了许多实验。 我们并没有认真寻找我们提出的统计问题的正确答案, 。我们的目的是识别和分析直觉的答案,即第一个出现在脑海中的答案,即使我们知道它是错误的,我们也想做出这样的答案。 我们相信--正确地相信--我们两个人共享的任何直觉也会被其他许多人共享,而且很容易证明它对判断的影响 。
We once discovered with great delight that we had identical silly ideas about the future professions of several toddlers we both knew. We could identify the argumentative three-year-old lawyer, the nerdy professor, the empathetic and mildly intrusive psychotherapist. Of course these predictions were absurd, but we still found them appealing. It was also clear that our intuitions were governed by the resemblance of each child to the cultural stereotype of a profession. The amusing exercise helped us develop a theory that was emerging in our minds at the time, about the role of resemblance in predictions. We went on to test and elaborate that theory in dozens of experiments, as in the following example.
我们曾经非常高兴地发现,对于我们都认识的几个幼儿的未来职业,我们有相同的愚蠢想法。 我们可以识别出善于争论的三岁律师、书呆子教授、有同情心和轻微侵入性的心理治疗师。当然这些预测是荒谬的,但我们仍然觉得它们很有吸引力。同样明显的是,我们的直觉 ,受每个孩子与某一职业的文化定型的相似性所支配。这个有趣的练习帮助我们发展了当时在我们头脑中出现的一个理论,即关于相似性在预测中的作用。 我们继续在几十个实验中测试和阐述这个理论,如下面的例子。
As you consider the next question, please assume that Steve was selected at random from a representative sample:
当你考虑下一个问题时,请假设史蒂夫是从一个有代表性的样本中随机抽取的 。
An individual has been described by a neighbor as follows: “Steve is very shy and withdrawn, invariably helpful but with little interest in people or in the world of reality. A meek and tidy soul, he has a need for order and structure, and a passion for detail.” Is Steve more likely to be a librarian or a farmer?
一个邻居曾这样描述个人 :"史蒂夫非常害羞和孤僻,总是乐于助人,但对人或现实世界没有什么兴趣。 他是一个温顺整洁的灵魂,对秩序和结构有需求,对细节有热情。"史蒂夫更有可能成为一名图书管理员还是一名农民?
The resemblance of Steve’s personality to that of a stereotypical librarian strikes everyone immediately, but equally relevant statistical considerations are almost always ignored. Did it occur to you that there are more than 20 male farmers for each male librarian in the United States? Because there are so many more farmers, it is almost certain that more “meek and tidy” souls will be found on tractors than at library information desks. However, we found that participants in our experiments ignored the relevant statistical facts and relied exclusively on resemblance. We proposed that they used resemblance as a simplifying heuristic (roughly, a rule of thumb) to make a difficult judgment. The reliance on the heuristic caused predictable biases (systematic errors) in their predictions.
史蒂夫的个性 ,与刻板印象中的图书管理员的个性相似,这让每个人都立即感到震惊,但同样相关的统计考虑几乎总是被忽略。你有没有想过,在美国,每一个男性图书管理员都有超过20个男性农民?因为有这么多的农民,几乎可以肯定的是,在拖拉机上发现的 "温顺和整洁 "的灵魂会比在图书馆信息台发现的多。 然而,我们发现,在我们的实验中,参与者忽略了相关的统计事实,完全依赖相似性。 我们提出,他们把相似性作为一种简化的启发式方法(大致是一种经验法则)来进行困难的判断。 对启发式方法的依赖导致他们预测中可预测的偏差(系统错误)。
On another occasion, Amos and I wondered about the rate of divorce among professors in our university. We noticed that the question triggered a search of memory for divorced professors we knew or knew about, and that we judged the size of categories by the ease with which instances came to mind. We called this reliance on the ease of memory search the availability heuristic. In one of our studies, we asked participants to answer a simple question about words in a typical English text:
还有一次,阿莫斯和我想知道 ,关于我们大学教授的离婚率。我们注意到,这个问题引发了对我们认识的或知道的离婚教授的记忆搜索,并且我们通过实例出现在脑海中的难易程度来判断类别的大小。 我们把这种对记忆搜索难易程度的依赖称为可得性启发式。在我们的一项研究中,我们要求参与者回答一个简单的 ,关于 典型英语文本中的 单词 。
Consider the letter K .
考虑一下字母 K 。
Is K more likely to appear as the first letter in a word OR as the third letter?
K 更有可能作为一个词的第一个字母出现,还是作为第三个字母出现?
As any Scrabble player knows, it is much easier to come up with words that begin with a particular letter than to find words that have the same letter in the third position. This is true for every letter of the alphabet. We therefore expected respondents to exaggerate the frequency of letters appearing in the first position—even those letters (such as K , L , N , R , V ) which in fact occur more frequently in the third position. Here again, the reliance on a heuristic produces a predictable bias in judgments. For example, I recently came to doubt my long-held impression that adultery is more common among politicians than among physicians or lawyers. I had even come up with explanations for that “fact,” including the aphrodisiac effect of power and the temptations of life away from home. I eventually realized that the transgressions of politicians are much more likely to be reported than the transgressions of lawyers and doctors. My intuitive impression could be due entirely to journalists’ choices of topics and to my reliance on the availability heuristic.
任何拼字游戏玩家都知道,想出以某一字母开头的单词要比找到在第三位置有相同字母的单词容易得多。 这对字母表的每一个字母都是如此。因此,我们预计 受访者会夸大出现在第一位置的字母的频率--即使是那些实际上在第三位置出现频率更高的字母(如 K 、 L 、 N 、 R 、 V )。在这里,对启发式方法的依赖再次产生了可预测的判断偏差。例如,我最近开始怀疑我长期以来的印象,即通奸在政治家中比在医生中更常见 或律师。我甚至为这个 "事实 "想出了一些解释,包括权力的催情作用和离家生活的诱惑力。我最终意识到,政治家的违法行为比律师和医生的违法行为 ,更容易被报道。我的直觉印象可能完全是由于记者对主题的选择和我对可用性启发式的依赖 。
Amos and I spent several years studying and documenting biases of intuitive thinking in various tasks—assigning probabilities to events, forecasting the future, assessing hypotheses, and estimating frequencies. In the fifth year of our collaboration, we presented our main findings in Science magazine, a publication read by scholars in many disciplines. The article (which is reproduced in full at the end of this book) was titled “Judgment Under Uncertainty: Heuristics and Biases.” It described the simplifying shortcuts of intuitive thinking and explained some 20 biases as manifestations of these heuristics—and also as demonstrations of the role of heuristics in judgment.
阿莫斯和我花了几年时间研究和记录各种任务中直觉思维的偏差--为事件分配概率、预测未来、评估假设和估计频率。 在我们合作的第五年,我们在《 科学 》杂志上介绍了我们的主要发现,这是一份有许多学科的学者阅读的出版物。这篇文章 (全文转载于本书末尾),标题是 "不确定情况下的判断:启发式和偏见"。它描述了直觉思维的简化捷径,并解释了作为这些启发式思维的表现形式的大约20种偏见--也是启发式思维在判断中的作用的证明。
Historians of science have often noted that at any given time scholars in a particular field tend to share basic assumptions about their subject. Social scientists are no exception; they rely on a view of human nature that provides the background of most discussions of specific behaviors but is rarely questioned. Social scientists in the 1970s broadly accepted two ideas about human nature. First, people are generally rational, and their thinking is normally sound. Second, emotions such as fear, affection, and hatred explain most of the occasions on which people depart from rationality. Our article challenged both assumptions without discussing them directly. We documented systematic errors in the thinking of normal people, and we traced these errors to the design of the machinery of cognition rather than to the corruption of thought by emotion.
科学史家们经常注意到,在任何特定的时间, 领域的学者们都倾向于分享关于其主题的基本假设。社会科学家也不例外;他们依赖一种关于人性的观点,这种观点为大多数关于具体行为的讨论提供了背景,但很少受到质疑。 1970年代的社会科学家广泛接受关于人性的两种观点。首先,人们通常是理性的,他们的思维通常是健全的。其次,情感 ,如恐惧、感情和仇恨,解释了人们偏离理性的大多数场合。 我们的文章挑战了这两个假设,但没有直接讨论它们。 我们记录了正常人思维中的系统性错误,我们将这些错误追溯到认知机制的设计,而不是情感对思维的腐蚀。
Our article attracted much more attention than we had expected, and it remains one of the most highly cited works in social science (more than three hundred scholarly articles referred to it in 2010). Scholars in other disciplines found it useful, and the ideas of heuristics and biases have been used productively in many fields, including medical diagnosis, legal judgment, intelligence analysis, philosophy, finance, statistics, and military strategy.
我们的文章吸引了比我们预期更多的 ,它仍然是社会科学中被引用次数最多的作品之一(2010年有三百多篇学术文章提到了它)。其他学科的学者发现它很有用,启发式方法和偏见的思想已经被有效地用于许多领域,包括医学诊断、法律判断、情报分析、哲学、金融、统计、 和军事战略。
For example, students of policy have noted that the availability heuristic helps explain why some issues are highly salient in the public’s mind while others are neglected. People tend to assess the relative importance of issues by the ease with which they are retrieved from memory—and this is largely determined by the extent of coverage in the media. Frequently mentioned topics populate the mind even as others slip away from awareness. In turn, what the media choose to report corresponds to their view of what is currently on the public’s mind. It is no accident that authoritarian regimes exert substantial pressure on independent media. Because public interest is most easily aroused by dramatic events and by celebrities, media feeding frenzies are common. For several weeks after Michael Jackson’s death, for example, it was virtually impossible to find a television channel reporting on another topic. In contrast, there is little coverage of critical but unexciting issues that provide less drama, such as declining educational standards or overinvestment of medical resources in the last year of life. (As I write this, I notice that my choice of “little-covered” examples was guided by availability. The topics I chose as examples are mentioned often; equally important issues that are less available did not come to my mind.)
例如,研究政策的学生已经注意到,可用性启发式有助于解释为什么有些问题在公众心目中非常突出,而其他问题却被忽视。 人们倾向于通过从记忆中检索问题的难易程度来评估问题的相对重要性--这主要由媒体的报道程度决定。经常被提及的 ,甚至当其他话题从人们的意识中溜走时,媒体选择报道的内容与他们对当前公众心目中 的看法相一致。专制政权对独立媒体施加巨大压力并非偶然。 由于公众的兴趣最容易被戏剧性事件和名人所激发,媒体的狂欢很常见。例如,在迈克尔-杰克逊去世后的几个 ,几乎不可能找到一个电视频道来报道其他话题。相比之下,对那些关键但不刺激的问题,如教育标准的下降或生命最后一年医疗资源的过度投资,却鲜有报道。(当我写到这里时,我注意到我选择 "鲜有报道 "的 例子是由可用性引导的。我选择的主题作为例子经常被提及;同样重要的问题,却没有出现在我的脑海中)。
We did not fully realize it at the time, but a key reason for the broad appeal of “heuristics and biases” outside psychology was an incidental feature of our work: we almost always included in our articles the full text of the questions we had asked ourselves and our respondents. These questions served as demonstrations for the reader, allowing him to recognize how his own thinking was tripped up by cognitive biases. I hope you had such an experience as you read the question about Steve the librarian, which was intended to help you appreciate the power of resemblance as a cue to probability and to see how easy it is to ignore relevant statistical facts.
我们当时并没有完全意识到这一点,但 "启发式方法和偏见 "在心理学之外具有广泛吸引力的一个关键原因是我们工作中的一个偶然特征:我们几乎总是在文章中包括我们向自己和受访者提出的问题的全文 。我希望你在阅读关于图书管理员史蒂夫的问题时有这样的体验,该问题旨在帮助你体会相似性作为概率线索的力量,并看到忽视 相关的统计事实是多么容易。
The use of demonstrations provided scholars from diverse disciplines—notably philosophers and economists—an unusual opportunity to observe possible flaws in their own thinking. Having seen themselves fail, they became more likely to question the dogmatic assumption, prevalent at the time, that the human mind is rational and logical. The choice of method was crucial: if we had reported results of only conventional experiments, the article would have been less noteworthy and less memorable. Furthermore, skeptical readers would have distanced themselves from the results by attributing judgment errors to the familiar fecklessness of undergraduates, the typical participants in psychological studies. Of course, we did not choose demonstrations over standard experiments because we wanted to influence philosophers and economists. We preferred demonstrations because they were more fun, and we were lucky in our choice of method as well as in many other ways. A recurrent theme of this book is that luck plays a large role in every story of success; it is almost always easy to identify a small change in the story that would have turned a remarkable achievement into a mediocre outcome. Our story was no exception.
示范的使用为来自不同学科的学者--特别是哲学家和经济学家--提供了一个不寻常的机会,观察他们自己思维中可能存在的缺陷。在看到自己失败后,他们变得更有可能质疑当时盛行的教条式假设,即人类的思维是理性和逻辑的。方法的选择至关重要: ,如果我们只报告常规实验的结果,文章就不会那么值得注意,也不会那么令人难忘。此外,持怀疑态度的读者会把判断错误归咎于大学生熟悉的无精打采,也就是心理学研究的典型参与者,从而与结果保持距离。当然,我们选择演示而不是标准实验 ,因为我们想影响哲学家和经济学家。我们更喜欢演示,因为它们更有趣,而且我们在选择方法以及其他许多方面都很幸运。 本书的一个反复出现的主题是,运气在每个成功的故事中都起着很大的作用;几乎总是很容易找出故事中的一个小变化,而这个变化会把一个了不起的成就变成 一个平凡的结果。 我们的故事也不例外。
The reaction to our work was not uniformly positive. In particular, our focus on biases was criticized as suggesting an unfairly negative view of the mind . As expected in normal science, some investigators refined our ideas and others offered plausible alternatives . By and large, though, the idea that our minds are susceptible to systematic errors is now generally accepted. Our research on judgment had far more effect on social science than we thought possible when we were working on it.
人们对我们的工作的反应并不一致,特别是我们对偏见的关注被批评为暗示了 对心灵的 不公平的 负面看法 。正如在正常的科学中预期的那样 ,一些研究者完善了我们的想法,另一些则 提供了合理的替代方案 。但总的来说,我们的心灵容易受到系统性错误的影响 ,这一观点现在已被普遍接受。 我们对判断的研究对社会科学的影响远远超过我们在从事这项研究时的想象。
Immediately after completing our review of judgment, we switched our attention to decision making under uncertainty. Our goal was to develop a psychological theory of how people make decisions about simple gambles. For example: Would you accept a bet on the toss of a coin where you win $130 if the coin shows heads and lose $100 if it shows tails? These elementary choices had long been used to examine broad questions about decision making, such as the relative weight that people assign to sure things and to uncertain outcomes. Our method did not change: we spent many days making up choice problems and examining whether our intuitive preferences conformed to the logic of choice. Here again, as in judgment, we observed systematic biases in our own decisions, intuitive preferences that consistently violated the rules of rational choice. Five years after the Science article, we published “Prospect Theory: An Analysis of Decision Under Risk,” a theory of choice that is by some counts more influential than our work on judgment, and is one of the foundations of behavioral economics.
在完成对判断力的审查后,我们立即将注意力转移到不确定性下的决策。例如:你会接受在 抛硬币的赌注,如果硬币显示为正面,你将赢得130美元,如果显示为反面,你将失去100美元。这些基本的选择长期以来一直被用来研究有关决策的广泛问题,例如人们对确定的事情和不确定的结果所赋予的相对权重。 我们的方法没有改变:我们花了很多天时间来编造选择问题,并检查我们的直觉偏好 ,是否符合选择逻辑。在这里,正如在判断中一样,我们在自己的决定中观察到了系统性的偏见,即一贯违反理性选择规则的直觉偏好。 在 《科学》 一文发表五年后,我们发表了《前景理论:风险下的决策分析》,这一选择理论在某些方面比我们关于判断的工作更有影响力,是行为经济学基础的一个 。
Until geographical separation made it too difficult to go on, Amos and I enjoyed the extraordinary good fortune of a shared mind that was superior to our individual minds and of a relationship that made our work fun as well as productive. Our collaboration on judgment and decision making was the reason for the Nobel Prize that I received in 2002, which Amos would have shared had he not died, aged fifty-nine, in 1996.
在地理上的分离使我们难以继续下去之前,阿莫斯和我享受着非同寻常的好运,我们共同的思想优于我们个人的思想,我们的关系使我们的工作既有趣又富有成效。 我们在判断和决策方面的合作是 我 在2002年 获得诺贝尔奖 的原因,如果阿莫斯没有在1996年去世,享年五十九岁,他本可以分享这个奖项。
WHERE WE ARE NOW
我们现在的位置
This book is not intended as an exposition of the early research that Amos and I conducted together, a task that has been ably carried out by many authors over the years. My main aim here is to present a view of how the mind works that draws on recent developments in cognitive and social psychology. One of the more important developments is that we now understand the marvels as well as the flaws of intuitive thought.
本书并不是要阐述阿莫斯和我一起进行的早期研究,这项任务多年来已经由许多作者干练地完成了。 我在这里的主要目的是介绍一种关于心灵如何运作的观点,这种观点借鉴了认知和社会心理学的最新发展。其中一个 更重要的发展是,我们现在理解了直觉思维的奇迹和缺陷。
Amos and I did not address accurate intuitions beyond the casual statement that judgment heuristics “are quite useful, but sometimes lead to severe and systematic errors.” We focused on biases, both because we found them interesting in their own right and because they provided evidence for the heuristics of judgment. We did not ask ourselves whether all intuitive judgments under uncertainty are produced by the heuristics we studied; it is now clear that they are not. In particular, the accurate intuitions of experts are better explained by the effects of prolonged practice than by heuristics. We can now draw a richer and more balanced picture, in which skill and heuristics are alternative sources of intuitive judgments and choices.
阿莫斯和我除了随口说了一句判断启发式方法 "相当有用,但有时会导致严重的系统性错误 "之外,并没有涉及准确的直觉。我们关注偏见,一是因为我们发现 ,它们本身就很有趣,二是因为它们为判断的启发式方法提供了证据 。我们没有问自己,是否所有不确定性下的直觉判断都是由我们研究的启发式方法产生的;现在很清楚,它们不是。特别是,专家的准确直觉能更好地被 长期实践 的影响所解释,而不是被启发式方法所解释。 我们现在可以画出一幅更丰富和更平衡的图画,其中技能和启发式方法是 直觉判断和选择的替代来源。
The psychologist Gary Klein tells the story of a team of firefighters that entered a house in which the kitchen was on fire . Soon after they started hosing down the kitchen, the commander heard himself shout, “Let’s get out of here!” without realizing why. The floor collapsed almost immediately after the firefighters escaped. Only after the fact did the commander realize that the fire had been unusually quiet and that his ears had been unusually hot. Together, these impressions prompted what he called a “sixth sense of danger.” He had no idea what was wrong, but he knew something was wrong. It turned out that the heart of the fire had not been in the kitchen but in the basement beneath where the men had stood.
心理学家加里-克莱因(Gary Klein)讲述了这样一个故事:一队消防员进入一间 厨房着火 的房子。在他们开始冲洗厨房后不久,指挥官听到自己喊道:"我们离开这里!"却没有意识到原因。在消防员逃出后,地板几乎立即倒塌了。只有在 这一事实后,指挥官才意识到火势异常安静,他的耳朵异常热。这些印象加在一起,促使他产生了所谓的 "危险的第六感"。他不知道出了什么问题,但他知道有些事情是不对的。事实证明,火灾的中心并不在厨房,而是在那些人所站的地下室。
We have all heard such stories of expert intuition: the chess master who walks past a street game and announces “White mates in three” without stopping, or the physician who makes a complex diagnosis after a single glance at a patient. Expert intuition strikes us as magical, but it is not. Indeed, each of us performs feats of intuitive expertise many times each day. Most of us are pitch-perfect in detecting anger in the first word of a telephone call, recognize as we enter a room that we were the subject of the conversation, and quickly react to subtle signs that the driver of the car in the next lane is dangerous. Our everyday intuitive abilities are no less marvelous than the striking insights of an experienced firefighter or physician—only more common.
我们都听说过 ,这种专家直觉的故事:国际象棋大师走过街头的棋局,马不停蹄地宣布 "白方三点一线",或者医生看了病人一眼就做出了复杂的诊断。 专家直觉让我们觉得很神奇,但其实不然。事实上,我们每个人每天都会多次完成直觉专长的壮举。 我们中的大多数人都能在电话的第一句话中完美地察觉到愤怒 ,在我们进入一个房间时认识到我们是谈话的对象,并对隔壁车道上的汽车司机有危险的细微迹象迅速做出反应。我们日常的直觉能力并不亚于经验丰富的消防员或医生的惊人洞察力,只是更常见而已。
The psychology of accurate intuition involves no magic. Perhaps the best short statement of it is by the great Herbert Simon, who studied chess masters and showed that after thousands of hours of practice they come to see the pieces on the board differently from the rest of us. You can feel Simon’s impatience with the mythologizing of expert intuition when he writes: “The situation has provided a cue; this cue has given the expert access to information stored in memory, and the information provides the answer. Intuition is nothing more and nothing less than recognition.”
准确直觉的心理学涉及 ,没有魔法。也许伟大的赫伯特-西蒙(Herbert Simon) 对 它 做了 最好的简短陈述,他 研究了国际象棋大师 ,发现经过数千小时的练习,他们对棋盘上的棋子的看法与我们其他人不同。 你可以感受到西蒙对专家直觉神话的不耐烦,他写道: "情况 已经提供了一个线索;这个线索让专家获得了储存在记忆中的信息 ,而这些信息提供了答案。 直觉不过是识别,也不过如此"。
We are not surprised when a two-year-old looks at a dog and says “doggie!” because we are used to the miracle of children learning to recognize and name things. Simon’s point is that the miracles of expert intuition have the same character. Valid intuitions develop when experts have learned to recognize familiar elements in a new situation and to act in a manner that is appropriate to it. Good intuitive judgments come to mind with the same immediacy as “doggie!”
当一个两岁的孩子看着一只狗说 "小狗 "时,我们并不感到惊讶。因为我们已经习惯了儿童学习识别 ,并为事物命名的奇迹。西蒙的观点是,专家直觉的奇迹也有同样的特点。当专家们学会了在新情况下识别熟悉的元素并以适合它的方式行事时,有效的直觉就会发展到 。 好的直觉判断就像 "小狗!"一样直接地出现在脑海中。
Unfortunately, professionals’ intuitions do not all arise from true expertise. Many years ago I visited the chief investment officer of a large financial firm, who told me that he had just invested some tens of millions of dollars in the stock of Ford Motor Company. When I asked how he had made that decision, he replied that he had recently attended an automobile show and had been impressed. “Boy, do they know how to make a car!” was his explanation. He made it very clear that he trusted his gut feeling and was satisfied with himself and with his decision. I found it remarkable that he had apparently not considered the one question that an economist would call relevant: Is Ford stock currently underpriced? Instead, he had listened to his intuition; he liked the cars, he liked the company, and he liked the idea of owning its stock. From what we know about the accuracy of stock picking, it is reasonable to believe that he did not know what he was doing.
不幸的是,专业人士的直觉并不都是来自真正的专业知识。许多年前,我拜访了一家大型金融公司的首席投资官,他告诉我,他刚刚在福特汽车公司的股票上投资了 ,大约几千万美元。当我问他是如何做出这个决定的时候,他回答说,他最近参加了一个汽车展,并留下了深刻印象。"他说得很清楚,他相信自己的直觉,对自己和自己的决定感到满意。我发现,他 ,显然没有考虑一个经济学家称之为相关的问题:福特股票目前的价格是否过低?相反,他听从了自己的直觉;他喜欢这些车,喜欢这家公司,喜欢拥有其股票的想法。根据我们对选股准确性的了解,有理由相信他不知道自己在做什么。
The specific heuristics that Amos and I studied provide little help in understanding how the executive came to invest in Ford stock, but a broader conception of heuristics now exists, which offers a good account. An important advance is that emotion now looms much larger in our understanding of intuitive judgments and choices than it did in the past. The executive’s decision would today be described as an example of the affect heuristic , where judgments and decisions are guided directly by feelings of liking and disliking, with little deliberation or reasoning.
Amos和我研究的具体启发式方法对理解这位高管如何投资福特汽车的股票没有什么帮助,但现在存在一个更广泛的启发式概念,它提供了一个很好的解释。 一个重要的进步是,现在情感在我们对直觉判断和选择的理解中比过去大得多。今天,这位主管的决定将被描述为 情感启发式 的一个例子 ,在这种情况下,判断和决定直接由喜欢和不喜欢的感觉引导,几乎没有考虑或推理。
When confronted with a problem—choosing a chess move or deciding whether to invest in a stock—the machinery of intuitive thought does the best it can. If the individual has relevant expertise, she will recognize the situation, and the intuitive solution that comes to her mind is likely to be correct. This is what happens when a chess master looks at a complex position: the few moves that immediately occur to him are all strong. When the question is difficult and a skilled solution is not available, intuition still has a shot: an answer may come to mind quickly—but it is not an answer to the original question. The question that the executive faced (should I invest in Ford stock?) was difficult, but the answer to an easier and related question (do I like Ford cars?) came readily to his mind and determined his choice. This is the essence of intuitive heuristics: when faced with a difficult question, we often answer an easier one instead, usually without noticing the substitution .
当面临一个问题时--选择棋子或决定是否投资一只股票--直觉思维的机器会尽其所能。如果个人有相关的专业知识,她就会认识到这种情况,而出现在 她脑海中的直觉解决方案很可能是正确的。这就是国际象棋大师看一个复杂局面时的情况:他立即想到的几步棋都很强。当问题很困难,没有熟练的解决方案时,直觉仍然有机会:可能很快想到一个答案,但这并不是对最初问题的回答。这位高管面临的问题(我是否应该投资 ,购买福特汽车的股票?)很困难,但一个更容易的相关问题(我是否喜欢福特汽车?)的答案却很容易出现在他的脑海中,并决定了他的选择。 这就是直觉启发法的本质:当面临一个困难的问题时,我们往往会回答一个更容易的问题,而通常 没有注意到这种替换 。
The spontaneous search for an intuitive solution sometimes fails—neither an expert solution nor a heuristic answer comes to mind. In such cases we often find ourselves switching to a slower, more deliberate and effortful form of thinking. This is the slow thinking of the title. Fast thinking includes both variants of intuitive thought—the expert and the heuristic—as well as the entirely automatic mental activities of perception and memory, the operations that enable you to know there is a lamp on your desk or retrieve the name of the capital of Russia.
自发地寻找直观的解决方案有时会失败--既没有想到 专家的解决方案,也没有想到启发式的答案。在这种情况下,我们常常发现自己转而采用一种更缓慢、更慎重和更努力的思考方式。这就是标题中的缓慢思考。快速思维包括直觉思维的两种变体--专家式和启发式--以及感知和记忆的完全自动的心理活动,这些操作使你能够 ,知道你的桌子上有一盏灯或检索出俄罗斯首都的名称。
The distinction between fast and slow thinking has been explored by many psychologists over the last twenty-five years. For reasons that I explain more fully in the next chapter, I describe mental life by the metaphor of two agents, called System 1 and System 2, which respectively produce fast and slow thinking. I speak of the features of intuitive and deliberate thought as if they were traits and dispositions of two characters in your mind. In the picture that emerges from recent research, the intuitive System 1 is more influential than your experience tells you, and it is the secret author of many of the choices and judgments you make. Most of this book is about the workings of System 1 and the mutual influences between it and System 2.
在过去的25年中,许多心理学家一直在探讨快速和缓慢思维的区别。 由于我在下一章中更充分地解释的原因,我通过两个代理人的比喻来描述心理生活,称为系统1和系统2,它们分别产生快速和缓慢思维。 我把直觉和慎思的特点说成是你头脑中两个角色的特征和倾向。 在最近的研究中出现的画面中,直觉系统1比你的经验告诉你的更有影响力,它是你做出的许多选择和判断的秘密作者。这本书的大部分内容是关于系统1的运作以及它和系统2之间的相互 影响。
WHAT COMES NEXT
接下来的事情
The book is divided into five parts. Part 1 presents the basic elements of a two-systems approach to judgment and choice. It elaborates the distinction between the automatic operations of System 1 and the controlled operations of System 2, and shows how associative memory, the core of System 1, continually constructs a coherent interpretation of what is going on in our world at any instant. I attempt to give a sense of the complexity and richness of the automatic and often unconscious processes that underlie intuitive thinking, and of how these automatic processes explain the heuristics of judgment. A goal is to introduce a language for thinking and talking about the mind.
该书分为五个部分。 第一部分 介绍了判断和选择的双系统方法的基本要素。 它阐述了系统1的自动操作和系统2的受控操作之间的区别,并表明联想记忆,即系统1的核心,是如何不断地构建一个连贯的解释 ,说明我们的世界在任何瞬间发生了什么。我试图让人们了解作为直觉思维基础的自动和经常无意识的过程的复杂性和丰富性,以及这些自动过程如何解释判断的启发式方法。 一个目标是引入一种思考和谈论心灵的语言。
Part 2 updates the study of judgment heuristics and explores a major puzzle: Why is it so difficult for us to think statistically? We easily think associatively, we think metaphorically, we think causally, but statistics requires thinking about many things at once, which is something that System 1 is not designed to do.
第二部分 更新了对判断启发法的研究,并探讨了 一个主要的难题:为什么我们很难进行统计学思考?我们很容易进行联想式思考,我们进行隐喻式思考,我们进行因果式思考,但统计学需要同时思考许多事情,这是系统1的设计所不能做到的。
The difficulties of statistical thinking contribute to the main theme of Part 3 , which describes a puzzling limitation of our mind: our excessive confidence in what we believe we know, and our apparent inability to acknowledge the full extent of our ignorance and the uncertainty of the world we live in. We are prone to overestimate how much we understand about the world and to underestimate the role of chance in events. Overconfidence is fed by the illusory certainty of hindsight. My views on this topic have been influenced by Nassim Taleb, the author of The Black Swan . I hope for watercooler conversations that intelligently explore the lessons that can be learned from the past while resisting the lure of hindsight and the illusion of certainty.
统计思维的困难有助于 第三部分 的主题,它描述了我们思想的一个令人困惑的局限性:我们 ,对我们认为自己知道的东西过度自信,而我们显然无法承认我们的无知和我们生活的世界的不确定性的全部程度。 我们很容易高估我们对世界的理解程度,低估事件中偶然性的作用。过度自信是由事后的虚幻的确定性滋生的。我对这个问题的看法受到了 《 黑天鹅》 的作者纳西姆-塔勒布的影响。我希望在茶水间的对话中,能够明智地探讨可以从过去吸取的教训,同时抵制事后诸葛亮的诱惑和确定性的幻觉。
The focus of part 4 is a conversation with the discipline of economics on the nature of decision making and on the assumption that economic agents are rational. This section of the book provides a current view, informed by the two-system model, of the key concepts of prospect theory, the model of choice that Amos and I published in 1979. Subsequent chapters address several ways human choices deviate from the rules of rationality. I deal with the unfortunate tendency to treat problems in isolation, and with framing effects, where decisions are shaped by inconsequential features of choice problems. These observations, which are readily explained by the features of System 1, present a deep challenge to the rationality assumption favored in standard economics.
第四部分 的重点是与经济学学科就决策的性质和经济主体是理性的假设进行对话。本书的这一部分 ,在双系统模型的启发下,对前景理论的关键概念提出了当前的观点,即阿莫斯和我在1979年发表的选择模型。随后的章节讨论了人类选择偏离理性规则的几种方式。 我处理了孤立对待问题的不幸倾向,以及框架效应,即决定是由选择问题的无关紧要的 。这些观察很容易被系统1的特征所解释,对标准经济学所青睐的理性假设提出了深刻挑战。
Part 5 describes recent research that has introduced a distinction between two selves, the experiencing self and the remembering self, which do not have the same interests. For example, we can expose people to two painful experiences. One of these experiences is strictly worse than the other, because it is longer. But the automatic formation of memories—a feature of System 1—has its rules, which we can exploit so that the worse episode leaves a better memory. When people later choose which episode to repeat, they are, naturally, guided by their remembering self and expose themselves (their experiencing self) to unnecessary pain. The distinction between two selves is applied to the measurement of well-being, where we find again that what makes the experiencing self happy is not quite the same as what satisfies the remembering self. How two selves within a single body can pursue happiness raises some difficult questions, both for individuals and for societies that view the well-being of the population as a policy objective.
第五部分 介绍了最近的研究,引入了两个自我之间的区别,即体验中的自我和记忆中的自我,它们的利益并不相同。例如,我们可以让人们 两种痛苦的体验。其中一种体验严格来说比另一种更糟糕,因为它的时间更长。但记忆的自动形成--系统1的特征--有其规则,我们可以利用这些规则,使更坏的情节留下更好的记忆。 当人们后来选择重复哪个情节时,他们自然会被记忆中的自己引导,使自己(他们的体验 )暴露在不必要的痛苦中。两个自我之间的区别被应用于福祉的衡量,我们再次发现,使体验自我快乐的东西与满足记忆自我的东西不尽相同。 一个身体内的两个自我如何能够追求幸福,这对个人和将 人口的福祉视为政策目标的社会都提出了一些难题。
A concluding chapter explores, in reverse order, the implications of three distinctions drawn in the book: between the experiencing and the remembering selves, between the conception of agents in classical economics and in behavioral economics (which borrows from psychology), and between the automatic System 1 and the effortful System 2. I return to the virtues of educating gossip and to what organizations might do to improve the quality of judgments and decisions that are made on their behalf.
最后一章以相反的顺序探讨了书中得出的三个区别的含义:体验中的自我和记忆中的自我之间,古典经济学和行为经济学(借用心理学)中的代理人概念之间,以及自动系统1和努力系统2之间。我回到了教育八卦的 美德 ,以及组织可以做什么来提高代表他们作出的判断和决定的质量。
Two articles I wrote with Amos are reproduced as appendixes to the book. The first is the review of judgment under uncertainty that I described earlier. The second, published in 1984, summarizes prospect theory as well as our studies of framing effects. The articles present the contributions that were cited by the Nobel committee—and you may be surprised by how simple they are. Reading them will give you a sense of how much we knew a long time ago, and also of how much we have learned in recent decades.
我和阿莫斯写的两篇文章作为书中的附录转载。首先是我前面描述的不确定情况下的判断审查。第二篇发表于1984年,总结了前景理论以及我们对框架效应的研究。这些文章 ,介绍了被诺贝尔委员会引用的贡献--你可能会对它们的简单程度感到惊讶。阅读它们会让你感觉到我们在很久以前就知道了多少,也会感觉到我们在最近几十年里学到了多少。
1
1
The Characters of the Story
故事中的人物
To observe your mind in automatic mode, glance at the image below.
要观察你的头脑处于自动模式,请看下面的图片。

Figure 1
图一
Your experience as you look at the woman’s face seamlessly combines what we normally call seeing and intuitive thinking. As surely and quickly as you saw that the young woman’s hair is dark, you knew she is angry. Furthermore, what you saw extended into the future. You sensed that this woman is about to say some very unkind words, probably in a loud and strident voice. A premonition of what she was going to do next came to mind automatically and effortlessly. You did not intend to assess her mood or to anticipate what she might do, and your reaction to the picture did not have the feel of something you did. It just happened to you. It was an instance of fast thinking.
当你看着这个女人的脸时,你的经验无缝地结合了我们通常所说的观察和直觉思维。当你看到这个年轻女人的头发是黑色的,你就知道她在生气。此外,你看到的东西延伸到了未来。 你感觉到这个女人即将说一些非常不友好的话,可能是用一种响亮而尖锐的声音。一个预感 ,她接下来要做什么,自动而毫不费力地浮现在脑海中。你并没有打算评估她的情绪或预测她可能做什么,你对图片的反应也没有 ,感觉是你做的事情。它只是发生在你身上。这是一个快速思考的例子。
Now look at the following problem:
现在看一下下面的问题。
17 × 24
17 × 24
You knew immediately that this is a multiplication problem, and probably knew that you could solve it, with paper and pencil, if not without. You also had some vague intuitive knowledge of the range of possible results. You would be quick to recognize that both 12,609 and 123 are implausible. Without spending some time on the problem, however, you would not be certain that the answer is not 568. A precise solution did not come to mind, and you felt that you could choose whether or not to engage in the computation. If you have not done so yet, you should attempt the multiplication problem now, completing at least part of it.
你马上就知道这是一个乘法问题, ,并且可能知道你可以用纸和笔解决它,如果没有的话。你也对可能的结果范围有一些模糊的直觉知识。你会很快认识到12,609和123都是不可能的。 然而,如果不花一些时间在这个问题上,你就不能确定答案不是568。一个精确的解决方案并没有出现在你的脑海中,你觉得 ,你可以选择是否参与计算。如果你还没有这样做,你现在应该尝试一下乘法问题,至少完成它的一部分。
You experienced slow thinking as you proceeded through a sequence of steps. You first retrieved from memory the cognitive program for multiplication that you learned in school, then you implemented it. Carrying out the computation was a strain. You felt the burden of holding much material in memory, as you needed to keep track of where you were and of where you were going, while holding on to the intermediate result. The process was mental work: deliberate, effortful, and orderly—a prototype of slow thinking. The computation was not only an event in your mind; your body was also involved. Your muscles tensed up, your blood pressure rose, and your heart rate increased. Someone looking closely at your eyes while you tackled this problem would have seen your pupils dilate. Your pupils contracted back to normal size as soon as you ended your work—when you found the answer (which is 408, by the way) or when you gave up.
当你通过一连串的步骤进行时,你经历了缓慢的思考。你首先从记忆中找回你在学校学到的乘法认知程序,然后执行它。进行计算 是一种压力。你感到在记忆中持有许多材料的负担,因为你需要跟踪你在哪里,你要去哪里,同时保持中间结果。 这个过程是脑力劳动:故意的,努力的,有序的--慢速思维的原型。计算不仅是你头脑中的事件,你的身体也参与其中。你的肌肉绷紧,你的 血压上升,你的心率加快。在你处理这个问题时,有人仔细观察你的眼睛,会看到你的瞳孔放大了。一旦你结束工作,你的瞳孔就会收缩到正常大小--当你找到答案时(顺便说一下,这是408个)或当你放弃时。
TWO SYSTEMS
两套系统
Psychologists have been intensely interested for several decades in the two modes of thinking evoked by the picture of the angry woman and by the multiplication problem, and have offered many labels for them. I adopt terms originally proposed by the psychologists Keith Stanovich and Richard West, and will refer to two systems in the mind, System 1 and System 2.
几十年来,心理学家对愤怒的女人的图片和乘法问题所唤起的两种思维模式 ,并为它们 提供了许多标签 。我采用了最初由心理学家Keith Stanovich和Richard West提出的术语,并将提及头脑中的两个系统:系统1和系统2。
System 1 operates automatically and quickly, with little or no effort and no sense of voluntary control.
系统1 自动而快速地运行,几乎不需要努力,也没有自愿控制的感觉。
System 2 allocates attention to the effortful mental activities that demand it, including complex computations. The operations of System 2 are often associated with the subjective experience of agency , choice, and concentration.
系统2 ,将注意力分配给需要它的努力的心理活动,包括复杂的计算。系统2的操作往往与 代理 、选择和集中的 主观体验 有关。
The labels of System 1 and System 2 are widely used in psychology, but I go further than most in this book, which you can read as a psychodrama with two characters.
系统1和系统2的标签在心理学中被广泛使用,但我在这本书中比大多数人走得更远,你可以把它当作一个有两个角色的心理剧来读。
When we think of ourselves, we identify with System 2, the conscious, reasoning self that has beliefs, makes choices, and decides what to think about and what to do. Although System 2 believes itself to be where the action is, the automatic System 1 is the hero of the book. I describe System 1 as effortlessly originating impressions and feelings that are the main sources of the explicit beliefs and deliberate choices of System 2. The automatic operations of System 1 generate surprisingly complex patterns of ideas, but only the slower System 2 can construct thoughts in an orderly series of steps. I also describe circumstances in which System 2 takes over, overruling the freewheeling impulses and associations of System 1. You will be invited to think of the two systems as agents with their individual abilities, limitations, and functions.
当我们把自己想成 ,我们就会认同系统2,即有意识的、推理的自我,它有信念,做出选择,并决定想什么和做什么。虽然系统2认为自己是行动的所在,但自动的系统1才是本书的主角。我把系统1描述为毫不费力地产生印象和感觉,这些印象和感觉是系统2的明确信念和有意的 。系统1的自动操作产生了令人惊讶的复杂的思想模式,但只有速度较慢的系统2能以一系列有序的步骤构建思想。我还描述了系统2接管的情况,推翻了系统1的自由冲动和联想。我们将邀请你把这两个系统看作是具有各自的 能力、限制和功能的代理人。
In rough order of complexity, here are some examples of the automatic activities that are attributed to System 1:
按照复杂程度的粗略顺序,以下是归属于系统1的一些自动活动的例子。
Detect that one object is more distant than another.
检测到一个物体比另一个物体更遥远。
Orient to the source of a sudden sound.
向突然出现的声音的源头看去。
Complete the phrase “bread and …”
完成 "面包和...... "这句话。
Make a “disgust face” when shown a horrible picture.
当看到一张可怕的图片时,做一个 "厌恶的表情"。
Detect hostility in a voice.
检测声音中的敌意。
Answer to 2 + 2 = ?
答案是2+2=?
Read words on large billboards.
阅读大型广告牌上的文字 。
Drive a car on an empty road.
在空旷的道路上驾驶汽车。
Find a strong move in chess (if you are a chess master).
在国际象棋中找到一个强有力的步骤(如果你是一个国际象棋大师)。
Understand simple sentences.
理解简单的句子。
Recognize that a “meek and tidy soul with a passion for detail” resembles an occupational stereotype.
认识到 "温顺整洁、热衷于细节的灵魂 "类似于一种职业刻板印象。
All these mental events belong with the angry woman—they occur automatically and require little or no effort. The capabilities of System 1 include innate skills that we share with other animals. We are born prepared to perceive the world around us, recognize objects, orient attention, avoid losses, and fear spiders. Other mental activities become fast and automatic through prolonged practice. System 1 has learned associations between ideas (the capital of France?); it has also learned skills such as reading and understanding nuances of social situations. Some skills, such as finding strong chess moves, are acquired only by specialized experts. Others are widely shared. Detecting the similarity of a personality sketch to an occupational stereotype requires broad knowledge of the language and the culture, which most of us possess. The knowledge is stored in memory and accessed without intention and without effort.
所有这些心理事件都属于愤怒的女人--它们自动发生,几乎不需要任何努力。系统1的能力包括我们与其他动物共享的先天技能 。我们生来就准备好感知我们周围的世界,识别物体,确定注意力的方向,避免损失, ,并害怕蜘蛛。其他心理活动通过长期的练习变得快速和自动。系统1已经学会了各种想法之间的关联(法国的首都?);它还学会了一些技能,如阅读和理解社会情况的细微差别。 有些技能,如寻找强大的国际象棋动作,只有专门的专家才能获得。其他的则被广泛分享。检测一个人格素描与一个职业刻板印象的相似性需要广泛的语言和文化知识,而我们大多数人都拥有这些知识。这些知识被储存在记忆中,无需刻意和努力就能获取。
Several of the mental actions in the list are completely involuntary. You cannot refrain from understanding simple sentences in your own language or from orienting to a loud unexpected sound, nor can you prevent yourself from knowing that 2 + 2 = 4 or from thinking of Paris when the capital of France is mentioned. Other activities, such as chewing, are susceptible to voluntary control but normally run on automatic pilot. The control of attention is shared by the two systems. Orienting to a loud sound is normally an involuntary operation of System 1, which immediately mobilizes the voluntary attention of System 2. You may be able to resist turning toward the source of a loud and offensive comment at a crowded party, but even if your head does not move, your attention is initially directed to it, at least for a while. However, attention can be moved away from an unwanted focus, primarily by focusing intently on another target.
列表中的几个心理行为 ,完全是非自愿的。你不能不理解自己语言中的简单句子,也不能不对一个突如其来的响声进行定向,你也不能阻止自己知道2+2=4,或在提到法国首都时想到巴黎。 其他活动,如咀嚼,容易受到自愿控制,但通常是自动运行的。注意力的控制 ,是由两个系统共享的。对一个响亮的声音的定向通常是系统1的非自愿操作,它立即调动系统2的自愿注意。在一个拥挤的聚会上,你可能能够抵制转向一个响亮和令人反感的评论的来源,但即使你的头没有移动,你的注意力最初是指向它,至少有一段时间。然而, ,注意力可以从不需要的焦点移开,主要是通过专注于另一个目标。
The highly diverse operations of System 2 have one feature in common: they require attention and are disrupted when attention is drawn away. Here are some examples:
系统2的高度多样化的操作有一个共同的特点:它们需要注意力,当注意力被吸引走时就会中断。这里有一些例子。
Brace for the starter gun in a race.
在比赛中为发令枪打起精神。
Focus attention on the clowns in the circus.
将注意力集中在马戏团的小丑身上。
Focus on the voice of a particular person in a crowded and noisy room.
在 一个拥挤和嘈杂的房间里,专注于某个人的声音。
Look for a woman with white hair.
寻找一个白头发的女人。
Search memory to identify a surprising sound.
搜索记忆,找出一个令人惊讶的声音。
Maintain a faster walking speed than is natural for you.
保持一个比你自然的更快的行走速度。
Monitor the appropriateness of your behavior in a social situation.
监测你在社交场合的行为是否恰当。
Count the occurrences of the letter a in a page of text.
数一数字母 a 在一页文本中的出现次数。
Tell someone your phone number.
告诉别人你的电话号码。
Park in a narrow space (for most people except garage attendants).
在狭窄的空间里停车(对大多数人来说,除了车库管理员)。
Compare two washing machines for overall value.
比较两台 洗衣机的整体价值。
Fill out a tax form.
填写税表。
Check the validity of a complex logical argument.
检查一个复杂的逻辑论证的有效性。
In all these situations you must pay attention, and you will perform less well, or not at all, if you are not ready or if your attention is directed inappropriately. System 2 has some ability to change the way System 1 works, by programming the normally automatic functions of attention and memory. When waiting for a relative at a busy train station, for example, you can set yourself at will to look for a white-haired woman or a bearded man, and thereby increase the likelihood of detecting your relative from a distance. You can set your memory to search for capital cities that start with N or for French existentialist novels. And when you rent a car at London’s Heathrow Airport, the attendant will probably remind you that “we drive on the left side of the road over here.” In all these cases, you are asked to do something that does not come naturally, and you will find that the consistent maintenance of a set requires continuous exertion of at least some effort.
在所有这些情况下,你必须注意,如果你没有准备好,或者你的注意力被不适当地引导,你就会表现得不那么好,或者根本没有表现。 系统2有一些能力改变系统1的工作方式,通过对注意力和 记忆的正常自动功能进行编程。例如,当你在繁忙的火车站等待亲戚时,你可以随意设置自己寻找白发女人或大胡子男人,从而增加从远处发现亲戚的可能性。 你可以设置你的记忆来搜索以 N 开头的首都或法国存在主义小说。 当你在伦敦希思罗机场租车时, ,服务员可能会提醒你,"我们在这里靠道路的左边行驶"。在所有这些情况下,你被要求做一些不自然的事情,而且你会发现,持续保持一套要求至少持续付出一些努力。
The often-used phrase “pay attention” is apt: you dispose of a limited budget of attention that you can allocate to activities, and if you try to go beyond your budget, you will fail. It is the mark of effortful activities that they interfere with each other, which is why it is difficult or impossible to conduct several at once. You could not compute the product of 17 × 24 while making a left turn into dense traffic, and you certainly should not try. You can do several things at once, but only if they are easy and undemanding. You are probably safe carrying on a conversation with a passenger while driving on an empty highway, and many parents have discovered, perhaps with some guilt, that they can read a story to a child while thinking of something else.
经常使用的短语 "注意 "很贴切:你支配的注意力预算有限,你可以将其分配到 ,如果你试图超出预算,你就会失败。努力的活动的特点是它们相互干扰,这就是为什么很难或不可能同时进行几个活动。你不可能在左转弯进入密集车流时计算17×24的乘积,你当然也不应该尝试。你可以同时做几件事情,但前提是它们 ,而且要求不高。你在空旷的高速公路上开车时与乘客进行交谈可能是安全的,许多父母发现,也许他们有些内疚,他们可以一边给孩子读故事一边想别的事情。
Everyone has some awareness of the limited capacity of attention, and our social behavior makes allowances for these limitations. When the driver of a car is overtaking a truck on a narrow road, for example, adult passengers quite sensibly stop talking. They know that distracting the driver is not a good idea, and they also suspect that he is temporarily deaf and will not hear what they say.
每个人都对注意力的有限能力有一定的认识,我们的社会行为也为这些限制做出了安排。例如,当 ,汽车司机在狭窄的道路上超越卡车时,成年乘客很理智地停止交谈。他们知道让司机分心不是个好主意,而且他们还怀疑司机暂时失聪,不会听到他们说的话。
Intense focusing on a task can make people effectively blind, even to stimuli that normally attract attention. The most dramatic demonstration was offered by Christopher Chabris and Daniel Simons in their book The Invisible Gorilla . They constructed a short film of two teams passing basketballs, one team wearing white shirts, the other wearing black. The viewers of the film are instructed to count the number of passes made by the white team, ignoring the black players. This task is difficult and completely absorbing. Halfway through the video, a woman wearing a gorilla suit appears, crosses the court, thumps her chest, and moves on. The gorilla is in view for 9 seconds. Many thousands of people have seen the video, and about half of them do not notice anything unusual. It is the counting task—and especially the instruction to ignore one of the teams—that causes the blindness. No one who watches the video without that task would miss the gorilla. Seeing and orienting are automatic functions of System 1, but they depend on the allocation of some attention to the relevant stimulus. The authors note that the most remarkable observation of their study is that people find its results very surprising. Indeed, the viewers who fail to see the gorilla are initially sure that it was not there—they cannot imagine missing such a striking event. The gorilla study illustrates two important facts about our minds: we can be blind to the obvious, and we are also blind to our blindness.
强烈地专注于一项任务可以使人有效地失明,甚至对通常吸引注意力的刺激物也是如此。最引人注目的证明 ,是由克里斯托弗-查布里斯和丹尼尔-西蒙斯在他们的书 《看不见的大猩猩 》中提供的。他们构建了一个两队传递篮球的短片,一队穿白色衬衫,另一队穿黑色衬衫。影片的观众被要求计算白队的传球次数,而忽略了黑人球员。这项任务是困难的,而且完全是吸收性的。在 视频的一半时,一个穿着大猩猩衣服的女人出现了,穿过球场,捶打着胸口,然后继续前进。大猩猩出现在视野中的时间是 9秒。数以千计的人看过这段视频,其中约有一半人没有发现任何异常。正是计数任务--尤其是忽略其中一个团队的指令--导致了盲目性。没有那个任务 ,看视频的人就不会错过大猩猩。观察和定向是系统1的自动功能,但它们取决于对相关刺激物的一些注意力的分配。作者指出,他们的研究中最显著的观察是人们发现其结果非常令人惊讶。事实上,未能看到大猩猩的观众最初肯定它不在那里--他们无法想象错过这样的 ,一个引人注目的事件。大猩猩的研究说明了关于我们思想的两个重要事实:我们可以对明显的东西视而不见,而且我们也对自己的盲目性视而不见。
PLOT SYNOPSIS
剧情简介
The interaction of the two systems is a recurrent theme of the book, and a brief synopsis of the plot is in order. In the story I will tell, Systems 1 and 2 are both active whenever we are awake. System 1 runs automatically and System 2 is normally in a comfortable low-effort mode, in which only a fraction of its capacity is engaged. System 1 continuously generates suggestions for System 2: impressions, intuitions, intentions, and feelings. If endorsed by System 2, impressions and intuitions turn into beliefs, and impulses turn into voluntary actions. When all goes smoothly, which is most of the time, System 2 adopts the suggestions of System 1 with little or no modification. You generally believe your impressions and act on your desires, and that is fine—usually.
两个系统的互动是该书的一个反复出现的主题,简要介绍一下情节是有必要的。在我要讲的故事中,只要我们醒着,系统1和2都是活跃的。系统1自动运行 ,而系统2通常处于舒适的低能耗模式,其中只有一小部分能力被使用。系统1不断为系统2产生建议:印象、直觉、意图和感觉。如果得到系统2的认可,印象和直觉就会变成信念,冲动就会变成自愿的行动。当一切顺利时,也就是大多数时候,系统2会采纳系统1的建议, ,几乎没有任何修改。你通常会相信你的印象,按照你的愿望行动,这很好,通常是这样。
When System 1 runs into difficulty, it calls on System 2 to support more detailed and specific processing that may solve the problem of the moment. System 2 is mobilized when a question arises for which System 1 does not offer an answer, as probably happened to you when you encountered the multiplication problem 17 × 24. You can also feel a surge of conscious attention whenever you are surprised. System 2 is activated when an event is detected that violates the model of the world that System 1 maintains. In that world, lamps do not jump, cats do not bark, and gorillas do not cross basketball courts. The gorilla experiment demonstrates that some attention is needed for the surprising stimulus to be detected. Surprise then activates and orients your attention: you will stare, and you will search your memory for a story that makes sense of the surprising event. System 2 is also credited with the continuous monitoring of your own behavior—the control that keeps you polite when you are angry, and alert when you are driving at night. System 2 is mobilized to increased effort when it detects an error about to be made. Remember a time when you almost blurted out an offensive remark and note how hard you worked to restore control. In summary, most of what you (your System 2) think and do originates in your System 1, but System 2 takes over when things get difficult, and it normally has the last word.
当系统1遇到困难时,它会调用系统2来支持更详细、更具体的处理,以解决当前的问题。当出现系统1无法提供答案的问题时,系统2就会被调动起来,就像你遇到17×24的乘法问题时可能 。每当你感到惊讶时,你也能感觉到意识注意力的激增。 当检测到一个违反系统1所维护的世界模型的事件时,系统2被激活。 在那个世界里,灯不会跳,猫不会叫,大猩猩不会穿过篮球场。大猩猩的实验表明, ,需要一定的注意力才能发现令人惊讶的刺激。然后,惊讶会激活并引导你的注意力:你会盯着看,你会在记忆中搜索一个能让人惊讶的事件有意义的故事。系统2也得益于对你自己行为的持续监控--当你生气时保持礼貌,当你在夜间驾驶时保持警惕的控制。 系统2在检测到即将发生的错误时,会被调动起来,加大努力。记住一次你差点说出冒犯性的话,注意你是如何努力恢复控制的。总之,你(你的系统2)的大部分想法和行为都源于你的系统1,但当事情变得困难时,系统2会接管,而且它通常有最后的决定权。
The division of labor between System 1 and System 2 is highly efficient: it minimizes effort and optimizes performance. The arrangement works well most of the time because System 1 is generally very good at what it does: its models of familiar situations are accurate, its short-term predictions are usually accurate as well, and its initial reactions to challenges are swift and generally appropriate. System 1 has biases, however, systematic errors that it is prone to make in specified circumstances. As we shall see, it sometimes answers easier questions than the one it was asked, and it has little understanding of logic and statistics. One further limitation of System 1 is that it cannot be turned off. If you are shown a word on the screen in a language you know, you will read it—unless your attention is totally focused elsewhere .
系统1和系统2之间的分工是非常有效的:它最大限度地减少了工作,优化了性能。这种安排在大多数情况下都很有效,因为系统1通常非常擅长它的工作:它对熟悉情况的模型是准确的,它的短期预测通常也是准确的,它对挑战的最初反应是迅速的,而且通常是适当的。然而,系统1有偏见, ,它在特定情况下容易犯系统性错误。正如我们将看到的,它有时回答的问题比它被要求的问题更容易,而且它对逻辑和统计的理解很少。系统1的另一个限制是,它不能被关闭。如果你在屏幕上看到一个你知道的语言的单词,你就会读它--除非你的 注意力完全集中在其他地方 。
CONFLICT
冲突
Figure 2 is a variant of a classic experiment that produces a conflict between the two systems . You should try the exercise before reading on.
图2 是一个经典实验的变体,它产生了 两个系统之间的冲突 。 在继续阅读之前,你应该尝试一下这个练习。
Figure 2
图2
You were almost certainly successful in saying the correct words in both tasks, and you surely discovered that some parts of each task were much easier than others. When you identified upper- and lowercase, the left-hand column was easy and the right-hand column caused you to slow down and perhaps to stammer or stumble. When you named the position of words, the left-hand column was difficult and the right-hand column was much easier.
在这两项任务中,你几乎肯定能成功地说出正确的单词,而且你肯定发现每项任务的某些部分要比其他部分容易得多。 当你识别大写和小写时,左边一栏很容易,右边一栏则使你放慢速度,也许还会结巴或结巴。当你说出单词的位置时,左边一栏是 ,右边一栏则容易得多。
These tasks engage System 2, because saying “upper/lower” or “right/ left” is not what you routinely do when looking down a column of words. One of the things you did to set yourself for the task was to program your memory so that the relevant words ( upper and lower for the first task) were “on the tip of your tongue.” The prioritizing of the chosen words is effective and the mild temptation to read other words was fairly easy to resist when you went through the first column. But the second column was different, because it contained words for which you were set, and you could not ignore them. You were mostly able to respond correctly, but overcoming the competing response was a strain, and it slowed you down. You experienced a conflict between a task that you intended to carry out and an automatic response that interfered with it.
这些任务涉及到系统2,因为说 "上/下 "或 "右/左 "并不是你在查看一列单词时的常规做法。你为自己设定任务所做的事情之一是对你的记忆进行编程,使相关的单词 ( 第一项任务的 大写 和 小写 )"在你的舌尖上"。 所选词语的优先次序是有效的,当你通过第一栏时,阅读其他词语的轻微诱惑是相当容易抵制的。但第二栏不同,因为它包含了你被设定的单词,你不能忽视它们。 你大多能做出正确的反应,但克服竞争性反应是一种压力,它使你的速度减慢。你经历了 ,在你打算执行的任务和干扰它的自动反应之间发生了冲突。
Conflict between an automatic reaction and an intention to control it is common in our lives. We are all familiar with the experience of trying not to stare at the oddly dressed couple at the neighboring table in a restaurant. We also know what it is like to force our attention on a boring book, when we constantly find ourselves returning to the point at which the reading lost its meaning. Where winters are hard, many drivers have memories of their car skidding out of control on the ice and of the struggle to follow well-rehearsed instructions that negate what they would naturally do: “Steer into the skid, and whatever you do, do not touch the brakes!” And every human being has had the experience of not telling someone to go to hell. One of the tasks of System 2 is to overcome the impulses of System 1. In other words, System 2 is in charge of self-control.
自动反应和控制反应的意图之间的冲突在我们的生活中很常见。 我们都熟悉这样的经历:在餐馆里,我们试图不盯着邻桌那对穿着古怪的夫妇。我们也知道强迫自己的注意力放在一本无聊的书上是什么感觉,这时 ,我们不断发现自己回到了阅读失去意义的地方。在冬天艰苦的地方,许多司机都有这样的记忆:他们的车在冰面上失控打滑,他们挣扎着听从精心排练的指令,这些指令否定了他们自然会做的事情:"转向打滑,不管做什么,不要碰刹车!"而每个人都有过这样的经历: , 没有 告诉别人去地狱。系统2的任务之一是克服系统1的冲动。 换言之,系统2负责自我控制。
ILLUSIONS
幻觉
To appreciate the autonomy of System 1, as well as the distinction between impressions and beliefs, take a good look at figure 3 .
为了理解系统1的自主性,以及印象和信念之间的区别,好好看看 图3 。
This picture is unremarkable: two horizontal lines of different lengths, with fins appended, pointing in different directions. The bottom line is obviously longer than the one above it. That is what we all see, and we naturally believe what we see. If you have already encountered this image, however, you recognize it as the famous Müller-Lyer illusion. As you can easily confirm by measuring them with a ruler, the horizontal lines are in fact identical in length.
这张图片并不引人注目:两条不同长度的横线,附加有鳍状物, ,指向不同的方向。 底部的线显然比上面的线长。这就是我们大家所看到的,我们 ,自然相信我们所看到的。如果你已经遇到过这幅图,你就会认出这是著名的穆勒-莱尔错觉。 你可以通过用尺子测量来轻松确认,这些水平线的长度实际上是相同的。

Figure 3
图3
Now that you have measured the lines, you—your System 2, the conscious being you call “I”—have a new belief: you know that the lines are equally long. If asked about their length, you will say what you know. But you still see the bottom line as longer. You have chosen to believe the measurement, but you cannot prevent System 1 from doing its thing; you cannot decide to see the lines as equal, although you know they are. To resist the illusion, there is only one thing you can do: you must learn to mistrust your impressions of the length of lines when fins are attached to them. To implement that rule, you must be able to recognize the illusory pattern and recall what you know about it. If you can do this, you will never again be fooled by the Müller-Lyer illusion. But you will still see one line as longer than the other.
现在你已经测量了这些线,你--你的系统2,你称之为 "我 "的意识存在--有了一个新的信念:你 知道 这些线是一样长的。如果被问及他们的长度,你会说你所知道的。但你仍然 看到 底线更长。你选择了相信测量结果,但你不能阻止系统1做它的事情;你不能决定把线 ,尽管你知道它们是相等的。要抵制这种幻觉,你只能做一件事:你必须学会不信任你对线的长度的印象,当鳍连接到它们时。为了实施这一规则,你必须能够识别出虚幻的模式,并回忆起你对它的了解。 如果你能做到这一点,你就不会再被穆勒-莱尔的幻觉所迷惑。但你仍会 ,认为一条线比另一条长。
Not all illusions are visual. There are illusions of thought, which we call cognitive illusions . As a graduate student, I attended some courses on the art and science of psychotherapy. During one of these lectures, our teacher imparted a morsel of clinical wisdom. This is what he told us: “You will from time to time meet a patient who shares a disturbing tale of multiple mistakes in his previous treatment. He has been seen by several clinicians, and all failed him. The patient can lucidly describe how his therapists misunderstood him, but he has quickly perceived that you are different. You share the same feeling, are convinced that you understand him, and will be able to help.” At this point my teacher raised his voice as he said, “Do not even think of taking on this patient! Throw him out of the office! He is most likely a psychopath and you will not be able to help him.”
并非所有的幻觉都是视觉的。作为一名研究生,我参加了一些关于心理治疗的艺术和科学的课程。在其中一次讲座中,我们的老师传授了一丝临床智慧。这是他告诉我们的:"你会不时地遇到一个病人,他分享了一个令人不安的 ,说他以前的治疗有多个错误。他已经被几个临床医生看过,都让他失望了。病人可以清晰地描述他的治疗师是如何误解他的,但他已经很快察觉到你是不同的。 你有同样的感觉,确信你理解他,并将 ,能够帮助他。"这时,我的老师提高了嗓门,他说:"甚至 ,都不要 想 去接这个病人。把他赶出办公室!他很可能是一个心理变态者,你将无法帮助他。"
Many years later I learned that the teacher had warned us against psychopathic charm , and the leading authority in the study of psychopathy confirmed that the teacher’s advice was sound. The analogy to the Müller Lyer illusion is close. What we were being taught was not how to feel about that patient. Our teacher took it for granted that the sympathy we would feel for the patient would not be under our control; it would arise from System 1. Furthermore, we were not being taught to be generally suspicious of our feelings about patients. We were told that a strong attraction to a patient with a repeated history of failed treatment is a danger sign—like the fins on the parallel lines. It is an illusion—a cognitive illusion—and I (System 2) was taught how to recognize it and advised not to believe it or act on it.
许多年后,我了解到,这位老师曾警告我们不要有 精神病态的魅力 ,研究精神病态的权威人士证实,这位老师的建议是正确的。 与穆勒-莱尔幻觉的类比很接近。我们被教导的是 ,而不是如何感受那个病人。我们的老师想当然地认为,我们对病人的同情心不受我们的控制;它来自系统1。此外,我们没有被教导要普遍怀疑我们对病人的感觉。 我们被告知,对一个有反复治疗失败历史的病人的强烈吸引力是一个危险的信号,就像平行线上的鳍 。这是一种幻觉--一种认知上的幻觉--我(系统2)被教导如何识别它,并被建议不要相信它或按它行事。
The question that is most often asked about cognitive illusions is whether they can be overcome. The message of these examples is not encouraging. Because System 1 operates automatically and cannot be turned off at will, errors of intuitive thought are often difficult to prevent. Biases cannot always be avoided, because System 2 may have no clue to the error. Even when cues to likely errors are available, errors can be prevented only by the enhanced monitoring and effortful activity of System 2. As a way to live your life, however, continuous vigilance is not necessarily good, and it is certainly impractical. Constantly questioning our own thinking would be impossibly tedious, and System 2 is much too slow and inefficient to serve as a substitute for System 1 in making routine decisions. The best we can do is a compromise: learn to recognize situations in which mistakes are likely and try harder to avoid significant mistakes when the stakes are high. The premise of this book is that it is easier to recognize other people’s mistakes than our own.
关于认知错觉,人们最常问的问题是它们能否被克服。 这些例子所传递的信息并不令人鼓舞。 由于系统1是自动运行的,不能随意关闭,直觉 思维的错误往往难以防止。偏差不可能总是被避免,因为系统2可能没有错误的线索。 即使有可能出现错误的线索,也只能通过加强系统2的监测和努力活动来防止错误。 然而,作为一种生活方式,持续的警惕不一定是好事,而且肯定是不切实际的。不断质疑我们的 自己的思维将是不可能的乏味,而且系统2太慢,效率太低,无法替代系统1来做常规决策。我们能做的最好的事情是一个妥协:学会识别可能出现错误的情况,在风险大的时候更努力地避免重大错误。这本书的前提是,认识别人的错误比我们自己的错误更容易 。
USEFUL FICTIONS
有用的虚构
You have been invited to think of the two systems as agents within the mind, with their individual personalities, abilities, and limitations. I will often use sentences in which the systems are the subjects, such as, “System 2 calculates products.”
我经常使用的句子中,系统是主体,如 "系统2计算产品"。
The use of such language is considered a sin in the professional circles in which I travel, because it seems to explain the thoughts and actions of a person by the thoughts and actions of little people inside the person’s head. Grammatically the sentence about System 2 is similar to “The butler steals the petty cash.” My colleagues would point out that the butler’s action actually explains the disappearance of the cash, and they rightly question whether the sentence about System 2 explains how products are calculated. My answer is that the brief active sentence that attributes calculation to System 2 is intended as a description, not an explanation. It is meaningful only because of what you already know about System 2. It is shorthand for the following: “Mental arithmetic is a voluntary activity that requires effort, should not be performed while making a left turn, and is associated with dilated pupils and an accelerated heart rate.”
在我所处的专业圈子里,使用这种语言被认为是一种罪过,因为它似乎是在用一个人脑子里的 小人物 的想法和行动来解释 。在语法上,关于系统2的句子类似于 "管家偷了零用钱"。我的同事们会指出,管家的行为实际上解释了现金的消失,他们正确地质疑关于系统2的句子是否解释了产品是如何计算的。 我的回答是,将计算归于系统2的简短主动句是为了描述,而不是解释。它之所以有意义,只是因为你对系统2已经有所了解。它是以下内容的速记:"心算是一项需要努力的自愿活动,不应该在左转时进行,并且与扩大的瞳孔 和加速的心率有关。"
Similarly, the statement that “highway driving under routine conditions is left to System 1” means that steering the car around a bend is automatic and almost effortless. It also implies that an experienced driver can drive on an empty highway while conducting a conversation. Finally, “System 2 prevented James from reacting foolishly to the insult” means that James would have been more aggressive in his response if his capacity for effortful control had been disrupted (for example, if he had been drunk).
同样,"常规条件下的高速公路驾驶由系统1负责 "的说法意味着,在弯道上转向汽车是自动的,几乎毫不费力。这也意味着一个有经验的司机可以在空旷的高速公路上一边开车一边进行交谈。 最后,"系统2阻止了詹姆斯对侮辱做出愚蠢的反应 "意味着如果詹姆斯的努力控制能力被破坏(例如,如果他喝醉了),他的反应会更积极。 。
System 1 and System 2 are so central to the story I tell in this book that I must make it absolutely clear that they are fictitious characters. Systems 1 and 2 are not systems in the standard sense of entities with interacting aspects or parts. And there is no one part of the brain that either of the systems would call home. You may well ask: What is the point of introducing fictitious characters with ugly names into a serious book? The answer is that the characters are useful because of some quirks of our minds, yours and mine. A sentence is understood more easily if it describes what an agent (System 2) does than if it describes what something is, what properties it has. In other words, “System 2” is a better subject for a sentence than “mental arithmetic.” The mind—especially System 1—appears to have a special aptitude for the construction and interpretation of stories about active agents, who have personalities, habits, and abilities. You quickly formed a bad opinion of the thieving butler, you expect more bad behavior from him, and you will remember him for a while. This is also my hope for the language of systems.
系统1和系统2对我在本书中讲述的故事非常重要,我必须绝对清楚地说明,他们是虚构的人物。系统1和2不是标准意义上的系统,即具有相互作用的方面或部分的实体。而且, ,大脑中没有一个部位会被这两个系统称为家。 你可能会问:在一本严肃的书中引入名字丑陋的虚构人物有什么意义呢?答案是,这些人物之所以有用,是因为我们头脑中的一些怪癖,你的和我的。如果一个句子描述一个代理人(系统2)做什么,比描述什么东西 ,它有什么属性,更容易理解。换句话说,"系统2 "是一个比 "心算 "更好的句子主题。心灵--尤其是系统1--似乎对构建和解释关于主动代理人的故事有特殊的才能,这些人有个性、习惯和能力。 你很快对这个小偷管家形成了不好的看法,你期待他有更多的坏行为, ,你会记住他一段时间。这也是我对系统语言的希望。
Why call them System 1 and System 2 rather than the more descriptive “automatic system” and “effortful system”? The reason is simple: “Automatic system” takes longer to say than “System 1” and therefore takes more space in your working memory . This matters, because anything that occupies your working memory reduces your ability to think. You should treat “System 1” and “System 2” as nicknames, like Bob and Joe, identifying characters that you will get to know over the course of this book. The fictitious systems make it easier for me to think about judgment and choice, and will make it easier for you to understand what I say.
为什么称它们为系统1和系统2,而不是更具描述性的 "自动系统 "和 "努力系统"?原因很简单:"自动系统 "比 "系统1 "需要更长的时间,因此 在你的工作记忆中 需要更多的 空间 。这很重要, ,因为任何占据你工作记忆的东西都会降低 你的思考能力。你应该把 "系统1 "和 "系统2 "当作昵称,就像鲍勃和乔一样,识别你将在本书过程中认识的人物。虚构的系统使我更容易思考判断和选择,也会使你更容易理解我说的话。
SPEAKING OF SYSTEM 1 AND SYSTEM 2
说到系统1和系统2
“He had an impression, but some of his impressions are illusions.”
"他有一个印象,但他的一些印象 是幻觉。"
“This was a pure System 1 response. She reacted to the threat before she recognized it.”
"这是一个纯粹的系统1反应。 她在认识到威胁之前就对其做出了反应。"
“This is your System 1 talking. Slow down and let your System 2 take control.”
"这是你的系统1在说话。放慢速度,让你的系统2号来控制。"
2
2
Attention and Effort
注意力和努力
In the unlikely event of this book being made into a film, System 2 would be a supporting character who believes herself to be the hero. The defining feature of System 2, in this story, is that its operations are effortful, and one of its main characteristics is laziness, a reluctance to invest more effort than is strictly necessary. As a consequence, the thoughts and actions that System 2 believes it has chosen are often guided by the figure at the center of the story, System 1. However, there are vital tasks that only System 2 can perform because they require effort and acts of self-control in which the intuitions and impulses of System 1 are overcome.
如果这本书被拍成电影的可能性不大,系统2将是一个认为自己是英雄的配角。在这个故事中,系统2的决定性特征是它的操作 ,它的主要特征之一是懒惰,不愿意投入超过严格必要的努力。因此,系统2认为它所选择的思想和行动往往是由故事中心的人物--系统1引导的。然而,有些重要的任务只有系统2能够完成,因为它们需要努力和自我控制的行为,在这些行为中 ,系统1的直觉和冲动被战胜。
MENTAL EFFORT
精神上的努力
If you wish to experience your System 2 working at full tilt, the following exercise will do; it should bring you to the limits of your cognitive abilities within 5 seconds. To start, make up several strings of 4 digits, all different, and write each string on an index card. Place a blank card on top of the deck. The task that you will perform is called Add-1. Here is how it goes:
如果你想体验你的系统2全力以赴地工作,下面的练习就可以了;它应该在5秒钟内把你的认知能力带到极限。开始时,编几个4位数的字符串,都是不同的,并把每个字符串写在一张索引卡上。在这副牌的上面放一张空白牌。 ,你将执行的任务称为Add-1。情况是这样的。
Start beating a steady rhythm (or better yet, set a metronome at 1/sec). Remove the blank card and read the four digits aloud. Wait for two beats, then report a string in which each of the original digits is incremented by 1. If the digits on the card are 5294, the correct response is 6305. Keeping the rhythm is important.
开始打出稳定的节奏(或者最好将节拍器设置为1/秒)。取出空白卡片,大声读出四个数字。 等待两拍,然后报告一个字符串,其中每个原始数字递增1。 如果卡片上的数字是5294,正确答案是6305。保持节奏是很重要的。
Few people can cope with more than four digits in the Add-1 task, but if you want a harder challenge, please try Add-3.
很少有人能 ,应付超过四位数的加法-1任务,但如果你想要一个更难的挑战,请尝试加法-3。
If you would like to know what your body is doing while your mind is hard at work, set up two piles of books on a sturdy table, place a video camera on one and lean your chin on the other, get the video going, and stare at the camera lens while you work on Add-1 or Add-3 exercises. Later, you will find in the changing size of your pupils a faithful record of how hard you worked.
如果你想知道你的身体在努力工作时在做什么,可以在一张结实的桌子上摆上两堆书,把摄像机放在其中一本上,把你的下巴靠在另一本上,把视频打开,在你做加一或加三练习时盯着摄像机镜头看。以后, ,你会从你瞳孔的大小变化中发现你是如何努力工作的忠实记录。
I have a long personal history with the Add-1 task. Early in my career I spent a year at the University of Michigan, as a visitor in a laboratory that studied hypnosis. Casting about for a useful topic of research, I found an article in Scientific American in which the psychologist Eckhard Hess described the pupil of the eye as a window to the soul. I reread it recently and again found it inspiring. It begins with Hess reporting that his wife had noticed his pupils widening as he watched beautiful nature pictures, and it ends with two striking pictures of the same good-looking woman, who somehow appears much more attractive in one than in the other. There is only one difference: the pupils of the eyes appear dilated in the attractive picture and constricted in the other. Hess also wrote of belladonna, a pupil-dilating substance that was used as a cosmetic, and of bazaar shoppers who wear dark glasses in order to hide their level of interest from merchants.
我与Add-1任务有很长的个人历史。在我职业生涯的早期,我在密歇根大学呆了一年,作为一个研究催眠的实验室的访问者。 在寻找一个有用的研究课题时,我在《 科学美国人》 上发现了一篇文章,其中心理学家埃克哈德-赫斯将 ,将 眼睛的瞳孔 描述为通往灵魂的窗口。我最近重读了这本书,再次发现它很有启发。它的开头是赫斯报告说他的妻子注意到他在观看美丽的自然图片时瞳孔扩大了,它的结尾是两张引人注目的照片,都是同一个好看的女人,不知何故,她在其中一张中比另一张中显得更有吸引力。只有一个区别: ,眼睛的瞳孔在有吸引力的图片中出现扩张,而在另一个图片中则是收缩的。赫斯还写到了颠茄,一种被用作化妆品的瞳孔扩张物质,以及在集市上购物的人戴着黑框眼镜,以掩盖他们对商人的兴趣程度。
One of Hess’s findings especially captured my attention. He had noticed that the pupils are sensitive indicators of mental effort—they dilate substantially when people multiply two-digit numbers, and they dilate more if the problems are hard than if they are easy. His observations indicated that the response to mental effort is distinct from emotional arousal. Hess’s work did not have much to do with hypnosis, but I concluded that the idea of a visible indication of mental effort had promise as a research topic. A graduate student in the lab, Jackson Beatty, shared my enthusiasm and we got to work.
赫斯的一项发现尤其吸引了我的注意。 他注意到,瞳孔是衡量脑力劳动的敏感指标--当人们将两位数的数字相乘时,瞳孔 ,大幅放大,如果问题难,瞳孔放大的程度比容易的要大。他的观察表明,对心理努力的反应与情绪唤醒不同。赫斯的工作与催眠没有太大关系,但我得出结论,心理努力的可见指示这一想法有希望成为研究课题。实验室的一名研究生 ,Jackson Beatty,与我有同样的热情,我们开始了工作。
Beatty and I developed a setup similar to an optician’s examination room, in which the experimental participant leaned her head on a chin-and-forehead rest and stared at a camera while listening to prerecorded information and answering questions on the recorded beats of a metronome. The beats triggered an infrared flash every second, causing a picture to be taken. At the end of each experimental session, we would rush to have the film developed, project the images of the pupil on a screen, and go to work with a ruler. The method was a perfect fit for young and impatient researchers: we knew our results almost immediately, and they always told a clear story.
Beatty和我开发了一个类似于验光室的装置,在这个装置中,实验参与者将头靠在一个下巴和前额的支架上,盯着一个摄像头,同时聆听预先录制的信息,并在节拍器的记录节拍上回答问题。节拍每秒钟都会触发红外闪光灯 ,导致照片被拍下。在每个实验环节结束时,我们会赶去冲洗胶片,将瞳孔的图像投射到屏幕上,然后拿着尺子去工作。这种方法非常适合年轻而缺乏耐心的 研究人员:我们几乎马上就能知道我们的结果,而且它们总是讲述一个清晰的故事。
Beatty and I focused on paced tasks, such as Add-1, in which we knew precisely what was on the subject’s mind at any time. We recorded strings of digits on beats of the metronome and instructed the subject to repeat or transform the digits one by one, maintaining the same rhythm. We soon discovered that the size of the pupil varied second by second, reflecting the changing demands of the task. The shape of the response was an inverted V. As you experienced it if you tried Add-1 or Add-3, effort builds up with every added digit that you hear, reaches an almost intolerable peak as you rush to produce a transformed string during and immediately after the pause, and relaxes gradually as you “unload” your short-term memory. The pupil data corresponded precisely to subjective experience: longer strings reliably caused larger dilations, the transformation task compounded the effort, and the peak of pupil size coincided with maximum effort. Add-1 with four digits caused a larger dilation than the task of holding seven digits for immediate recall. Add-3, which is much more difficult, is the most demanding that I ever observed. In the first 5 seconds, the pupil dilates by about 50% of its original area and heart rate increases by about 7 beats per minute. This is as hard as people can work—they give up if more is asked of them. When we exposed our subjects to more digits than they could remember, their pupils stopped dilating or actually shrank.
Beatty和我专注于有节奏的任务,如Add-1, ,在这些任务中,我们准确地知道 受试者 在任何时候都在 想 什么。我们在节拍器的节拍上记录数字串,并指示受试者一个一个地重复或转换数字,保持相同的节奏。 我们很快发现,瞳孔的大小每秒都在变化,反映了任务的需求变化。 反应的形状是一个倒V形。如果你试过加法-1或加法-3,你会发现,每听到一个加法数字,你的努力就会增加,在暂停期间和紧接着的时间里,你急于产生一个转换的字符串,达到一个几乎无法忍受的高峰,而当你 "卸下 "你的短期记忆时,就会逐渐放松。瞳孔数据与主观体验完全吻合:较长的琴弦可靠地引起较大的扩张,转换 任务加重了努力,而瞳孔大小的峰值与最大努力相吻合。四个数字的加法-1比持有七个数字的任务引起更大的扩张,以便立即回忆。加三,难度更大,是我所观察到的要求最高的。在最初的5秒钟内,瞳孔扩大了约50%的原始面积, 心率增加了 约7次/ 分钟。当我们让受试者接触到比他们能够记住的更多的数字时,他们的瞳孔就会停止扩张或实际上缩小。
We worked for some months in a spacious basement suite in which we had set up a closed-circuit system that projected an image of the subject’s pupil on a screen in the corridor; we also could hear what was happening in the laboratory. The diameter of the projected pupil was about a foot; watching it dilate and contract when the participant was at work was a fascinating sight, quite an attraction for visitors in our lab. We amused ourselves and impressed our guests by our ability to divine when the participant gave up on a task. During a mental multiplication, the pupil normally dilated to a large size within a few seconds and stayed large as long as the individual kept working on the problem; it contracted immediately when she found a solution or gave up. As we watched from the corridor, we would sometimes surprise both the owner of the pupil and our guests by asking, “Why did you stop working just now?” The answer from inside the lab was often, “How did you know?” to which we would reply, “We have a window to your soul.”
我们在一个宽敞的地下室套房里工作了几个月,在那里我们建立了一个闭路系统,将受试者的瞳孔图像投射在走廊的屏幕上;我们还可以听到 实验室里发生的事情。投射的瞳孔直径约为一英尺;当受试者在工作时,观察它的扩张和收缩是一个迷人的景象,对我们实验室的访客具有相当的吸引力。我们自娱自乐,并通过预测参与者何时放弃一项任务的能力给客人留下深刻印象。 在心算乘法过程中,瞳孔通常会在几秒钟内扩大到很大,并且只要个人一直在做题,瞳孔就会保持很大;当她找到一个解决方案或放弃时,瞳孔会立即收缩。当我们在走廊上观看时,有时会惊讶地问瞳孔的主人和我们的客人:"你刚才为什么停止工作?" 来自实验室内部的回答往往是:"你怎么知道的?"对此,我们的回答是:"我们有一个通往你灵魂的窗口。"
The casual observations we made from the corridor were sometimes as informative as the formal experiments. I made a significant discovery as I was idly watching a woman’s pupil during a break between two tasks. She had kept her position on the chin rest, so I could see the image of her eye while she engaged in routine conversation with the experimenter. I was surprised to see that the pupil remained small and did not noticeably dilate as she talked and listened. Unlike the tasks that we were studying, the mundane conversation apparently demanded little or no effort—no more than retaining two or three digits. This was a eureka moment: I realized that the tasks we had chosen for study were exceptionally effortful. An image came to mind: mental life—today I would speak of the life of System 2—is normally conducted at the pace of a comfortable walk, sometimes interrupted by episodes of jogging and on rare occasions by a frantic sprint. The Add-1 and Add-3 exercises are sprints, and casual chatting is a stroll.
我们在走廊上进行的随意观察有时和正式的实验一样有意义。我有一个重要的发现,因为我 ,在两个任务之间的休息时间里,我闲闲地观察一个女人的瞳孔。她一直保持在下巴上的位置,所以我可以看到她的眼睛的图像,而她与实验者进行了例行的谈话。 我惊讶地看到,在她说话和倾听时,瞳孔仍然很小,没有明显的扩张。 与我们正在研究的任务不同,平凡的对话显然不需要什么努力--不超过保留两三个数字。 这是一个令人振奋的时刻:我意识到,我们选择的研究任务是非常努力的。我想到了一个形象:精神生活--今天 ,我说的是系统2的生活,通常以舒适的步行速度进行,有时被慢跑打断,在极少数情况下被疯狂的冲刺打断。添加-1和添加-3的练习是短跑,而休闲聊天是散步。
We found that people, when engaged in a mental sprint, may become effectively blind. The authors of The Invisible Gorilla had made the gorilla “invisible” by keeping the observers intensely busy counting passes. We reported a rather less dramatic example of blindness during Add-1. Our subjects were exposed to a series of rapidly flashing letters while they worked. They were told to give the task complete priority, but they were also asked to report, at the end of the digit task, whether the letter K had appeared at any time during the trial. The main finding was that the ability to detect and report the target letter changed in the course of the 10 seconds of the exercise. The observers almost never missed a K that was shown at the beginning or near the end of the Add-1 task but they missed the target almost half the time when mental effort was at its peak, although we had pictures of their wide-open eye staring straight at it. Failures of detection followed the same inverted-V pattern as the dilating pupil. The similarity was reassuring: the pupil was a good measure of the physical arousal that accompanies mental effort, and we could go ahead and use it to understand how the mind works.
我们发现,人们在进行心理冲刺时,可能会变得有效盲目。 隐形大猩猩》 一书的作者通过让观察者忙于计算传球次数,使大猩猩 "隐形"。我们报告了一个在Add-1期间失明的相当不引人注目的例子。我们的受试者在工作时接触到一系列 快速闪烁的字母 。 他们被告知要完全优先完成任务,但他们也被要求在数字任务结束时报告字母 K 是否在 试验期间出现过。主要发现是检测和报告目标字母的能力在10秒的练习过程中发生了变化。 观察者几乎从未错过在Add-1任务开始或接近结束时显示的 K ,但他们几乎有一半的时间在心理努力达到高峰时错过了目标,尽管我们有他们睁大眼睛直盯着 。检测失败的情况与瞳孔扩大的情况相同,呈倒V模式。这种相似性令人欣慰:瞳孔是对伴随着精神努力的身体唤醒的一个很好的衡量标准,我们可以继续使用它来了解思想的运作。
Much like the electricity meter outside your house or apartment, the pupils offer an index of the current rate at which mental energy is used. The analogy goes deep. Your use of electricity depends on what you choose to do, whether to light a room or toast a piece of bread. When you turn on a bulb or a toaster, it draws the energy it needs but no more. Similarly, we decide what to do, but we have limited control over the effort of doing it. Suppose you are shown four digits, say, 9462, and told that your life depends on holding them in memory for 10 seconds. However much you want to live, you cannot exert as much effort in this task as you would be forced to invest to complete an Add-3 transformation on the same digits.
很像 你房子或公寓外面的 电表 ,学生们提供了一个当前精神 能量使用率的指数。这个比喻很深刻。你的用电取决于你选择做什么,是照亮一个房间还是烘烤一块面包。 当你打开一个灯泡或烤面包机时,它吸取了它需要的能量,但没有更多。同样,我们决定做什么,但我们对做这件事的努力控制有限。 假设给你看四个数字,例如9462,并告诉你,你的生命取决于 ,在记忆中保持它们10秒钟。无论你多么想活下去,你都不可能在这项任务中付出与完成相同数字的加法-3转换所被迫投入的那么多努力。
System 2 and the electrical circuits in your home both have limited capacity, but they respond differently to threatened overload. A breaker trips when the demand for current is excessive, causing all devices on that circuit to lose power at once. In contrast, the response to mental overload is selective and precise: System 2 protects the most important activity, so it receives the attention it needs; “spare capacity” is allocated second by second to other tasks. In our version of the gorilla experiment, we instructed the participants to assign priority to the digit task. We know that they followed that instruction, because the timing of the visual target had no effect on the main task. If the critical letter was presented at a time of high demand, the subjects simply did not see it. When the transformation task was less demanding, detection performance was better.
系统 2和你家里的电路都有有限的容量,但它们对威胁过载的反应不同。当对电流的需求过大时,断路器就会跳闸,导致 ,该电路上的所有设备都会立即失去电源。相比之下,对心理超负荷的反应是有选择的和精确的:系统2保护最重要的活动,所以它得到了它所需要的关注;"备用容量 "被秒速分配给其他任务。 在我们版本的大猩猩实验中,我们指示参与者将优先权分配给数字任务。我们知道, ,他们遵循了这一指令,因为视觉目标的时间对主要任务没有影响。如果关键字母是在要求较高的时候出现,受试者根本就看不到它。 当转换任务的要求较低时,检测性能会更好。
The sophisticated allocation of attention has been honed by a long evolutionary history. Orienting and responding quickly to the gravest threats or most promising opportunities improved the chance of survival, and this capability is certainly not restricted to humans. Even in modern humans, System 1 takes over in emergencies and assigns total priority to self-protective actions. Imagine yourself at the wheel of a car that unexpectedly skids on a large oil slick. You will find that you have responded to the threat before you became fully conscious of it.
复杂的注意力分配在漫长的进化历史中得到了磨练。对最严重的威胁或最有希望的机会进行定位和快速反应 ,提高了生存的机会,这种能力当然不限于人类。 即使在现代人中,系统1在紧急情况下也会接管,并将自我保护的行动完全置于优先地位。 想象一下,你在一辆汽车的方向盘上意外地滑向一个大油层。你会发现,在你完全意识到威胁之前,你已经对 。
Beatty and I worked together for only a year, but our collaboration had a large effect on our subsequent careers. He eventually became the leading authority on “cognitive pupillometry,” and I wrote a book titled Attention and Effort , which was based in large part on what we learned together and on follow-up research I did at Harvard the following year. We learned a great deal about the working mind—which I now think of as System 2—from measuring pupils in a wide variety of tasks.
比蒂和我只合作了一年,但我们的合作对我们后来的职业生涯产生了很大影响。他最终成为 "认知瞳孔测量法 "的权威,而我则写了一本名为《 注意力与努力 》的书,这本书在很大程度上是基于我们一起学到的东西,以及我在接下来的一年里在哈佛大学所做的后续研究 。我们从测量学生的各种任务中,学到了大量关于工作思维的知识--我现在认为是第二系统。
As you become skilled in a task, its demand for energy diminishes. Studies of the brain have shown that the pattern of activity associated with an action changes as skill increases, with fewer brain regions involved. Talent has similar effects. Highly intelligent individuals need less effort to solve the same problems , as indicated by both pupil size and brain activity. A general “law of least effort” applies to cognitive as well as physical exertion. The law asserts that if there are several ways of achieving the same goal, people will eventually gravitate to the least demanding course of action. In the economy of action, effort is a cost, and the acquisition of skill is driven by the balance of benefits and costs . Laziness is built deep into our nature.
当你熟练掌握一项任务时,它对能量的需求就会减少。 对大脑的研究 表明,随着技能的提高,与行动相关的活动模式会发生变化,参与的大脑区域会减少。 天赋也有类似的影响。高智商的人 , 解决同样的问题 需要 更少的努力 ,瞳孔大小和大脑活动都表明了这一点。 一个普遍的 "最小努力法则 " 适用于认知和体力消耗的情况。该法则断言,如果有几种方法可以实现同样的目标,人们最终会倾向于要求最低的行动方案。 在行动经济中,努力是一种成本,技能的获得 ,是由 利益和成本的平衡 驱动的。 懒惰已经深入我们的本性。
The tasks that we studied varied considerably in their effects on the pupil. At baseline, our subjects were awake, aware, and ready to engage in a task—probably at a higher level of arousal and cognitive readiness than usual. Holding one or two digits in memory or learning to associate a word with a digit (3 = door) produced reliable effects on momentary arousal above that baseline, but the effects were minuscule, only 5% of the increase in pupil diameter associated with Add-3. A task that required discriminating between the pitch of two tones yielded significantly larger dilations. Recent research has shown that inhibiting the tendency to read distracting words (as in figure 2 of the preceding chapter) also induces moderate effort. Tests of short-term memory for six or seven digits were more effortful. As you can experience, the request to retrieve and say aloud your phone number or your spouse’s birthday also requires a brief but significant effort, because the entire string must be held in memory as a response is organized. Mental multiplication of two-digit numbers and the Add-3 task are near the limit of what most people can do.
我们研究的任务在对学生的影响方面有很大的不同。在基线时,我们的受试者是清醒的,有意识的,并准备从事 ,可能处于比平时更高的唤醒和认知准备水平。在记忆中保持一个或两个数字,或学习将一个词与 一个数字(3=门)联系起来,对高于该基线的瞬间唤醒产生了可靠的影响,但这种影响是微不足道的,只有与Add-3有关的瞳孔直径增加的5%。一项需要辨别两个音调的任务产生了明显较大的扩张。 最近的研究表明,抑制 阅读分心词 的倾向(如前面 章的 图2 )也会引起适度的努力。对六位或七位数字的短期记忆测试更加费力。正如你所体验到的,检索并大声说出你的电话号码或你配偶的生日的请求也需要短暂但重要的努力,因为在组织响应时,整个字符串必须被保存在记忆中。两位数的心理乘法和Add-3 任务接近大多数人能够做到的极限。
What makes some cognitive operations more demanding and effortful than others? What outcomes must we purchase in the currency of attention? What can System 2 do that System 1 cannot? We now have tentative answers to these questions.
是什么让一些认知操作比其他操作更加苛刻和费力?我们必须用注意力的货币购买什么结果? 系统2能做什么,而系统1不能? 我们现在对这些问题有了初步的答案。
Effort is required to maintain simultaneously in memory several ideas that require separate actions, or that need to be combined according to a rule—rehearsing your shopping list as you enter the supermarket, choosing between the fish and the veal at a restaurant, or combining a surprising result from a survey with the information that the sample was small, for example. System 2 is the only one that can follow rules, compare objects on several attributes, and make deliberate choices between options. The automatic System 1 does not have these capabilities. System 1 detects simple relations (“they are all alike,” “the son is much taller than the father”) and excels at integrating information about one thing, but it does not deal with multiple distinct topics at once, nor is it adept at using purely statistical information. System 1 will detect that a person described as “a meek and tidy soul, with a need for order and structure, and a passion for detail” resembles a caricature librarian, but combining this intuition with knowledge about the small number of librarians is a task that only System 2 can perform—if System 2 knows how to do so, which is true of few people.
需要努力在记忆中同时保持几个需要单独行动的想法,或者需要根据一个规则将这些想法 ,例如在进入超市时排练你的购物清单,在餐馆中选择鱼和小牛肉,或者将调查中的一个令人惊讶的结果与样本很小的信息相结合。系统2是唯一能够遵循规则、在几个属性上比较对象,并在选项之间做出慎重选择的系统。 自动的系统1不具备这些能力。系统1能够检测简单的关系("他们都是一样的","儿子比父亲高很多"),并擅长整合关于一个事物的信息,但它不能同时处理多个不同的主题,也不善于使用纯粹的统计信息。系统1会发现,一个被描述为 "灵魂温顺整洁, ,需要秩序和结构,对细节充满热情 "的人很像漫画中的图书馆员,但将这种直觉与关于少数图书馆员的知识相结合,是一项只有系统2才能完成的任务--如果系统2知道如何去做,而这是很少有人知道的。
A crucial capability of System 2 is the adoption of “task sets”: it can program memory to obey an instruction that overrides habitual responses. Consider the following: Count all occurrences of the letter f in this page. This is not a task you have ever performed before and it will not come naturally to you, but your System 2 can take it on. It will be effortful to set yourself up for this exercise, and effortful to carry it out, though you will surely improve with practice. Psychologists speak of “executive control” to describe the adoption and termination of task sets, and neuroscientists have identified the main regions of the brain that serve the executive function. One of these regions is involved whenever a conflict must be resolved. Another is the prefrontal area of the brain, a region that is substantially more developed in humans than in other primates, and is involved in operations that we associate with intelligence .
系统2的一个重要能力是采用 "任务集":它可以对内存进行编程,使其服从一个指令,而这个指令会推翻 的习惯性反应。考虑以下情况:计算这一页中所有字母 f 的出现次数。这不是你以前执行过的任务,对你来说也不会很自然,但你的系统2可以承担这个任务。为这一练习设置 ,并努力执行,尽管你一定会在实践中得到改善,但会很费力。心理学家谈到 "执行控制 " ,以描述任务集的采用和终止,神经科学家已经确定了大脑中服务于执行功能的主要区域。其中一个区域在必须解决冲突时就会参与。另一个是大脑的前额区,这个区域在人类中的发育程度大大高于其他灵长类动物,它参与了我们 与智能有关的 操作 。
Now suppose that at the end of the page you get another instruction: count all the commas in the next page. This will be harder, because you will have to overcome the newly acquired tendency to focus attention on the letter f . One of the significant discoveries of cognitive psychologists in recent decades is that switching from one task to another is effortful, especially under time pressure . The need for rapid switching is one of the reasons that Add-3 and mental multiplication are so difficult. To perform the Add-3 task, you must hold several digits in your working memory at the same time, associating each with a particular operation: some digits are in the queue to be transformed, one is in the process of transformation, and others, already transformed, are retained for reporting. Modern tests of working memory require the individual to switch repeatedly between two demanding tasks, retaining the results of one operation while performing the other. People who do well on these tests tend to do well on tests of general intelligence . However, the ability to control attention is not simply a measure of intelligence; measures of efficiency in the control of attention predict performance of air traffic controllers and of Israeli Air Force pilots beyond the effects of intelligence.
现在假设在这一页的最后,你得到了另一个指令:计算下一页中所有的逗号。近几十年来,认知心理学家的一个重要发现是,从一个任务切换到另一个任务是很费力的,尤其是 在时间紧迫的情况下 。对快速切换的需求是加3和心算乘法如此困难的原因之一。为了执行Add-3任务,你必须在你的 工作记忆 中同时保留几个数字,将每个数字与一个特定的操作联系起来:一些数字在队列中被转化,一个数字正在转化过程中,而其他已经转化的数字则被保留下来用于报告。 现代的工作记忆测试要求个人在两个高要求的任务之间反复切换,在执行另一个任务时保留一个操作的结果。在这些测试中表现良好的人往往在 一般智力测试 中表现良好。然而,控制注意力的能力并不仅仅是衡量智力的标准;控制注意力的效率的措施可以预测空中交通管制员和 以色列空军飞行员 的表现 ,超越智力的影响。
Time pressure is another driver of effort. As you carried out the Add-3 exercise, the rush was imposed in part by the metronome and in part by the load on memory. Like a juggler with several balls in the air, you cannot afford to slow down; the rate at which material decays in memory forces the pace, driving you to refresh and rehearse information before it is lost. Any task that requires you to keep several ideas in mind at the same time has the same hurried character. Unless you have the good fortune of a capacious working memory, you may be forced to work uncomfortably hard. The most effortful forms of slow thinking are those that require you to think fast.
时间压力是另一个努力的驱动因素。当你进行Add-3练习时,急促的节奏部分是由节拍器造成的,部分是由记忆的负荷造成的。就像一个在空中玩杂耍的人,你不能慢下来;材料在记忆中衰减的速度迫使 ,促使你在信息丢失之前进行刷新和排练。任何需要你在同一时间保持几个想法的任务都具有同样的匆忙特点。除非你幸运地拥有大容量的工作记忆,否则你可能被迫不自觉地努力工作。最努力的慢速思考形式是那些需要你快速思考的形式。
You surely observed as you performed Add-3 how unusual it is for your mind to work so hard. Even if you think for a living, few of the mental tasks in which you engage in the course of a working day are as demanding as Add-3, or even as demanding as storing six digits for immediate recall. We normally avoid mental overload by dividing our tasks into multiple easy steps, committing intermediate results to long-term memory or to paper rather than to an easily overloaded working memory. We cover long distances by taking our time and conduct our mental lives by the law of least effort.
你肯定观察到 ,因为你执行了Add-3,你的头脑如此努力工作是多么不寻常。即使你以思考为生,你在工作过程中所从事的精神任务中,很少有像 Add-3,甚至像存储六位数以便立即回忆那样的要求。我们通常通过将任务分为多个简单的步骤,将中间结果存入长期 记忆或纸张,而不是存入容易超载的工作记忆,来避免精神超载。我们通过慢慢地走,按照最小努力法则进行精神生活。
SPEAKING OF ATTENTION AND EFFORT
说到注意和努力
“I won’t try to solve this while driving. This is a pupil-dilating task. It requires mental effort!”
"我不会在开车时尝试解决这个问题。这是个使学生瘦身的任务。这需要精神上的努力!"
“The law of least effort is operating here. He will think as little as possible.”
"最小努力法则 "在这里发挥作用。他将尽可能少地思考。"
“She did not forget about the meeting. She was completely focused on something else when the meeting was set and she just didn’t hear you.”
"她 ,并没有忘记这次会议。会议安排时,她完全专注于其他事情,她只是没有听到你的声音。"
“What came quickly to my mind was an intuition from System 1. I’ll have to start over and search my memory deliberately.”
"我脑海中迅速浮现的是系统1的直觉,我得重新开始,刻意搜索我的记忆。"
3
3
The Lazy Controller
懒惰的控制器
I spend a few months each year in Berkeley, and one of my great pleasures there is a daily four-mile walk on a marked path in the hills, with a fine view of San Francisco Bay. I usually keep track of my time and have learned a fair amount about effort from doing so. I have found a speed, about 17 minutes for a mile, which I experience as a stroll. I certainly exert physical effort and burn more calories at that speed than if I sat in a recliner, but I experience no strain, no conflict, and no need to push myself. I am also able to think and work while walking at that rate. Indeed, I suspect that the mild physical arousal of the walk may spill over into greater mental alertness.
我每年在伯克利呆上几个月,我在那里最大的乐趣之一是每天在山上一条有标记的小路上散步,可以看到旧金山湾的美好景色。我通常会记录我的时间, ,从中了解到相当多的努力。我已经找到了一个速度,一英里大约17分钟,我把它当作一种漫步来体验。在这个速度下,我当然要付出体力,并比坐在躺椅上燃烧更多的卡路里,但我没有感到劳累,没有冲突,也不需要强迫自己。 在以这个速度行走时,我也能思考和工作。事实上,我怀疑散步时温和的 身体唤醒可能会蔓延到更大的精神警觉性。
System 2 also has a natural speed. You expend some mental energy in random thoughts and in monitoring what goes on around you even when your mind does nothing in particular, but there is little strain. Unless you are in a situation that makes you unusually wary or self-conscious, monitoring what happens in the environment or inside your head demands little effort. You make many small decisions as you drive your car, absorb some information as you read the newspaper, and conduct routine exchanges of pleasantries with a spouse or a colleague, all with little effort and no strain. Just like a stroll.
系统2也有一个自然的速度。你在随机思考和监测你周围发生的事情时消耗一些精神能量,即使你的思想不做任何特别的事情,但也没有什么压力。除非你处在一个让你异常警惕或自我意识的情况下,监控环境中发生的事情 ,或在你的脑海中需要很少的努力。你在开车时做出许多小决定,在阅读报纸时吸收一些信息,并与配偶或同事进行例行的寒暄,所有这些都不需要努力,也没有压力。就像散步一样。
It is normally easy and actually quite pleasant to walk and think at the same time, but at the extremes these activities appear to compete for the limited resources of System 2. You can confirm this claim by a simple experiment. While walking comfortably with a friend, ask him to compute 23 × 78 in his head, and to do so immediately. He will almost certainly stop in his tracks. My experience is that I can think while strolling but cannot engage in mental work that imposes a heavy load on short-term memory. If I must construct an intricate argument under time pressure, I would rather be still, and I would prefer sitting to standing. Of course, not all slow thinking requires that form of intense concentration and effortful computation—I did the best thinking of my life on leisurely walks with Amos.
通常情况下,一边走路一边思考是很容易的,实际上也是很愉快的,但在极端情况下,这些活动 ,似乎在争夺系统2的有限资源。你可以通过一个简单的实验来证实这一说法。在和一个朋友舒适地走路时,让他在脑子里计算23×78,并立即这样做。他几乎肯定会停下 。我的经验是,我可以在散步时思考,但不能从事对短期记忆造成沉重负担的脑力劳动。如果我 ,必须在时间压力下构建一个错综复杂的论点,我宁愿静止不动,而且我宁愿坐着也不愿意站着。当然,并不是所有的缓慢思考都需要这种形式的高度集中和努力计算--我在与阿莫斯悠闲地散步时做了我生命中最好的思考。
Accelerating beyond my strolling speed completely changes the experience of walking, because the transition to a faster walk brings about a sharp deterioration in my ability to think coherently. As I speed up, my attention is drawn with increasing frequency to the experience of walking and to the deliberate maintenance of the faster pace. My ability to bring a train of thought to a conclusion is impaired accordingly. At the highest speed I can sustain on the hills, about 14 minutes for a mile, I do not even try to think of anything else. In addition to the physical effort of moving my body rapidly along the path, a mental effort of self-control is needed to resist the urge to slow down. Self-control and deliberate thought apparently draw on the same limited budget of effort.
加速超过我的漫步速度,完全改变了行走的体验,因为过渡 ,使我连贯思考的能力急剧恶化。随着我的加速,我的注意力越来越频繁地被吸引到行走的体验和刻意维持更快的步伐。我将一连串的思考带入结论的能力也相应受到影响。在山路上我能维持的最高速度,大约14分钟一英里,我做 ,甚至不试图去想其他事情。除了身体上的努力,沿着道路快速移动我的身体,还需要精神上的自我控制,以抵制减速的冲动。自我控制和深思熟虑显然是利用了同样有限的努力预算。
For most of us, most of the time, the maintenance of a coherent train of thought and the occasional engagement in effortful thinking also require self-control. Although I have not conducted a systematic survey, I suspect that frequent switching of tasks and speeded-up mental work are not intrinsically pleasurable, and that people avoid them when possible. This is how the law of least effort comes to be a law. Even in the absence of time pressure, maintaining a coherent train of thought requires discipline. An observer of the number of times I look at e-mail or investigate the refrigerator during an hour of writing could reasonably infer an urge to escape and conclude that keeping at it requires more self-control than I can readily muster.
对我们大多数人来说,大多数时候,保持连贯的思路和偶尔从事 努力的思考也需要自我控制。虽然我没有进行系统的调查,但我怀疑频繁地转换任务和加速的脑力劳动在本质上并不令人愉快,人们尽可能地避免它们。 这就是最小努力法则的来历。 即使在没有时间压力的情况下,保持连贯的思路也需要纪律性。一个 ,观察我在一小时的写作过程中看电子邮件或调查冰箱的次数,可以合理地推断出我有逃避的冲动,并得出结论,保持这种冲动需要更多的自制力,而不是我可以轻易做到的。
Fortunately, cognitive work is not always aversive, and people sometimes expend considerable effort for long periods of time without having to exert willpower. The psychologist Mihaly Csikszentmihalyi (pronounced six-cent-mihaly) has done more than anyone else to study this state of effortless attending, and the name he proposed for it, flow , has become part of the language. People who experience flow describe it as “a state of effortless concentration so deep that they lose their sense of time, of themselves, of their problems,” and their descriptions of the joy of that state are so compelling that Csikszentmihalyi has called it an “optimal experience.” Many activities can induce a sense of flow, from painting to racing motorcycles—and for some fortunate authors I know, even writing a book is often an optimal experience. Flow neatly separates the two forms of effort: concentration on the task and the deliberate control of attention. Riding a motorcycle at 150 miles an hour and playing a competitive game of chess are certainly very effortful. In a state of flow, however, maintaining focused attention on these absorbing activities requires no exertion of self-control, thereby freeing resources to be directed to the task at hand.
幸运的是,认知工作并不总是令人厌恶的,人们有时会在很长一段时间内花费相当大的努力,而不必发挥意志力。 心理学家Mihaly Csikszentmihalyi(发音为6-cent-mihaly)在研究这种毫不费力的出席状态方面做得比任何人都多,而他为其提出的名字-- 流 ,已经成为语言的一部分。体验流的人将其描述为 "一种毫不费力的集中状态,以至于他们失去了对时间、对自己、对问题的感觉",他们对 ,这种状态的快乐的描述是如此引人注目,以至于Csikszentmihalyi将其称为 "最佳体验"。 许多活动都能引起一种流动感,从绘画到飙车--对于我认识的一些幸运的作者来说,甚至写书也常常是一种最佳的 。流动将两种形式的努力整齐地分开:专注于任务和有意控制注意力。以每小时150英里的速度骑摩托车 ,以及下一盘竞争性的国际象棋,当然是非常努力的。然而,在流动状态下,保持对这些吸收性活动的集中注意力,不需要施加自我控制,从而释放出资源,使之用于手头的任务。
THE BUSY AND DEPLETED SYSTEM 2
忙碌而枯竭的系统 2
It is now a well-established proposition that both self-control and cognitive effort are forms of mental work. Several psychological studies have shown that people who are simultaneously challenged by a demanding cognitive task and by a temptation are more likely to yield to the temptation. Imagine that you are asked to retain a list of seven digits for a minute or two. You are told that remembering the digits is your top priority. While your attention is focused on the digits, you are offered a choice between two desserts: a sinful chocolate cake and a virtuous fruit salad. The evidence suggests that you would be more likely to select the tempting chocolate cake when your mind is loaded with digits. System 1 has more influence on behavior when System 2 is busy, and it has a sweet tooth .
现在一个公认的命题是,自我控制和认知努力 ,都是心理工作的形式。一些心理学研究表明,同时受到苛刻的认知任务和诱惑挑战的人,更有可能向诱惑屈服。想象一下,你被要求在一两分钟内记住一串七位数的数字。 你被告知记住这些数字是你的首要任务。当你的注意力集中在这些数字上时,你 ,让你在两种甜点中选择:一种是罪恶的巧克力蛋糕,一种是良性的水果沙拉。证据表明,当你的头脑中充满数字时,你会更有可能选择诱人的巧克力蛋糕。 当系统2忙碌时,系统1对行为的影响更大,而且它 喜欢吃甜食 。
People who are cognitively busy are also more likely to make selfish choices, use sexist language, and make superficial judgments in social situations. Memorizing and repeating digits loosens the hold of System 2 on behavior, but of course cognitive load is not the only cause of weakened self-control. A few drinks have the same effect, as does a sleepless night. The self-control of morning people is impaired at night; the reverse is true of night people. Too much concern about how well one is doing in a task sometimes disrupts performance by loading short-term memory with pointless anxious thoughts . The conclusion is straightforward: self-control requires attention and effort. Another way of saying this is that controlling thoughts and behaviors is one of the tasks that System 2 performs.
认知繁忙 的人也更容易做出自私的选择,使用性别歧视的 ,并在社交场合做出肤浅的判断。 背诵和重复数字会放松系统2对行为的控制,当然,认知负荷并不是削弱自制力的唯一原因。 喝几杯酒有同样的效果,一个不眠之夜也是如此。早晨的人在晚上的自制力会受到影响;晚上的人则相反。太关心自己在一项任务中的表现如何 ,有时会使短期记忆中充满 无意义的焦虑想法 ,从而扰乱表现。 结论很直接:自制力需要注意和努力。另一种说法是,控制思想和行为是系统2所执行的任务之一。
A series of surprising experiments by the psychologist Roy Baumeister and his colleagues has shown conclusively that all variants of voluntary effort—cognitive, emotional, or physical—draw at least partly on a shared pool of mental energy. Their experiments involve successive rather than simultaneous tasks.
心理学家罗伊-鲍迈斯特(Roy Baumeister)和他的同事们进行的一系列令人惊讶的实验 ,最终表明所有变种的自愿努力--认知的、情感的或身体的--至少部分地依赖于一个共享的心理能量库。他们的实验涉及连续的而不是同时的任务。
Baumeister’s group has repeatedly found that an effort of will or self-control is tiring; if you have had to force yourself to do something, you are less willing or less able to exert self-control when the next challenge comes around. The phenomenon has been named ego depletion . In a typical demonstration, participants who are instructed to stifle their emotional reaction to an emotionally charged film will later perform poorly on a test of physical stamina—how long they can maintain a strong grip on a dynamometer in spite of increasing discomfort. The emotional effort in the first phase of the experiment reduces the ability to withstand the pain of sustained muscle contraction, and ego-depleted people therefore succumb more quickly to the urge to quit. In another experiment, people are first depleted by a task in which they eat virtuous foods such as radishes and celery while resisting the temptation to indulge in chocolate and rich cookies. Later, these people will give up earlier than normal when faced with a difficult cognitive task.
鲍迈斯特的研究小组多次发现,意志或自我控制的努力是很累人的;如果你不得不强迫自己做某事,那么当下一个挑战来临时,你就不太愿意或不太能够 。这种现象被命名为 自我耗竭 。在一个典型的示范中,被要求扼制对一部情绪激动的电影的情绪反应的参与者,后来在身体耐力的测试中会表现得很差--尽管越来越不舒服,但他们能在测力计上保持强有力的握力。实验第一阶段的情感努力 ,降低了承受持续肌肉收缩的痛苦的能力,自我耗尽的人因此更快屈服于放弃的冲动。在另一个实验中,人们首先被一项任务耗尽,在这项任务中,他们吃萝卜和芹菜等良性食物,同时抵制放纵巧克力和丰富饼干的诱惑。后来,这些人 ,在面对一项困难的认知任务时,会比正常人更早放弃。
The list of situations and tasks that are now known to deplete self-control is long and varied. All involve conflict and the need to suppress a natural tendency. They include:
现在已知的消耗自我控制能力的情况和任务清单很长,而且种类繁多。 所有这些都涉及到冲突和抑制自然倾向的需要。 它们包括。
avoiding the thought of white bears
避开白熊的想法
inhibiting the emotional response to a stirring film
抑制对激动人心的电影的情感反应
making a series of choices that involve conflict
做出一系列涉及冲突的选择
trying to impress others
讨好别人
responding kindly to a partner’s bad behavior
对伴侣的不良行为作出善意的回应
interacting with a person of a different race (for prejudiced individuals)
与不同种族的人交往(对于有偏见的人)。
The list of indications of depletion is also highly diverse:
枯竭的迹象清单也是高度多样化的。
deviating from one’s diet
饮食不节
overspending on impulsive purchases
冲动消费,过度消费
reacting aggressively to provocation
挑衅性的反应
persisting less time in a handgrip task
在手握任务中坚持的时间较短
performing poorly in cognitive tasks and logical decision making
在认知任务和逻辑决策方面表现不佳 。
The evidence is persuasive: activities that impose high demands on System 2 require self-control, and the exertion of self-control is depleting and unpleasant. Unlike cognitive load, ego depletion is at least in part a loss of motivation. After exerting self-control in one task, you do not feel like making an effort in another, although you could do it if you really had to. In several experiments, people were able to resist the effects of ego depletion when given a strong incentive to do so. In contrast, increasing effort is not an option when you must keep six digits in short-term memory while performing a task. Ego depletion is not the same mental state as cognitive busyness.
证据是有说服力的:对系统2提出高要求的活动需要自我控制,而 自我控制的发挥 是消耗性的,令人不快的。与认知负荷不同,自我耗损至少在某种程度上是一种动机的丧失。 在一项任务中发挥了自我控制能力后,你不觉得在另一项任务中作出努力,尽管如果你真的不得不这样做,你也可以这样做。在几个 实验中,当被给予强烈的激励时,人们能够 抵制自我耗损的影响 。相比之下,当你在执行 一项任务时,必须在短期记忆中保持六位数,增加努力是不可能的。自我耗损与认知忙碌的精神状态不同。
The most surprising discovery made by Baumeister’s group shows, as he puts it, that the idea of mental energy is more than a mere metaphor . The nervous system consumes more glucose than most other parts of the body, and effortful mental activity appears to be especially expensive in the currency of glucose. When you are actively involved in difficult cognitive reasoning or engaged in a task that requires self-control, your blood glucose level drops. The effect is analogous to a runner who draws down glucose stored in her muscles during a sprint. The bold implication of this idea is that the effects of ego depletion could be undone by ingesting glucose, and Baumeister and his colleagues have confirmed this hypothesis in several experiments.
鲍迈斯特小组最令人惊讶的发现表明,正如他所说, 精神能量的想法 不仅仅是一个比喻 。神经系统比身体的大多数其他部分消耗更多的葡萄糖,而努力的心理活动似乎特别昂贵的葡萄糖货币。 当你积极参与困难的认知推理或从事需要自我控制的任务时,你的血糖水平会下降。这种效果类似于一个跑步者在冲刺过程中 ,吸取储存在肌肉中的葡萄糖。这个想法的大胆含义是, 自我消耗的 影响可以通过摄取葡萄糖来消除,鲍迈斯特和他的同事已经在几个实验中证实了这个假设。
Volunteers in one of their studies watched a short silent film of a woman being interviewed and were asked to interpret her body language. While they were performing the task, a series of words crossed the screen in slow succession. The participants were specifically instructed to ignore the words, and if they found their attention drawn away they had to refocus their concentration on the woman’s behavior. This act of self-control was known to cause ego depletion. All the volunteers drank some lemonade before participating in a second task. The lemonade was sweetened with glucose for half of them and with Splenda for the others. Then all participants were given a task in which they needed to overcome an intuitive response to get the correct answer. Intuitive errors are normally much more frequent among ego-depleted people, and the drinkers of Splenda showed the expected depletion effect. On the other hand, the glucose drinkers were not depleted. Restoring the level of available sugar in the brain had prevented the deterioration of performance. It will take some time and much further research to establish whether the tasks that cause glucose-depletion also cause the momentary arousal that is reflected in increases of pupil size and heart rate.
在他们的一项研究中,志愿者们观看了一部关于一名妇女接受采访的无声短片,并被要求解释她的身体语言。 在他们执行这项任务时,屏幕上缓慢地连续划过一连串的文字。参与者被特别指示忽略这些文字,如果他们发现自己的注意力被吸引走,他们必须重新把注意力集中在该妇女的行为上。众所周知,这种自我控制的行为会导致自我消耗。 所有志愿者在参加第二个 任务之前都喝了一些柠檬水。其中一半人的柠檬水是用葡萄糖加的,其他的人是用Splenda加的。然后给所有参与者一个任务,他们需要克服直觉反应以获得正确的答案。 直觉错误通常在自我消耗的人中更频繁,喝Splenda的人显示了预期的消耗效果。另一方面,喝葡萄糖的人 ,没有被消耗。恢复大脑中可用的糖的水平已经防止了性能的恶化。需要一些时间和更多的研究来确定导致葡萄糖耗尽的任务是否也会引起瞬间的兴奋,这种兴奋反映在瞳孔大小和心率的增加上。
A disturbing demonstration of depletion effects in judgment was recently reported in the Proceedings of the National Academy of Sciences . The unwitting participants in the study were eight parole judges in Israel. They spend entire days reviewing applications for parole. The cases are presented in random order, and the judges spend little time on each one, an average of 6 minutes. (The default decision is denial of parole; only 35% of requests are approved. The exact time of each decision is recorded, and the times of the judges’ three food breaks—morning break, lunch, and afternoon break—during the day are recorded as well.) The authors of the study plotted the proportion of approved requests against the time since the last food break. The proportion spikes after each meal, when about 65% of requests are granted. During the two hours or so until the judges’ next feeding, the approval rate drops steadily, to about zero just before the meal. As you might expect, this is an unwelcome result and the authors carefully checked many alternative explanations. The best possible account of the data provides bad news: tired and hungry judges tend to fall back on the easier default position of denying requests for parole. Both fatigue and hunger probably play a role.
最近, 《美国国家科学院院刊》 报道了一个令人不安的 判断力耗损效应 的证明。研究中不知情的参与者是以色列的八名假释法官。他们花了整整一天时间审查假释申请。这些案件是按随机顺序提交的,法官在每个案件上花费的时间很少,平均为6分钟(默认的决定是拒绝假释;只有35%的请求被批准。每项决定的确切 ,同时也记录了法官在一天中的三次食物休息时间--早间休息、午餐和下午休息。)该研究的作者将 ,批准请求的比例与自上次食物休息以来的时间作了对比。该比例在每顿饭后达到顶峰,当时约有65%的请求被批准。在距离法官 下一次进食的两个小时左右,批准率稳步下降,在进食前约为零。如你所料,这是一个不受欢迎的结果,作者仔细检查了许多替代解释。 对数据的最佳解释提供了坏消息:疲惫和饥饿的法官倾向于回到拒绝假释请求这一较容易的默认立场。疲劳和饥饿可能都发挥了 作用。
THE LAZY SYSTEM 2
懒人系统2
One of the main functions of System 2 is to monitor and control thoughts and actions “suggested” by System 1, allowing some to be expressed directly in behavior and suppressing or modifying others.
系统2的主要功能之一是监测和控制系统1 "建议 "的思想和行动,允许一些直接表现在行为中,并压制或修改其他思想和行动。
For an example, here is a simple puzzle. Do not try to solve it but listen to your intuition:
举例来说,这里有一个简单的谜题。不要试图解决它,而是听从你的直觉。
A bat and ball cost $1.10.
一根球棒和一个球花了1.10美元。
The bat costs one dollar more than the ball.
球棒比球多花了一美元。
How much does the ball cost?
这个球的价格是多少?
A number came to your mind. The number, of course, is 10: 10¢. The distinctive mark of this easy puzzle is that it evokes an answer that is intuitive, appealing, and wrong. Do the math, and you will see. If the ball costs 10¢, then the total cost will be $1.20 (10¢ for the ball and $1.10 for the bat), not $1.10. The correct answer is 5¢. It is safe to assume that the intuitive answer also came to the mind of those who ended up with the correct number—they somehow managed to resist the intuition.
你的脑海中出现了一个数字。 这个数字当然是10:10美分。这道简单的谜题的独特标志是,它唤起了一个直观的、吸引人的、错误的答案。算一算,你就知道了。如果球的成本是10美分,那么总成本将是1.20美元(10美分的球和1.10美元的球棒),而不是1.10美元,正确答案是5美分。可以认为, ,那些最终得到正确数字的人脑海中也出现了直观的答案--他们以某种方式设法抵制直觉。
Shane Frederick and I worked together on a theory of judgment based on two systems, and he used the bat-and-ball puzzle to study a central question: How closely does System 2 monitor the suggestions of System 1? His reasoning was that we know a significant fact about anyone who says that the ball costs 10¢: that person did not actively check whether the answer was correct, and her System 2 endorsed an intuitive answer that it could have rejected with a small investment of effort. Furthermore, we also know that the people who give the intuitive answer have missed an obvious social cue; they should have wondered why anyone would include in a questionnaire a puzzle with such an obvious answer. A failure to check is remarkable because the cost of checking is so low: a few seconds of mental work (the problem is moderately difficult), with slightly tensed muscles and dilated pupils, could avoid an embarrassing mistake. People who say 10¢ appear to be ardent followers of the law of least effort. People who avoid that answer appear to have more active minds.
谢恩-弗雷德里克和我一起研究了基于两个系统的判断理论,他用球棒和球的难题来研究一个核心问题:系统2对系统1的建议有多密切的监控?他的理由是,我们知道一个重要的事实 ,即任何说球要花10美分的人:这个人没有主动检查答案是否正确,她的系统2认可了一个直观的答案,而它只要投入少量的精力就能拒绝这个答案。此外,我们还知道,给出直观答案的人错过了一个明显的社会线索;他们应该想知道为什么有人会在问卷调查中包括 ,一个有如此明显答案的谜题。没有检查是了不起的 ,因为检查的成本很低:几秒钟的脑力劳动(问题难度适中),稍微紧张的肌肉和放大的瞳孔,可以避免一个尴尬的错误。说10美分的人似乎是最小努力法则的狂热追随者。避开这个答案的人似乎有更活跃的 。
Many thousands of university students have answered the bat-and-ball puzzle, and the results are shocking. More than 50% of students at Harvard, MIT, and Princeton gave the intuitive—incorrect—answer . At less selective universities, the rate of demonstrable failure to check was in excess of 80%. The bat-and-ball problem is our first encounter with an observation that will be a recurrent theme of this book: many people are overconfident, prone to place too much faith in their intuitions. They apparently find cognitive effort at least mildly unpleasant and avoid it as much as possible.
数以千计的大学生回答了球棒和球的难题,结果令人震惊。在哈佛大学、麻省理工学院和普林斯顿大学,超过50%的学生给出了 直观正确的答案 。 在选择较少的大学中,明显没有检查的比率超过80%。蝙蝠和球的问题是我们第一次遇到一个观察,这将是本书的一个反复出现的主题 :许多人过于自信,容易过分相信他们的直觉。 他们显然发现认知的努力至少有轻微的不愉快,并尽可能地避免它。
Now I will show you a logical argument—two premises and a conclusion. Try to determine, as quickly as you can, if the argument is logically valid. Does the conclusion follow from the premises?
现在我给你看一个逻辑论证--两个前提和一个结论。 试着尽快判断这个论证在逻辑上是否有效。结论是由前提得出的吗?
All roses are flowers.
所有的玫瑰花都是花。
Some flowers fade quickly.
有些花很快就会凋谢。
Therefore some roses fade quickly.
因此有些玫瑰花很快就会褪色。
A large majority of college students endorse this syllogism as valid . In fact the argument is flawed, because it is possible that there are no roses among the flowers that fade quickly. Just as in the bat-and-ball problem, a plausible answer comes to mind immediately. Overriding it requires hard work—the insistent idea that “it’s true, it’s true!” makes it difficult to check the logic, and most people do not take the trouble to think through the problem.
绝大多数的大学生都认可这个 提纲是有效的 。事实上,这个论点是有缺陷的,因为有可能在迅速凋谢的花朵中没有玫瑰花。就像球棒和球的问题一样,一个看似合理的答案会立即出现在脑海中。 要推翻它需要付出艰苦的努力--"这是真的,这是真的!"这种坚持不懈的 ,使得人们很难检查逻辑,而且大多数人不会花心思去想这个问题。
This experiment has discouraging implications for reasoning in everyday life. It suggests that when people believe a conclusion is true, they are also very likely to believe arguments that appear to support it, even when these arguments are unsound. If System 1 is involved, the conclusion comes first and the arguments follow.
这个实验对日常生活中的推理有着令人沮丧的影响。 它表明,当人们相信一个结论是真的时候,他们也很有可能相信那些似乎支持这个结论的论据,即使这些论据是不健全的。如果 System 1,结论是第一位的,而论据是随后的。
Next, consider the following question and answer it quickly before reading on:
接下来,考虑以下问题,并在继续阅读之前快速回答。
How many murders occur in the state of Michigan in one year?
密歇根州一年中发生多少起谋杀案?
The question, which was also devised by Shane Frederick, is again a challenge to System 2. The “trick” is whether the respondent will remember that Detroit, a high-crime city, is in Michigan. College students in the United States know this fact and will correctly identify Detroit as the largest city in Michigan. But knowledge of a fact is not all-or-none. Facts that we know do not always come to mind when we need them. People who remember that Detroit is in Michigan give higher estimates of the murder rate in the state than people who do not, but a majority of Frederick’s respondents did not think of the city when questioned about the state. Indeed, the average guess by people who were asked about Michigan is lower than the guesses of a similar group who were asked about the murder rate in Detroit.
这个问题也是由谢恩-弗雷德里克设计的,又是对系统2的挑战。"诀窍 "在于受访者是否会记得 底特律,一个高犯罪率的城市, 在密歇根州。美国的大学生都知道这个事实,他们会正确地把底特律作为密歇根州最大的城市。但对一个事实的了解并不是全部或全部的。记得底特律在密歇根州的人对该州的谋杀率的估计高于不记得的人,但弗雷德里克的 ,大多数受访者在被问及该州时并没有想到这个城市。事实上,被问及密歇根州的人的平均猜测 低于 被问及底特律的谋杀率的类似群体的猜测。
Blame for a failure to think of Detroit can be laid on both System 1 and System 2. Whether the city comes to mind when the state is mentioned depends in part on the automatic function of memory. People differ in this respect. The representation of the state of Michigan is very detailed in some people’s minds: residents of the state are more likely to retrieve many facts about it than people who live elsewhere; geography buffs will retrieve more than others who specialize in baseball statistics; more intelligent individuals are more likely than others to have rich representations of most things. Intelligence is not only the ability to reason; it is also the ability to find relevant material in memory and to deploy attention when needed. Memory function is an attribute of System 1. However, everyone has the option of slowing down to conduct an active search of memory for all possibly relevant facts—just as they could slow down to check the intuitive answer in the bat-and-ball problem. The extent of deliberate checking and search is a characteristic of System 2, which varies among individuals.
未能想到底特律的责任可以同时归咎于系统1和系统2。 当提到该州时,人们是否会想到这个城市,部分取决于 记忆的自动功能。人们在这方面是不同的。 密歇根州的表述在一些人的头脑中是非常详细的:该州的居民比住在其他地方的人更有可能检索到许多关于该州的事实;地理爱好者会比其他专门研究棒球统计的人检索到更多;更聪明的人比其他人更有可能对大多数事物有 丰富的表述。智力不仅是推理的能力,也是在记忆中寻找相关材料并在需要时部署注意力的能力。 记忆功能是系统1的一个属性。然而,每个人都可以选择放慢速度,在记忆中对所有可能的相关事实进行主动搜索--就像他们可以放慢速度,在球棒和球的问题中检查直观的答案 。刻意检查和搜索的程度是系统2的一个特点,因人而异。
The bat-and-ball problem, the flowers syllogism, and the Michigan/ Detroit problem have something in common. Failing these minitests appears to be, at least to some extent, a matter of insufficient motivation, not trying hard enough. Anyone who can be admitted to a good university is certainly able to reason through the first two questions and to reflect about Michigan long enough to remember the major city in that state and its crime problem. These students can solve much more difficult problems when they are not tempted to accept a superficially plausible answer that comes readily to mind. The ease with which they are satisfied enough to stop thinking is rather troubling. “Lazy” is a harsh judgment about the self-monitoring of these young people and their System 2, but it does not seem to be unfair. Those who avoid the sin of intellectual sloth could be called “engaged.” They are more alert, more intellectually active, less willing to be satisfied with superficially attractive answers, more skeptical about their intuitions. The psychologist Keith Stanovich would call them more rational .
球棒和球的问题、花的论证和密歇根/底特律的问题有一些共同点。至少在某种程度上,未能通过这些测试似乎是动机不足,不够努力的问题。任何能够被一所好大学录取的人 ,肯定能够推理出前两个问题,并对密歇根州进行长时间的思考,以记住该州的主要城市及其犯罪问题。当这些学生不被诱惑去接受轻易出现在脑海中的表面上看似合理的答案时,他们可以解决更多困难的问题。他们很容易满足于停止思考 ,这相当令人不安。"懒惰 "是对这些年轻人的自我监督和他们的系统2的一个苛刻的判断,但它似乎并不公平。那些避免智力懒惰罪的人可以被称为 "参与"。他们更加警觉,智力更加活跃,不太愿意满足于表面上有吸引力的答案,对自己的直觉更加怀疑。心理学家 Keith Stanovich 会称他们更加理性 。
INTELLIGENCE, CONTROL, RATIONALITY
智慧、控制、理性
Researchers have applied diverse methods to examine the connection between thinking and self-control. Some have addressed it by asking the correlation question: If people were ranked by their self-control and by their cognitive aptitude, would individuals have similar positions in the two rankings?
研究人员已经应用了不同的方法来研究思维和自我控制之间的联系。有些人通过提出相关问题来解决这个问题:如果人们按自我控制能力和认知能力进行排名,那么个人在两个排名中的位置是否相似?
In one of the most famous experiments in the history of psychology, Walter Mischel and his students exposed four-year-old children to a cruel dilemma . They were given a choice between a small reward (one Oreo), which they could have at any time, or a larger reward (two cookies) for which they had to wait 15 minutes under difficult conditions. They were to remain alone in a room, facing a desk with two objects: a single cookie and a bell that the child could ring at any time to call in the experimenter and receive the one cookie. As the experiment was described: “There were no toys , books, pictures, or other potentially distracting items in the room. The experimenter left the room and did not return until 15 min had passed or the child had rung the bell, eaten the rewards, stood up, or shown any signs of distress.”
在心理学历史上最著名的实验之一 ,沃尔特-米歇尔和他的学生将四岁的孩子暴露在一个 残酷的困境中 。他们可以选择一个小的奖励(一块奥利奥),他们可以随时得到,或者一个更大的奖励(两块饼干),他们必须在困难的条件下等待15分钟。他们独自呆在一个房间里,面对一张桌子,桌子上有两个东西: 一块饼干和一个铃铛,孩子可以在任何时候按铃,叫来实验者,得到那块饼干。正如实验所描述的那样: " 房间里 没有玩具 、书籍、图片或其他潜在的分心物品。实验者离开房间,直到15分钟过去,或者孩子敲了钟,吃了奖励,站了起来,或者表现出任何痛苦的迹象 ,才会回来。"
The children were watched through a one-way mirror, and the film that shows their behavior during the waiting time always has the audience roaring in laughter. About half the children managed the feat of waiting for 15 minutes, mainly by keeping their attention away from the tempting reward. Ten or fifteen years later, a large gap had opened between those who had resisted temptation and those who had not. The resisters had higher measures of executive control in cognitive tasks, and especially the ability to reallocate their attention effectively. As young adults, they were less likely to take drugs. A significant difference in intellectual aptitude emerged: the children who had shown more self-control as four-year-olds had substantially higher scores on tests of intelligence .
孩子们通过一个单向的镜子进行观察,显示他们在等待时间内的行为的影片总是让观众在笑声中轰鸣。大约一半的孩子成功地完成了等待15分钟的壮举,主要是通过将他们的注意力从诱人的奖励上移开。10或15年后,在那些抵制了诱惑的人和那些没有抵制诱惑的人之间出现了巨大的差距, 。抵抗者在认知任务中具有更高的执行控制能力,特别是有效地重新分配注意力的能力。 作为年轻的成年人,他们不太可能吸毒。在智力方面出现了明显的差异:在四岁时表现出更多自制力的儿童, 在智力测试中的分数 大大 高于 。
A team of researchers at the University of Oregon explored the link between cognitive control and intelligence in several ways, including an attempt to raise intelligence by improving the control of attention. During five 40-minute sessions, they exposed children aged four to six to various computer games especially designed to demand attention and control. In one of the exercises, the children used a joystick to track a cartoon cat and move it to a grassy area while avoiding a muddy area. The grassy areas gradually shrank and the muddy area expanded, requiring progressively more precise control. The testers found that training attention not only improved executive control; scores on nonverbal tests of intelligence also improved and the improvement was maintained for several months. Other research by the same group identified specific genes that are involved in the control of attention, showed that parenting techniques also affected this ability, and demonstrated a close connection between the children’s ability to control their attention and their ability to control their emotions.
俄勒冈大学的一个研究小组以多种方式探讨了认知控制和智力之间的联系,包括试图通过改善注意力的控制来提高智力。 在五个40分钟的课程中,他们让四到六岁的儿童接触各种特别设计来要求注意力和控制的电脑游戏。在其中一项练习中,孩子们 ,用操纵杆跟踪一只卡通猫,并将它移到草地上,同时避开泥泞的地方。草地逐渐缩小,泥泞的地方逐渐扩大,需要逐渐更精确地控制。测试人员发现,训练注意力不仅改善了 执行控制;非语言智力测试的分数也得到了改善,而且这种 改善保持了 几个月。 同一小组的其他研究确定了参与控制注意力的特定基因,表明养育技术也影响这种能力,并证明儿童控制注意力的能力与他们控制情绪的能力之间有密切联系。
Shane Frederick constructed a Cognitive Reflection Test, which consists of the bat-and-ball problem and two other questions, chosen because they also invite an intuitive answer that is both compelling and wrong (the questions are shown in chapter 5 ). He went on to study the characteristics of students who score very low on this test—the supervisory function of System 2 is weak in these people—and found that they are prone to answer questions with the first idea that comes to mind and unwilling to invest the effort needed to check their intuitions. Individuals who uncritically follow their intuitions about puzzles are also prone to accept other suggestions from System 1. In particular, they are impulsive, impatient, and keen to receive immediate gratification. For example, 63% of the intuitive respondents say they would prefer to get $3,400 this month rather than $3,800 next month. Only 37% of those who solve all three puzzles correctly have the same shortsighted preference for receiving a smaller amount immediately. When asked how much they will pay to get overnight delivery of a book they have ordered, the low scorers on the Cognitive Reflection Test are willing to pay twice as much as the high scorers. Frederick’s findings suggest that the characters of our psychodrama have different “personalities.” System 1 is impulsive and intuitive; System 2 is capable of reasoning, and it is cautious, but at least for some people it is also lazy. We recognize related differences among individuals: some people are more like their System 2; others are closer to their System 1. This simple test has emerged as one of the better predictors of lazy thinking.
谢恩-弗雷德里克构建了一个认知反思测试,其中包括球棒和球 问题和另外两个问题,之所以选择这两个问题,是因为它们也会招致一个既令人信服又错误的直觉答案(这些问题在 第5章 显示)。他继续研究了在这个测试中得分很低的学生的特点--在这些人身上,系统2的监督功能很弱--发现他们容易用想到的第一个想法来回答问题,并且 ,不愿意投入必要的努力来检查他们的直觉。那些不加选择地跟随自己对谜题的直觉的人也容易接受系统1的其他建议。特别是,他们很冲动,没有耐心,热衷于立即得到满足。例如,63%的直觉型受访者说他们宁愿这个月得到3400美元,也不愿意下个月得到3800美元。 ,只有37%的人正确地解开了所有三个谜题,他们同样短视地倾向于立即得到一个较小的数额。当被问及他们会花多少钱来获得他们所订购的书的隔夜快递时,认知反思测试的低分者愿意支付两倍于高分者的费用。 弗雷德里克的发现表明,我们的心理剧 的人物有不同的 "个性"。系统1是冲动和直观的;系统2是能够推理的,它是谨慎的,但至少对某些人来说,它也是懒惰的。 我们认识到个人之间的相关差异:有些人更像他们的系统2;有些人更接近他们的系统1。 这个简单的测试已经成为预测懒惰思维的较好指标之一。
Keith Stanovich and his longtime collaborator Richard West originally introduced the terms System 1 and System 2 (they now prefer to speak of Type 1 and Type 2 processes). Stanovich and his colleagues have spent decades studying differences among individuals in the kinds of problems with which this book is concerned. They have asked one basic question in many different ways: What makes some people more susceptible than others to biases of judgment? Stanovich published his conclusions in a book titled Rationality and the Reflective Mind , which offers a bold and distinctive approach to the topic of this chapter. He draws a sharp distinction between two parts of System 2—indeed, the distinction is so sharp that he calls them separate “minds.” One of these minds (he calls it algorithmic) deals with slow thinking and demanding computation. Some people are better than others in these tasks of brain power—they are the individuals who excel in intelligence tests and are able to switch from one task to another quickly and efficiently. However, Stanovich argues that high intelligence does not make people immune to biases. Another ability is involved, which he labels rationality. Stanovich’s concept of a rational person is similar to what I earlier labeled “engaged.” The core of his argument is that rationality should be distinguished from intelligence . In his view, superficial or “lazy” thinking is a flaw in the reflective mind, a failure of rationality. This is an attractive and thought-provoking idea. In support of it, Stanovich and his colleagues have found that the bat-and-ball question and others like it are somewhat better indicators of our susceptibility to cognitive errors than are conventional measures of intelligence , such as IQ tests. Time will tell whether the distinction between intelligence and rationality can lead to new discoveries.
Keith Stanovich 和他的长期合作者Richard West最初提出了系统1和系统2这两个术语(他们现在更喜欢说类型1和类型2的过程)。Stanovich和他的同事花了几十年时间研究个人在本书所关注的各种问题上的差异。他们以许多不同的方式提出一个基本问题:是什么让一些人比其他人更容易受到判断偏差的影响 ?斯坦诺维奇在一本名为《 理性与反思的心灵 》的书中发表了他的结论,这本书为本章的主题提供了一个大胆而独特的方法。他在系统2的两个部分之间做出了鲜明的区分--事实上,这种区分是如此鲜明,以至于他把它们称为 "思想"。其中一种思维(他称之为算法)处理缓慢的思维 ,并要求计算。有些人在这些脑力任务方面比其他人强--他们是在智力测试中表现出色的人,能够快速有效地从一个任务切换到另一个任务。 然而,斯坦诺维奇认为,高智力并不能使人们免于偏见。还涉及另一种能力,他称之为理性。 斯坦诺维奇关于理性 的概念类似于我早先标注的 "参与"。他的论点的核心是,应将 理性 与 智力 区分开来。在他看来,肤浅或 "懒惰 "的思维是反思性思维的缺陷,是理性的失败。 这是一个有吸引力和发人深省的想法。为了支持它,斯坦诺维奇和他的同事们发现,球棒和球的问题以及其他 ,与 传统的智力测量方法 (如智商测试)相比,在某种程度上更能说明我们对认知错误的易感性。 时间将证明智力和理性之间的区别是否能带来新的发现。
SPEAKING OF CONTROL
说到控制
“She did not have to struggle to stay on task for hours. She was in a state of flow .”
"她不需要挣扎着保持几个小时的任务,她处于一种 流动 状态"。
“His ego was depleted after a long day of meetings. So he just turned to standard operating procedures instead of thinking through the problem.”
"经过一天漫长的 ,他的自我感觉被耗尽了。所以他只是转向标准操作程序,而不是思考问题。"
“He didn’t bother to check whether what he said made sense. Does he usually have a lazy System 2 or was he unusually tired?”
"他懒得检查他说的话是否有意义。他通常有一个懒惰的系统2,还是他异常的疲惫?"
“Unfortunately, she tends to say the first thing that comes into her mind. She probably also has trouble delaying gratification. Weak System 2.”
"不幸的是,她倾向于说她脑海中的第一件事。 她可能也有延迟满足的问题。 弱系统2。"
4
4
The Associative Machine
联想机
To begin your exploration of the surprising workings of System 1, look at the following words:
要开始探索系统1的令人惊讶的工作原理,请看下面的文字。
Bananas Vomit
香蕉呕吐物
A lot happened to you during the last second or two. You experienced some unpleasant images and memories. Your face twisted slightly in an expression of disgust, and you may have pushed this book imperceptibly farther away. Your heart rate increased, the hair on your arms rose a little, and your sweat glands were activated. In short, you responded to the disgusting word with an attenuated version of how you would react to the actual event. All of this was completely automatic, beyond your control .
在最后一两秒钟里,你发生了很多事情。你经历了一些不愉快的画面 和记忆。你的脸微微扭曲,露出厌恶的表情,你可能把这本书不知不觉地推得更远了。你的心率加快了,手臂上的毛发稍微竖起,你的汗腺被激活了。 简而言之,你对这个恶心的词的反应是你对实际事件的反应的减弱版。 所有这些都是完全自动的, 超出了你的控制 。
There was no particular reason to do so, but your mind automatically assumed a temporal sequence and a causal connection between the words bananas and vomit , forming a sketchy scenario in which bananas caused the sickness. As a result, you are experiencing a temporary aversion to bananas (don’t worry, it will pass). The state of your memory has changed in other ways: you are now unusually ready to recognize and respond to objects and concepts associated with “vomit,” such as sick, stink, or nausea, and words associated with “bananas,” such as yellow and fruit, and perhaps apple and berries.
没有什么特别的理由,但你的头脑自动假设了 香蕉 和 呕吐 这两个词之间的时间顺序和因果关系,形成了一个粗略的情景,即香蕉导致了疾病。 因此,你对香蕉产生了暂时的厌恶(别担心,会过去的)。你的记忆状态在其他方面也发生了变化:你现在异常地准备好 ,以识别和回应与 "呕吐物 "相关的物体和概念,如生病、发臭或恶心,以及与 "香蕉 "相关的词语,如黄色和水果,或许还有苹果和浆果。
Vomiting normally occurs in specific contexts, such as hangovers and indigestion. You would also be unusually ready to recognize words associated with other causes of the same unfortunate outcome. Furthermore, your System 1 noticed the fact that the juxtaposition of the two words is uncommon; you probably never encountered it before. You experienced mild surprise.
呕吐通常发生在特定的环境中,如宿醉和消化不良。你也会异常地准备好识别与同一不幸结果的其他原因有关的词 。 此外,你的系统1注意到这两个词的并列是不常见的;你可能以前从未遇到过。 你经历了轻微的惊讶。
This complex constellation of responses occurred quickly, automatically, and effortlessly. You did not will it and you could not stop it. It was an operation of System 1. The events that took place as a result of your seeing the words happened by a process called associative activation: ideas that have been evoked trigger many other ideas, in a spreading cascade of activity in your brain. The essential feature of this complex set of mental events is its coherence. Each element is connected, and each supports and strengthens the others. The word evokes memories, which evoke emotions, which in turn evoke facial expressions and other reactions, such as a general tensing up and an avoidance tendency. The facial expression and the avoidance motion intensify the feelings to which they are linked, and the feelings in turn reinforce compatible ideas. All this happens quickly and all at once, yielding a self-reinforcing pattern of cognitive, emotional, and physical responses that is both diverse and integrated—it has been called associatively coherent .
这一系列复杂的反应迅速、自动、毫不费力地发生了。 你没有意愿,也无法阻止它。这是系统1的操作。由于你看到 ,发生的事件是通过一个叫做联想激活的过程发生的:被唤起的想法引发了许多其他的想法,在你的大脑中形成了一个扩散的级联活动。这套复杂的心理事件的基本特征是它的连贯性。 每个元素都有联系,每个元素都支持和加强其他元素。这个词唤起了记忆,而记忆又唤起了情绪,情绪又唤起了面部表情 ,以及其他反应,如全身紧绷和回避倾向。面部表情和回避动作强化了与之相关的感觉,而感觉又反过来强化了相容的想法。所有这些都迅速地、一次性地发生了,产生了一种自我强化的认知、情绪和身体反应模式,它既多样又综合,被 ,称为 关联性的一致性 。
In a second or so you accomplished, automatically and unconsciously, a remarkable feat. Starting from a completely unexpected event, your System 1 made as much sense as possible of the situation—two simple words, oddly juxtaposed—by linking the words in a causal story; it evaluated the possible threat (mild to moderate) and created a context for future developments by preparing you for events that had just become more likely; it also created a context for the current event by evaluating how surprising it was. You ended up as informed about the past and as prepared for the future as you could be.
在一秒钟左右的时间里,你自动和无意识地完成了一个了不起的壮举。从一个完全出乎意料的事件开始,你的系统1尽可能地使情况合理化--两个简单的词,奇怪地并列在一个因果故事中;它评估了可能的威胁(轻度到中度),并为未来的发展创造了一个背景 ,使你对刚刚变得更有可能的事件做好准备;它还通过评估当前事件的惊奇程度为其创造了一个背景。你最终尽可能地了解了过去,为未来做好准备。
An odd feature of what happened is that your System 1 treated the mere conjunction of two words as representations of reality. Your body reacted in an attenuated replica of a reaction to the real thing, and the emotional response and physical recoil were part of the interpretation of the event. As cognitive scientists have emphasized in recent years, cognition is embodied; you think with your body , not only with your brain.
所发生的事情的一个奇怪的特点是,你的系统1把两个词的单纯结合当作现实的表征。你的身体以一种减弱的 ,复制了对真实事物的反应,而情绪反应和身体后退是对事件解释的一部分。正如认知科学家近年来所强调的,认知是体现的;你 用身体思考 ,而不仅仅是用大脑。
The mechanism that causes these mental events has been known for a long time: it is the association of ideas. We all understand from experience that ideas follow each other in our conscious mind in a fairly orderly way. The British philosophers of the seventeenth and eighteenth centuries searched for the rules that explain such sequences. In An Enquiry Concerning Human Understanding , published in 1748, the Scottish philosopher David Hume reduced the principles of association to three: resemblance, contiguity in time and place, and causality. Our concept of association has changed radically since Hume’s days, but his three principles still provide a good start.
导致这些心理事件的机制早已为人所知:它是观念的关联。我们都从 ,观念在我们的意识中以相当有序的方式相互跟随。 17和18世纪的英国哲学家们寻找解释这种顺序的规则。在1748年出版的 《关于人类理解的询问》 中,苏格兰哲学家 大卫-休谟将关联的原则简化为三个:相似性、时间和地点的连续性、 和因果性。自休谟时代以来,我们的关联概念已经发生了根本性的变化,但他的三个原则仍然提供了一个良好的开端。
I will adopt an expansive view of what an idea is. It can be concrete or abstract, and it can be expressed in many ways: as a verb, as a noun, as an adjective, or as a clenched fist. Psychologists think of ideas as nodes in a vast network, called associative memory, in which each idea is linked to many others. There are different types of links: causes are linked to their effects (virus ➞ cold); things to their properties (lime ➞ green); things to the categories to which they belong (banana ➞ fruit). One way we have advanced beyond Hume is that we no longer think of the mind as going through a sequence of conscious ideas, one at a time. In the current view of how associative memory works, a great deal happens at once. An idea that has been activated does not merely evoke one other idea. It activates many ideas, which in turn activate others. Furthermore, only a few of the activated ideas will register in consciousness; most of the work of associative thinking is silent, hidden from our conscious selves. The notion that we have limited access to the workings of our minds is difficult to accept because, naturally, it is alien to our experience, but it is true: you know far less about yourself than you feel you do.
我将采用一种广义的观点来理解什么是理念。 它可以是具体的,也可以是抽象的,它可以用多种方式来表达:作为动词,作为名词,作为形容词,或者作为一个紧握的拳头。心理学家认为想法是一个庞大网络中的节点,称为联想 ,其中每个想法都与许多其他想法相联系。有不同类型的联系:原因与它们的效果相联系(病毒➞感冒);事物与它们的属性相联系(青柠➞绿色);事物与它们所属的类别相联系(香蕉➞水果)。 我们超越休谟的一个方法是,我们不再认为头脑是一次一次地经历一系列有意识的想法。在目前关于联想记忆如何工作的 ,大量的事情都是一次发生。一个被激活的想法不只是唤起一个其他的想法。此外,只有少数被激活的想法会在意识中记录下来;联想思维的大部分工作是沉默的,被我们有意识的自我隐藏起来。我们对自己的思维运作的访问有限, ,这一概念很难接受,因为它自然与我们的经验格格不入,但这是事实:你对自己的了解远比你觉得的要少。
THE MARVELS OF PRIMING
引导的奇妙之处
As is common in science, the first big breakthrough in our understanding of the mechanism of association was an improvement in a method of measurement. Until a few decades ago, the only way to study associations was to ask many people questions such as, “What is the first word that comes to your mind when you hear the word DAY?” The researchers tallied the frequency of responses, such as “night,” “sunny,” or “long.” In the 1980s, psychologists discovered that exposure to a word causes immediate and measurable changes in the ease with which many related words can be evoked. If you have recently seen or heard the word EAT, you are temporarily more likely to complete the word fragment SO_P as SOUP than as SOAP. The opposite would happen, of course, if you had just seen WASH. We call this a priming effect and say that the idea of EAT primes the idea of SOUP, and that WASH primes SOAP .
正如科学中常见的那样,我们对联想机制的理解的第一个重大突破是对一种测量方法的改进。直到几十年前,研究 联想的唯一方法是问许多人问题,如:"当你听到DAY这个词时,你脑子里想到的第一个词是什么?"研究人员统计了回答的频率,如 "晚上"、"晴天 "或 "长"。在20世纪80年代,心理学家发现,接触一个词会使许多相关的词被唤起的难易程度立即发生可测量的变化。 如果你最近 ,看到或听到EAT这个词,你暂时更有可能把SO_P这个词片断完成为SOUP而不是SOAP。当然,如果你刚刚看到了 "洗",情况就会相反。 我们把这称为 引爆效应 ,并说 "吃 "的概念引爆了 "汤 "的概念,而 "洗 "引爆了 "肥皂"。
Priming effects take many forms. If the idea of EAT is currently on your mind (whether or not you are conscious of it), you will be quicker than usual to recognize the word SOUP when it is spoken in a whisper or presented in a blurry font. And of course you are primed not only for the idea of soup but also for a multitude of food-related ideas, including fork, hungry, fat, diet, and cookie. If for your most recent meal you sat at a wobbly restaurant table, you will be primed for wobbly as well. Furthermore, the primed ideas have some ability to prime other ideas, although more weakly. Like ripples on a pond, activation spreads through a small part of the vast network of associated ideas. The mapping of these ripples is now one of the most exciting pursuits in psychological research.
引发效应有多种形式。 如果 "吃 "的概念目前在你的脑海中(无论你是否有意识到这一点),当SOUP这个词被低声说出来或以模糊的字体呈现时,你会比 通常更快地识别它。当然,你不仅对汤的想法有了初步认识,而且对众多与食物有关的想法也有了认识,包括叉子、饥饿、脂肪、饮食和饼干。如果你最近的一餐是坐在摇摇晃晃的餐厅桌子上,那么你也会有摇摇晃晃的心理准备。 此外,被激发的想法也有一定的能力来激发其他想法,尽管比较弱。就像池塘上的涟漪一样,激活通过庞大的相关思想网络中的一小部分传播开来。 绘制这些涟漪的地图现在是心理学研究中最令人兴奋的追求之一。
Another major advance in our understanding of memory was the discovery that priming is not restricted to concepts and words. You cannot know this from conscious experience, of course, but you must accept the alien idea that your actions and your emotions can be primed by events of which you are not even aware. In an experiment that became an instant classic, the psychologist John Bargh and his collaborators asked students at New York University—most aged eighteen to twenty-two—to assemble four-word sentences from a set of five words (for example, “finds he it yellow instantly” ). For one group of students, half the scrambled sentences contained words associated with the elderly, such as Florida , forgetful , bald , gray , or wrinkle . When they had completed that task, the young participants were sent out to do another experiment in an office down the hall. That short walk was what the experiment was about. The researchers unobtrusively measured the time it took people to get from one end of the corridor to the other. As Bargh had predicted, the young people who had fashioned a sentence from words with an elderly theme walked down the hallway significantly more slowly than the others.
我们对记忆的理解的另一个重大进展是发现引力并不局限于 到概念和单词。当然,你不能从有意识的经验中知道这一点,但你必须接受一个陌生的想法,即你的行动和你的情绪可以被你甚至没有意识到的事件所激发。在一个立即成为经典的实验中,心理学家约翰-巴格和他的合作者要求纽约大学的学生--大多数年龄在18到22岁之间--从一组五个词(例如, "发现他它立刻变黄了") 中组合出四个词的 。对于一组学生来说,一半的争抢句子包含与老年人有关的词,例如 佛罗里达 、 健忘 、 秃头 、 灰色 或 皱纹 。当他们完成这项任务后,年轻的参与者被派到走廊尽头的办公室里做另一个实验。这段短暂的步行就是实验 。研究人员不露声色地测量了人们从走廊的一端到另一端所需的时间。正如巴格所预料的那样,那些以老年人为主题的话语组成句子的年轻人在走廊上走得明显比其他人慢。
The “Florida effect” involves two stages of priming. First, the set of words primes thoughts of old age, though the word old is never mentioned; second, these thoughts prime a behavior, walking slowly, which is associated with old age. All this happens without any awareness. When they were questioned afterward, none of the students reported noticing that the words had had a common theme, and they all insisted that nothing they did after the first experiment could have been influenced by the words they had encountered. The idea of old age had not come to their conscious awareness, but their actions had changed nevertheless. This remarkable priming phenomenon—the influencing of an action by the idea—is known as the ideomotor effect. Although you surely were not aware of it, reading this paragraph primed you as well. If you had needed to stand up to get a glass of water, you would have been slightly slower than usual to rise from your chair—unless you happen to dislike the elderly, in which case research suggests that you might have been slightly faster than usual!
佛罗里达效应 "包括两个阶段的激发。首先,这组词激发了对老年的想法, ,尽管从未提到过 老 这个词;其次,这些想法激发了一种行为,即缓慢行走,这与老年有关。当他们事后被询问时,没有一个学生报告说注意到这些词有一个共同的主题,而且他们都坚持认为他们在第一次实验后所做的任何事情都不可能受到他们所遇到的词 。老年的想法并没有出现在他们的意识中,但他们的行动还是发生了变化。这种显著的激发现象--思想对行动的影响--被称为思想运动效应。 尽管你肯定没有意识到这一点,但读了这段话后,你也被激发了。 如果你需要站起来拿一杯水,你从 椅子上站起来的速度会比平时稍慢--除非你碰巧不喜欢老年人,在这种情况下,研究表明你可能会比平时稍快一些。
The ideomotor link also works in reverse. A study conducted in a German university was the mirror image of the early experiment that Bargh and his colleagues had carried out in New York. Students were asked to walk around a room for 5 minutes at a rate of 30 steps per minute, which was about one-third their normal pace. After this brief experience, the participants were much quicker to recognize words related to old age , such as forgetful , old , and lonely . Reciprocal priming effects tend to produce a coherent reaction: if you were primed to think of old age, you would tend to act old, and acting old would reinforce the thought of old age.
意念运动的联系也是反向工作的。在德国一所大学进行的一项研究是巴格和他的同事在纽约进行的早期实验的镜像。学生们被要求以每分钟30步的速度在 一个房间里走5分钟,这大约是他们正常步伐的三分之一。在这种短暂的体验之后,参与者更快地认出了 与老年有关的词语 ,如 健忘 、 年老 和 孤独 。 相互引诱效应往往会产生一种连贯的反应:如果你被引诱想到老年,你就会倾向于表现出老年,而表现出老年会加强 对老年的思考。
Reciprocal links are common in the associative network. For example, being amused tends to make you smile, and smiling tends to make you feel amused. Go ahead and take a pencil, and hold it between your teeth for a few seconds with the eraser pointing to your right and the point to your left. Now hold the pencil so the point is aimed straight in front of you, by pursing your lips around the eraser end. You were probably unaware that one of these actions forced your face into a frown and the other into a smile. College students were asked to rate the humor of cartoons from Gary Larson’s The Far Side while holding a pencil in their mouth. Those who were “smiling” (without any awareness of doing so) found the cartoons funnier than did those who were “frowning.” In another experiment, people whose face was shaped into a frown (by squeezing their eyebrows together) reported an enhanced emotional response to upsetting pictures —starving children, people arguing, maimed accident victims.
互惠联系在关联网络中很常见。 例如,被逗乐往往会使你微笑,而微笑往往会使你感到被逗乐。 继续拿起一支铅笔,用牙齿夹住它几秒钟,橡皮指向你的右边,笔尖指向你的左边。现在握住铅笔,使笔尖直指你的前方,通过抿紧 ,你的嘴唇围绕橡皮的一端。 你可能没有意识到,这些动作之一迫使你的脸变成皱纹,另一个变成笑。大学生们被要求在嘴里含着铅笔的情况下对加里-拉尔森的《 远方》 中的漫画的幽默程度进行评分。那些 "微笑 "的人(没有意识到这样做)比那些 "皱眉 "的人发现漫画更有趣。 在另一个实验中,脸部被塑造成皱纹的人(通过将他们的眉毛挤在一起)报告说,他们对 令人不安的图片--饥饿的 儿童、争吵的人、致残的事故受害者的情绪反应增强。
Simple, common gestures can also unconsciously influence our thoughts and feelings. In one demonstration, people were asked to listen to messages through new headphones. They were told that the purpose of the experiment was to test the quality of the audio equipment and were instructed to move their heads repeatedly to check for any distortions of sound. Half the participants were told to nod their head up and down while others were told to shake it side to side. The messages they heard were radio editorials. Those who nodded (a yes gesture) tended to accept the message they heard, but those who shook their head tended to reject it. Again, there was no awareness, just a habitual connection between an attitude of rejection or acceptance and its common physical expression. You can see why the common admonition to “act calm and kind regardless of how you feel” is very good advice: you are likely to be rewarded by actually feeling calm and kind.
简单、普通的手势也能无意识地影响我们的思想和感受。在一次演示中,人们被要求通过新的耳机 收听信息 。 他们被告知,实验的目的是测试音频设备的质量,并被指示反复移动他们的头以检查声音是否失真。 一半的参与者被告知要上下点头,而其他人则被告知要左右摇晃。那些点头的人(是的手势)倾向于接受他们听到的 ,但那些摇头的人倾向于拒绝它。同样,没有意识,只是在拒绝或接受的态度和其常见的身体表达之间有一种习惯性的联系。 你可以看到为什么 "无论你感觉如何,都要表现得平静和善良 "的常见告诫是非常好的建议:你有可能通过实际的平静和善良感觉得到回报。
PRIMES THAT GUIDE US
指导我们的原则
Studies of priming effects have yielded discoveries that threaten our self-image as conscious and autonomous authors of our judgments and our choices. For instance, most of us think of voting as a deliberate act that reflects our values and our assessments of policies and is not influenced by irrelevancies. Our vote should not be affected by the location of the polling station, for example, but it is. A study of voting patterns in precincts of Arizona in 2000 showed that the support for propositions to increase the funding of schools was significantly greater when the polling station was in a school than when it was in a nearby location. A separate experiment showed that exposing people to images of classrooms and school lockers also increased the tendency of participants to support a school initiative. The effect of the images was larger than the difference between parents and other voters! The study of priming has come some way from the initial demonstrations that reminding people of old age makes them walk more slowly. We now know that the effects of priming can reach into every corner of our lives.
对激发效应的研究已经产生了一些发现,威胁到我们作为判断和选择的有意识和自主的作者的自我形象。 例如,我们大多数人认为投票是一种深思熟虑的行为,反映了我们的价值观和对政策的评估,不受无关紧要的影响。 我们的投票不应该受到投票站位置的影响,例如, ,但它是。2000年对亚利桑那州各选区投票模式的研究表明,当投票站位于学校时,对 增加学校经费 的提案的支持率明显高于附近地区。 一个单独的实验表明,让人们看到教室和学校储物柜的图像也会增加参与者支持 学校倡议的倾向。图像的影响比父母和其他选民之间的差异更大!从最初的 "提醒人们年老会让他们走得更慢 "的演示,到现在我们知道,"暗示 "的影响可以深入到我们生活的每个角落。
Reminders of money produce some troubling effects. Participants in one experiment were shown a list of five words from which they were required to construct a four-word phrase that had a money theme (“high a salary desk paying” became “a high-paying salary”). Other primes were much more subtle, including the presence of an irrelevant money-related object in the background, such as a stack of Monopoly money on a table, or a computer with a screen saver of dollar bills floating in water.
金钱的提醒 产生了一些令人不安的效果。在一个 实验中,参与者被要求从五个词的列表中构建一个以金钱为主题的四字短语("高工资的桌子 "变成了 "高工资")。其他的诱因要微妙得多,包括在背景中出现一个与金钱无关的物体,如桌子上的一叠大富翁的钱,或一台电脑的屏幕保护程序,上面有漂浮在水中的美元钞票 。
Money-primed people become more independent than they would be without the associative trigger. They persevered almost twice as long in trying to solve a very difficult problem before they asked the experimenter for help, a crisp demonstration of increased self-reliance. Money-primed people are also more selfish: they were much less willing to spend time helping another student who pretended to be confused about an experimental task. When an experimenter clumsily dropped a bunch of pencils on the floor, the participants with money (unconsciously) on their mind picked up fewer pencils. In another experiment in the series, participants were told that they would shortly have a get-acquainted conversation with another person and were asked to set up two chairs while the experimenter left to retrieve that person. Participants primed by money chose to stay much farther apart than their nonprimed peers (118 vs. 80 centimeters). Money-primed undergraduates also showed a greater preference for being alone.
受金钱刺激的人变得比没有关联触发的人更独立。在试图解决一个非常困难的问题时,他们坚持的时间几乎是向实验者寻求帮助的两倍,这是自我依赖性增强的干脆表现。有金钱刺激的人也更自私:他们更不愿意花时间帮助另一个假装对实验任务感到困惑的学生 。当一个实验者笨拙地把一堆铅笔掉在地上时,心里想着钱的参与者就会捡起更少的铅笔。在该系列的另一个实验中,参与者被告知他们不久将与另一个人进行一次熟悉的谈话,并被要求在 实验者离开去找那个人的时候摆好两把椅子。受金钱刺激的参与者选择比未受刺激的同龄人保持更远的距离(118厘米对80厘米)。 受金钱刺激的本科生还表现出对独处的更大偏好。
The general theme of these findings is that the idea of money primes individualism: a reluctance to be involved with others, to depend on others, or to accept demands from others. The psychologist who has done this remarkable research, Kathleen Vohs, has been laudably restrained in discussing the implications of her findings, leaving the task to her readers. Her experiments are profound—her findings suggest that living in a culture that surrounds us with reminders of money may shape our behavior and our attitudes in ways that we do not know about and of which we may not be proud. Some cultures provide frequent reminders of respect, others constantly remind their members of God, and some societies prime obedience by large images of the Dear Leader. Can there be any doubt that the ubiquitous portraits of the national leader in dictatorial societies not only convey the feeling that “Big Brother Is Watching” but also lead to an actual reduction in spontaneous thought and independent action?
这些发现的总主题是,金钱的想法为 个人主义提供了动力:不愿意与他人交往,不愿意依赖他人,也不愿意接受他人的 要求。做过这项杰出研究的心理学家凯瑟琳-沃斯在讨论她的发现的意义时一直保持着值得称赞的克制,把任务留给了她的读者。她的实验是深刻的--她的发现表明,生活在一种围绕着我们的金钱提醒的文化中,可能会以我们不知道的方式塑造我们的行为和态度,而且 ,我们可能不会感到骄傲。一些文化经常提醒人们要尊重他人,另一些文化则不断提醒其成员要尊重上帝,还有一些社会则通过亲爱的领袖的大型形象来强调服从性。毫无疑问,独裁社会中无处不在的国家领导人的画像不仅传达了 "老大哥在看 "的感觉,而且还导致了自发的 思想和独立行动的实际减少。
The evidence of priming studies suggests that reminding people of their mortality increases the appeal of authoritarian ideas , which may become reassuring in the context of the terror of death. Other experiments have confirmed Freudian insights about the role of symbols and metaphors in unconscious associations. For example, consider the ambiguous word fragments W_ _ H and S_ _ P. People who were recently asked to think of an action of which they are ashamed are more likely to complete those fragments as WASH and SOAP and less likely to see WISH and SOUP. Furthermore, merely thinking about stabbing a coworker in the back leaves people more inclined to buy soap, disinfectant, or detergent than batteries, juice, or candy bars. Feeling that one’s soul is stained appears to trigger a desire to cleanse one’s body, an impulse that has been dubbed the “Lady Macbeth effect.”
引导研究的证据表明,提醒人们他们的死亡会增加 专制思想的吸引力 ,在死亡的恐怖背景下, 专制思想 可能变得令人放心。其他实验证实了弗洛伊德关于符号和隐喻在无意识联想中的作用的见解。例如,考虑模糊的单词片段 W_ _ H和S_ _ P。最近被要求想到一个他们感到羞耻的行为的人,更有可能将这些片段完成为WASH和SOAP,而不可能看到WISH和SOUP。此外,仅仅想到在背后捅同事一刀,人们就会更倾向于购买肥皂、消毒剂或洗涤剂,而不是电池、果汁或糖块。感觉到自己的灵魂被玷污, ,似乎会引发清洗自己身体的欲望,这种冲动被称为 "麦克白夫人效应"。
The cleansing is highly specific to the body parts involved in a sin. Participants in an experiment were induced to “lie” to an imaginary person, either on the phone or in e-mail. In a subsequent test of the desirability of various products, people who had lied on the phone preferred mouthwash over soap , and those who had lied in e-mail preferred soap to mouthwash.
洁净对涉及罪的身体部位来说是非常具体的。在一项实验中,参与者被诱导向一个假想的人 "撒谎",无论是在电话中还是在电子邮件中。在随后对各种产品的可取性的测试中,在电话中说过谎的人 更喜欢漱口水而不是肥皂 ,而在电子邮件中说过谎的人更喜欢肥皂而不是漱口水。
When I describe priming studies to audiences, the reaction is often disbelief. This is not a surprise: System 2 believes that it is in charge and that it knows the reasons for its choices. Questions are probably cropping up in your mind as well: How is it possible for such trivial manipulations of the context to have such large effects? Do these experiments demonstrate that we are completely at the mercy of whatever primes the environment provides at any moment? Of course not. The effects of the primes are robust but not necessarily large. Among a hundred voters, only a few whose initial preferences were uncertain will vote differently about a school issue if their precinct is located in a school rather than in a church—but a few percent could tip an election.
当我向听众描述引力研究时,他们的反应往往是不相信。这并不令人惊讶:系统2认为它是负责人,它知道自己选择的原因。 你的脑海中可能也会出现问题:对背景进行如此微不足道的操作,怎么可能产生如此大的影响? 这些实验是否表明,我们完全受环境在任何时候提供的任何诱因的摆布?当然不是。主体的影响是强大的,但不一定大。在一百名选民中,如果他们的 选区位于学校而不是教堂,只有少数人的最初偏好不确定,会对学校问题投下不同的票,但 ,这百分之几的人可能会影响选举。
The idea you should focus on, however, is that disbelief is not an option. The results are not made up, nor are they statistical flukes. You have no choice but to accept that the major conclusions of these studies are true. More important, you must accept that they are true about you . If you had been exposed to a screen saver of floating dollar bills, you too would likely have picked up fewer pencils to help a clumsy stranger. You do not believe that these results apply to you because they correspond to nothing in your subjective experience. But your subjective experience consists largely of the story that your System 2 tells itself about what is going on. Priming phenomena arise in System 1, and you have no conscious access to them.
然而,你应该关注的是,不相信不是一种选择。 结果不是编造的,也不是统计上的侥幸。你没有选择,只能接受这些研究的主要结论是真的。更重要的是,你必须接受它们是关于 你的 事实。如果你接触过漂浮的美元钞票的屏幕保护程序,你也 ,很可能会减少拿起铅笔来帮助一个笨拙的陌生人。你不相信这些结果适用于你,因为它们与你主观经验中的任何东西都不相符。但你的主观经验主要由你的系统2告诉自己的关于正在发生的事情的故事组成。引发现象出现在系统1中,你没有意识到它们。
I conclude with a perfect demonstration of a priming effect, which was conducted in an office kitchen at a British university . For many years members of that office had paid for the tea or coffee to which they helped themselves during the day by dropping money into an “honesty box.” A list of suggested prices was posted. One day a banner poster was displayed just above the price list, with no warning or explanation. For a period of ten weeks a new image was presented each week, either flowers or eyes that appeared to be looking directly at the observer. No one commented on the new decorations, but the contributions to the honesty box changed significantly. The posters and the amounts that people put into the cash box (relative to the amount they consumed) are shown in figure 4 . They deserve a close look.
最后,我想说的是, ,这是一个完美的引爆效应演示,是在 英国一所大学的 办公室厨房里进行的。多年来,该办公室的成员通过向一个 "诚实箱 "投钱来支付他们白天自己喝的茶或咖啡。张贴了一份建议价格的清单。有一天,在价格清单的正上方显示了一张横幅海报,没有任何警告或解释。 在十周的时间里,每周呈现一个新的图像,要么是鲜花,要么是似乎直视着观察者的眼睛。没有人对新的装饰发表意见,但对诚信箱的贡献却发生了很大变化。 图4 显示了海报和人们投入现金箱的金额(相对于他们的消费金额)。 它们值得仔细观察。

Figure 4
图4
On the first week of the experiment (which you can see at the bottom of the figure), two wide-open eyes stare at the coffee or tea drinkers, whose average contribution was 70 pence per liter of milk. On week 2, the poster shows flowers and average contributions drop to about 15 pence. The trend continues. On average, the users of the kitchen contributed almost three times as much in “eye weeks” as they did in “flower weeks.” Evidently, a purely symbolic reminder of being watched prodded people into improved behavior. As we expect at this point, the effect occurs without any awareness. Do you now believe that you would also fall into the same pattern?
在实验的第一周(你可以在图的底部看到),两只睁大的眼睛盯着喝咖啡或茶的人,他们的平均贡献是每升牛奶70便士。在第二周,海报上出现了鲜花,平均捐款下降到约15便士。这一趋势仍在继续。平均而言,厨房的使用者在 "眼睛周 "的贡献几乎是他们在 "鲜花周 "的三倍。显然,一个纯粹象征性的被监视的提醒,促使人们改善行为。正如我们在这一点上所期望的那样,这种效果是在没有任何意识的情况下发生的。 你现在是否相信你也会陷入同样的模式?
Some years ago, the psychologist Timothy Wilson wrote a book with the evocative title Strangers to Ourselves . You have now been introduced to that stranger in you, which may be in control of much of what you do, although you rarely have a glimpse of it. System 1 provides the impressions that often turn into your beliefs, and is the source of the impulses that often become your choices and your actions. It offers a tacit interpretation of what happens to you and around you, linking the present with the recent past and with expectations about the near future. It contains the model of the world that instantly evaluates events as normal or surprising. It is the source of your rapid and often precise intuitive judgments. And it does most of this without your conscious awareness of its activities. System 1 is also, as we will see in the following chapters, the origin of many of the systematic errors in your intuitions.
几年前,心理学家蒂莫西-威尔逊(Timothy Wilson)写了一本书,书名是《 我们自己的陌生人》(Strangers to Ourselves ),现在你已经 ,你心中的 那个陌生人 可能控制着你的很多事情,尽管你很少能看到它的影子。系统1提供的印象往往变成你的信念,是冲动的来源,而冲动往往变成你的选择和你的行动。它对发生在你身上和你周围的事情提供一种默契的解释,将现在与最近的过去 ,并与对不久的将来的期望联系起来。它包含了世界的模型,可以立即将事件评价为正常或令人惊讶。它是你快速和经常精确的直觉判断的来源。 而且它做的大部分事情是在你没有意识到它的活动的情况下进行的。 正如我们在下面的章节中看到的,系统1也是你直觉中许多系统性错误的来源。
SPEAKING OF PRIMING
说到打底
“The sight of all these people in uniforms does not prime creativity.”
"看到所有这些穿着制服的人,并不能激发出创造力。"
“The world makes much less sense than you think. The coherence comes mostly from the way your mind works.”
"这个世界比你想象的要少得多。连贯性主要来自于你的思想工作方式。"
“They were primed to find flaws, and this is exactly what they found.”
"他们被引诱去寻找缺陷,而这正是他们所发现的。"
“His System 1 constructed a story, and his System 2 believed it. It happens to all of us.”
"他的系统1构建了一个故事,而他的系统2相信了它。这发生在我们所有人身上。"
“I made myself smile and I’m actually feeling better!”
"我让自己微笑,实际上我 ,感觉好多了!"
5
5
Cognitive Ease
认知的轻松
Whenever you are conscious, and perhaps even when you are not, multiple computations are going on in your brain, which maintain and update current answers to some key questions: Is anything new going on? Is there a threat? Are things going well? Should my attention be redirected? Is more effort needed for this task? You can think of a cockpit, with a set of dials that indicate the current values of each of these essential variables. The assessments are carried out automatically by System 1, and one of their functions is to determine whether extra effort is required from System 2.
每当你有意识的时候,甚至在你没有意识的时候,你的大脑中都在进行多种计算,这些计算维持并更新着对一些关键问题的当前答案:是否有新的情况发生?是否有 威胁?事情进展顺利吗?我的注意力是否应该被转移?这项任务是否需要更多的努力?你可以想到一个驾驶舱,里面有一组表盘,显示这些基本变量的当前值。 评估是由系统1自动进行的,其功能之一是确定是否需要系统2的额外努力。
One of the dials measures cognitive ease , and its range is between “Easy” and “Strained.” Easy is a sign that things are going well—no threats, no major news, no need to redirect attention or mobilize effort. Strained indicates that a problem exists, which will require increased mobilization of System 2. Conversely, you experience cognitive strain . Cognitive strain is affected by both the current level of effort and the presence of unmet demands. The surprise is that a single dial of cognitive ease is connected to a large network of diverse inputs and outputs . Figure 5 tells the story.
其中一个转盘衡量 认知的难易程度 ,其范围在 "容易 "和 "紧张 " 之间。轻松是一种迹象,表明事情进展顺利,没有威胁,没有重大新闻,不需要调整注意力或调动努力。紧张表明存在一个问题,这将需要增加系统2的动员。 反之,你会经历 认知紧张 。认知紧张既受当前努力程度的影响,也受 ,存在未满足的要求。令人惊讶的是,一个认知轻松的表盘与一个由 不同输入和输出 组成的大型网络相连接。 图5 讲述了这个故事。
The figure suggests that a sentence that is printed in a clear font, or has been repeated, or has been primed, will be fluently processed with cognitive ease. Hearing a speaker when you are in a good mood, or even when you have a pencil stuck crosswise in your mouth to make you “smile,” also induces cognitive ease.
该图表明,用清晰的字体印刷的句子,或经过重复的句子,或经过捯饬的句子,会在认知上得到流畅的处理。在你心情好的时候听演讲,甚至在你把铅笔交叉插在嘴里让你 "微笑 "的时候 ,也能诱发认知上的轻松。

Figure 5. Causes and Consequences of Cognitive Ease
图5:认知易的原因和后果
The various causes of ease or strain have interchangeable effects. When you are in a state of cognitive ease, you are probably in a good mood, like what you see, believe what you hear, trust your intuitions, and feel that the current situation is comfortably familiar. You are also likely to be relatively casual and superficial in your thinking. When you feel strained, you are more likely to be vigilant and suspicious, invest more effort in what you are doing, feel less comfortable, and make fewer errors, but you also are less intuitive and less creative than usual.
导致轻松或紧张的各种原因具有可互换的效果。 当你处于认知轻松的状态时,你可能心情很好,喜欢你所看到的,相信你所听到的,相信你的直觉,并觉得目前的情况是舒适熟悉的。 当你感到紧张时,你更有可能保持警惕和怀疑,在你所做的事情上投入更多的精力,感觉不那么舒服,犯的错误也更少,但你的直觉也比平时少,创造力也更少。
ILLUSIONS OF REMEMBERING
记忆的幻觉
The word illusion brings visual illusions to mind, because we are all familiar with pictures that mislead. But vision is not the only domain of illusions; memory is also susceptible to them, as is thinking more generally.
幻觉 这个词让人想到视觉上的错觉,因为我们都熟悉那些误导人的图片。但视觉并不是唯一的幻觉领域; 记忆也容易受到幻觉的影响,正如更普遍的思维一样。
David Stenbill, Monica Bigoutski, Shana Tirana. I just made up these names. If you encounter any of them within the next few minutes you are likely to remember where you saw them. You know, and will know for a while, that these are not the names of minor celebrities. But suppose that a few days from now you are shown a long list of names, including some minor celebrities and “new” names of people that you have never heard of; your task will be to check every name of a celebrity in the list. There is a substantial probability that you will identify David Stenbill as a well-known person, although you will not (of course) know whether you encountered his name in the context of movies, sports, or politics. Larry Jacoby, the psychologist who first demonstrated this memory illusion in the laboratory, titled his article “Becoming Famous Overnight.” How does this happen? Start by asking yourself how you know whether or not someone is famous. In some cases of truly famous people (or of celebrities in an area you follow), you have a mental file with rich information about a person—think Albert Einstein, Bono, Hillary Clinton. But you will have no file of information about David Stenbill if you encounter his name in a few days. All you will have is a sense of familiarity—you have seen this name somewhere.
David Stenbill, Monica Bigoutski, Shana Tirana.我只是编造了这些名字。如果你在接下来的几分钟内遇到他们中的任何一个,你很可能记得你在哪里看到过他们。 你知道,并且会在一段时间内知道,这些不是小名人的名字。但是,假设从现在开始,你会看到一长串 ,包括一些小名人和你从未听说过的 "新 "名字;你的任务是检查列表中的每个名人的名字。有很大概率你会把大卫-斯滕比尔认定为一个知名人士,尽管你不会(当然)知道你是否在电影、体育或政治方面遇到过他的名字。 拉里-雅各比,这位在实验室中首次证明这种记忆错觉的 心理学家,将他的文章命名为 "一夜成名" 。这种情况是如何发生的?首先要问自己,你是如何知道某人是否出名的。在一些真正的名人(或你所关注的某个领域的名人)的情况下,你有一个心理档案,里面有关于一个人的丰富信息--想想 阿尔伯特-爱因斯坦、博诺、希拉里-克林顿。 但是,如果你在几天后遇到大卫-斯滕比尔的名字,你将没有关于他的信息档案。 你所拥有的只是一种熟悉感--你曾在某个地方见过这个名字。
Jacoby nicely stated the problem : “The experience of familiarity has a simple but powerful quality of ‘pastness’ that seems to indicate that it is a direct reflection of prior experience.” This quality of pastness is an illusion. The truth is, as Jacoby and many followers have shown, that the name David Stenbill will look familiar when you see it because you will see it more clearly . Words that you have seen before become easier to see again—you can identify them better than other words when they are shown very briefly or masked by noise, and you will be quicker (by a few hundredths of a second) to read them than to read other words. In short, you experience greater cognitive ease in perceiving a word you have seen earlier, and it is this sense of ease that gives you the impression of familiarity .
雅各比 很好地说明了这个问题 :"熟悉的经验有一种简单而强大的'过去性'的品质,似乎表明它是先前经验的直接反映"。这种pastness的特质 ,是一种错觉。事实是,正如雅各比和许多追随者所证明的那样,当你看到大卫-斯滕比尔这个名字时,会显得很熟悉 ,因为你会更清楚地看到它 。你以前见过的词会变得更容易再次看到--当它们被非常短暂地显示出来或被噪音掩盖时,你能比其他词更好地识别它们,而且你阅读 ,会比阅读其他词更快(几百分之一秒)。简而言之,你在感知一个你以前见过的词时,会体验到更大的认知轻松感,正是这种轻松感让你产生了 熟悉的印象 。
Figure 5 suggests a way to test this. Choose a completely new word, make it easier to see, and it will be more likely to have the quality of pastness. Indeed, a new word is more likely to be recognized as familiar if it is unconsciously primed by showing it for a few milliseconds just before the test, or if it is shown in sharper contrast than some other words in the list. The link also operates in the other direction. Imagine you are shown a list of words that are more or less out of focus. Some of the words are severely blurred, others less so, and your task is to identify the words that are shown more clearly. A word that you have seen recently will appear to be clearer than unfamiliar words. As figure 5 indicates, the various ways of inducing cognitive ease or strain are interchangeable; you may not know precisely what it is that makes things cognitively easy or strained. This is how the illusion of familiarity comes about.
图5 提出了一个测试方法。 选择一个全新的词,让它更容易被看到,它将更有可能具有pastness的特质。事实上,如果在测试前无意识地显示一个新词几毫秒,或者如果它比列表中的其他一些词有更鲜明的对比,那么这个新词更有可能被识别为熟悉的 。这种联系也在另一个方向上运作。 想象一下,给你看一个或多或少失焦的单词列表。有些单词严重模糊,有些则不那么模糊,你的任务是找出 ,使其显示得更清楚。一个你最近见过的词会比不熟悉的词显得更清晰。 如图5 所示,诱发认知轻松或紧张的各种方式是可以互换的;你可能并不确切知道是什么让事情在认知上变得轻松或紧张。 这就是熟悉感的错觉是如何产生的。
ILLUSIONS OF TRUTH
真理的幻觉
“New York is a large city in the United States.” “The moon revolves around Earth.” “A chicken has four legs.” In all these cases, you quickly retrieved a great deal of related information, almost all pointing one way or another. You knew soon after reading them that the first two statements are true and the last one is false. Note, however, that the statement “A chicken has three legs” is more obviously false than “A chicken has four legs.” Your associative machinery slows the judgment of the latter sentence by delivering the fact that many animals have four legs, and perhaps also that supermarkets often sell chicken legs in packages of four. System 2 was involved in sifting that information, perhaps raising the issue of whether the question about New York was too easy, or checking the meaning of revolves .
"纽约是美国的一个大城市, "。"月球围绕地球旋转"。"一只鸡有四条腿"。在所有这些案例中,你很快就检索到了大量的相关信息,几乎都指向一个方向。 你在阅读后很快就知道,前两个说法是真的,最后一个是假的。然而,请注意,"一只鸡有三条腿 "的说法比 "一只鸡有 四条腿 "更明显是错误的。你的联想机制减缓了对后一个句子的判断,因为它提供了许多动物有四条腿的事实,也许还提供了超市经常出售四条包装的鸡腿。系统2参与了筛选那些 的信息,也许提出了关于纽约的问题是否太简单,或者检查 旋转 的意义。
Think of the last time you took a driving test. Is it true that you need a special license to drive a vehicle that weighs more than three tons? Perhaps you studied seriously and can remember the side of the page on which the answer appeared, as well as the logic behind it. This is certainly not how I passed driving tests when I moved to a new state. My practice was to read the booklet of rules quickly once and hope for the best. I knew some of the answers from the experience of driving for a long time. But there were questions where no good answer came to mind, where all I had to go by was cognitive ease. If the answer felt familiar, I assumed that it was probably true. If it looked new (or improbably extreme), I rejected it. The impression of familiarity is produced by System 1, and System 2 relies on that impression for a true/false judgment.
想想你上次参加驾驶考试时, 。驾驶超过三吨重的车辆真的需要特殊的执照吗?也许你认真学习了,可以记住答案出现在哪一页的哪一面,以及背后的逻辑。当我搬到一个新的州时,这当然不是我通过驾驶考试的方式。我的做法是,一旦 ,快速阅读规则手册,并希望得到最好的结果。从长期的驾驶经验中,我知道一些答案。 但有些问题我没有想到好的答案,我只能靠认知上的轻松。如果答案感觉很熟悉,我就认为它可能是真的。如果它看起来很新(或不可能的极端),我就拒绝它。熟悉的印象是由系统1产生的,系统2依靠这一印象进行真/假判断, 。
The lesson of figure 5 is that predictable illusions inevitably occur if a judgment is based on an impression of cognitive ease or strain. Anything that makes it easier for the associative machine to run smoothly will also bias beliefs. A reliable way to make people believe in falsehoods is frequent repetition, because familiarity is not easily distinguished from truth. Authoritarian institutions and marketers have always known this fact. But it was psychologists who discovered that you do not have to repeat the entire statement of a fact or idea to make it appear true. People who were repeatedly exposed to the phrase “the body temperature of a chicken” were more likely to accept as true the statement that “the body temperature of a chicken is 144°” (or any other arbitrary number). The familiarity of one phrase in the statement sufficed to make the whole statement feel familiar, and therefore true. If you cannot remember the source of a statement, and have no way to relate it to other things you know, you have no option but to go with the sense of cognitive ease.
图5 的教训是,如果判断是基于认知轻松或紧张的印象,就不可避免地出现可预测的错觉。任何使联想机器更容易顺利运行的东西也会使信仰产生偏差。 使人们相信假话的一个可靠方法是经常重复,因为熟悉的东西不容易与真理区分 。专制机构和营销人员一直都知道这个事实。 但是,是心理学家发现,你不必重复一个事实或想法的全部陈述来使它看起来是真的。反复接触到 "鸡的体温 " 这一短语的人更有可能接受 "鸡的体温是144°"这一说法为真 (或任何其他任意数字)。声明中一个短语的熟悉程度足以使整个声明感到熟悉,因此是真的。如果你记不起一个说法的来源,也没有办法把它与你知道的其他事情联系起来,你就没有选择,只能跟着认知的轻松感走。
HOW TO WRITE A PERSUASIVE MESSAGE
如何写出有说服力的信息
Suppose you must write a message that you want the recipients to believe. Of course, your message will be true, but that is not necessarily enough for people to believe that it is true. It is entirely legitimate for you to enlist cognitive ease to work in your favor, and studies of truth illusions provide specific suggestions that may help you achieve this goal.
假设你必须写一条信息,你 ,希望收件人相信。当然,你的信息将是真实的,但这并不一定足以让人们相信它是真的。你争取认知上的轻松对你有利是完全合法的,对 真相幻觉 的研究提供了具体的建议,可以帮助你实现这一目标。
The general principle is that anything you can do to reduce cognitive strain will help, so you should first maximize legibility. Compare these two statements:
一般的原则是,任何你能做到的减少认知 strain ,都会有帮助,所以你应该首先最大限度地提高可读性。 比较这两种说法。
Adolf Hitler was born in 1892.
阿道夫-希特勒生于1892年。
Adolf Hitler was born in 1887.
阿道夫-希特勒生于1887年。
Both are false (Hitler was born in 1889), but experiments have shown that the first is more likely to be believed. More advice: if your message is to be printed, use high-quality paper to maximize the contrast between characters and their background. If you use color, you are more likely to be believed if your text is printed in bright blue or red than in middling shades of green, yellow, or pale blue.
两者都是假的(希特勒生于1889年),但实验表明,第一种说法更容易被相信。更多建议:如果你的信息要印刷,请使用高质量的纸张,以最大限度地提高字符和其背景之间的对比度。 如果你使用颜色,如果你的文字是用明亮的蓝色或红色印刷,比用绿色、黄色或淡蓝色等中庸的色调更容易被接受。
If you care about being thought credible and intelligent, do not use complex language where simpler language will do. My Princeton colleague Danny Oppenheimer refuted a myth prevalent among undergraduates about the vocabulary that professors find most impressive. In an article titled “Consequences of Erudite Vernacular Utilized Irrespective of Necessity: Problems with Using Long Words Needlessly,” he showed that couching familiar ideas in pretentious language is taken as a sign of poor intelligence and low credibility .
如果你希望被认为是可信的和聪明的,就不要在简单的语言中使用复杂的语言。 我在普林斯顿的同事丹尼-奥本海默驳斥了一个在本科生中流行的关于教授们认为最令人印象深刻的词汇的神话, 。在一篇题为 "不顾必要性地使用博学的白话的后果:不必要地使用长词的问题 "的文章中,他表明用自命不凡的语言来表达熟悉的观点被认为是智力 低下 和 可信度低的 标志。
In addition to making your message simple, try to make it memorable. Put your ideas in verse if you can; they will be more likely to be taken as truth. Participants in a much cited experiment read dozens of unfamiliar aphorisms, such as:
除了使你的信息简单外,还要尽量使它令人难忘。 如果可以的话,把你的想法写成诗句;它们会 ,更容易被当作真理。在一项被广泛引用的实验中,参与者阅读了几十条陌生的箴言,例如。
Woes unite foes.
灾难使敌人联合起来。
Little strokes will tumble great oaks.
小小的一笔就能让伟大的橡树倒下。
A fault confessed is half redressed.
忏悔的过错已被纠正了一半。
Other students read some of the same proverbs transformed into nonrhyming versions:
其他学生阅读了一些转化为非韵文版本的相同谚语。
Woes unite enemies.
困境使敌人团结起来。
Little strokes will tumble great trees.
小笔一挥,大树倾倒。
A fault admitted is half redressed.
承认的过错已经得到了一半的纠正。
The aphorisms were judged more insightful when they rhymed than when they did not.
这些警句 在押韵时比 不押韵时被认为更有见地。
Finally, if you quote a source, choose one with a name that is easy to pronounce. Participants in an experiment were asked to evaluate the prospects of fictitious Turkish companies on the basis of reports from two brokerage firms. For each stock, one of the reports came from an easily pronounced name (e.g., Artan) and the other report came from a firm with an unfortunate name (e.g., Taahhut). The reports sometimes disagreed. The best procedure for the observers would have been to average the two reports, but this is not what they did. They gave much more weight to the report from Artan than to the report from Taahhut. Remember that System 2 is lazy and that mental effort is aversive. If possible, the recipients of your message want to stay away from anything that reminds them of effort, including a source with a complicated name.
最后, ,如果你引用一个来源,请选择一个名字容易发音的。在一项实验中,参与者被要求根据两家经纪公司的报告评估 虚构的土耳其公司 的前景。对于每只股票,其中一份报告来自一个容易发音的名字(如Artan) ,另一份报告来自一个名字不吉利的公司(如Taahhut)。这些报告有时会出现分歧。观察员的最佳程序是对两份报告进行平均,但他们没有这样做。他们对阿尔坦的报告比对塔胡特的报告更重视。请记住,系统2是懒惰的,心理努力是厌恶的。如果可能的话,你的信息的接收者 ,希望远离任何让他们想起努力的东西,包括有复杂名字的来源。
All this is very good advice, but we should not get carried away. High-quality paper, bright colors, and rhyming or simple language will not be much help if your message is obviously nonsensical, or if it contradicts facts that your audience knows to be true. The psychologists who do these experiments do not believe that people are stupid or infinitely gullible. What psychologists do believe is that all of us live much of our life guided by the impressions of System 1—and we often do not know the source of these impressions. How do you know that a statement is true? If it is strongly linked by logic or association to other beliefs or preferences you hold, or comes from a source you trust and like, you will feel a sense of cognitive ease. The trouble is that there may be other causes for your feeling of ease—including the quality of the font and the appealing rhythm of the prose—and you have no simple way of tracing your feelings to their source. This is the message of figure 5 : the sense of ease or strain has multiple causes, and it is difficult to tease them apart. Difficult, but not impossible. People can overcome some of the superficial factors that produce illusions of truth when strongly motivated to do so. On most occasions, however, the lazy System 2 will adopt the suggestions of System 1 and march on.
所有这些都是非常好的建议,但我们不应该被冲昏头脑。如果你的信息显然是无稽之谈,或者与你的听众知道的事实相矛盾,那么高质量的纸张、鲜艳的色彩、押韵或简单的语言都不会有什么帮助。做这些实验的心理学家 ,他们不相信人们是愚蠢的或无限轻信的。心理学家相信的是,我们所有人在生活中的大部分时间都被系统1的印象所引导,而我们往往不知道这些印象的来源。 你怎么知道一个说法是真的?如果它通过逻辑或联想与你持有的其他信念或偏好紧密相连,或来自 你信任和喜欢的来源,你会感到一种认知上的轻松。问题是,你的轻松感可能还有其他原因--包括字体的质量和散文吸引人的节奏--而你没有简单的方法来追踪你的感觉,找到它们的来源。这就是 图5 的信息:轻松或紧张的感觉有多种原因,很难把它们 。困难,但不是不可能。人们可以克服一些表面的因素,在强烈的动机下,产生真理的幻觉。然而,在大多数情况下,懒惰的系统2会采纳系统1的建议并继续前进。
STRAIN AND EFFORT
吃力不讨好
The symmetry of many associative connections was a dominant theme in the discussion of associative coherence. As we saw earlier, people who are made to “smile” or “frown” by sticking a pencil in their mouth or holding a ball between their furrowed brows are prone to experience the emotions that frowning and smiling normally express. The same self-reinforcing reciprocity is found in studies of cognitive ease. On the one hand, cognitive strain is experienced when the effortful operations of System 2 are engaged. On the other hand, the experience of cognitive strain, whatever its source, tends to mobilize System 2, shifting people’s approach to problems from a casual intuitive mode to a more engaged and analytic mode .
许多联想连接的对称性是讨论联想一致性的一个主导主题。正如我们前面所看到的, ,通过把铅笔塞进嘴里或在皱起的眉头之间拿着一个球来让他们 "微笑 "或 "皱眉",他们很容易体验到皱眉和微笑通常表达的情绪。同样的自我强化的互惠性在对认知轻松性的研究中也被发现。一方面,认知 ,当系统2的努力运作被调动起来时,就会出现压力。另一方面 ,认知压力的体验,无论其来源如何,都倾向于调动系统2,将人们处理问题的方式从随意的直觉模式转变为更加 投入和分析的模式 。
The bat-and-ball problem was mentioned earlier as a test of people’s tendency to answer questions with the first idea that comes to their mind, without checking it. Shane Frederick’s Cognitive Reflection Test consists of the bat-and-ball problem and two others, all chosen because they evoke an immediate intuitive answer that is incorrect. The other two items in the CRT are:
前面提到蝙蝠和球的问题是对人们倾向于用他们脑海中的第一个想法来回答问题的测试,而不去检查它。 Shane Frederick的认知反思 测试包括蝙蝠和球的问题和其他两个问题,所有被选择的问题都是因为它们唤起了一个直接的直觉答案,而这个答案是不正确的。 CRT的其他两个项目是。
If it takes 5 machines 5 minutes to make 5 widgets, how long would it take 100 machines to make 100 widgets?
如果5台机器需要5分钟来制作5个小部件,那么100台机器需要多长时间来制作100个小部件?
100 minutes OR 5 minutes
100分钟或5分钟
In a lake, there is a patch of lily pads. Every day, the patch doubles in size. If it takes 48 days for the patch to cover the entire lake, how long would it take for the patch to cover half of the lake?
在一个湖里,有一片荷花池,每天,这片荷花池的面积都会增加一倍。如果 ,补丁需要48天才能覆盖整个湖泊,那么补丁需要多长时间才能覆盖一半的湖泊?
24 days OR 47 days
24天或47天
The correct answers to both problems are in the footnote. fn1 The experimenters recruited 40 Princeton students to take the CRT. Half of them saw the puzzles in a small font in washed-out gray print. The puzzles were legible, but the font induced cognitive strain. The results tell a clear story: 90% of the students who saw the CRT in normal font made at least one mistake in the test, but the proportion dropped to 35% when the font was barely legible. You read this correctly: performance was better with the bad font. Cognitive strain, whatever its source, mobilizes System 2, which is more likely to reject the intuitive answer suggested by System 1.
这两个问题的正确答案都在脚注中。 fn1 实验者招募了40名普林斯顿学生参加CRT。他们中的一半人看到谜题是用洗过的灰色印刷品的小字体。谜题是可读的,但字体引起了认知 。结果告诉我们一个清晰的故事:90%的学生在看到正常字体的CRT时,在测试中至少犯了一个错误,但当字体几乎无法辨认时,这一比例下降到35%。 你没看错:坏字体的表现更好。认知压力,无论其来源如何,都会调动系统2,它更有可能拒绝系统1所建议的直观答案。
THE PLEASURE OF COGNITIVE EASE
认知的乐趣
An article titled “Mind at Ease Puts a Smile on the Face” describes an experiment in which participants were briefly shown pictures of objects . Some of these pictures were made easier to recognize by showing the outline of the object just before the complete image was shown, so briefly that the contours were never noticed. Emotional reactions were measured by recording electrical impulses from facial muscles, registering changes of expression that are too slight and too brief to be detectable by observers. As expected, people showed a faint smile and relaxed brows when the pictures were easier to see. It appears to be a feature of System 1 that cognitive ease is associated with good feelings.
一篇题为 "心态平和使人微笑 "的文章描述了一项实验,在该实验中,参与者被短暂地展示了 物体的图片 。其中一些图片通过在显示完整的图像之前显示物体的轮廓而变得更容易识别,如此短暂,以至于从未注意到轮廓 。通过记录来自面部肌肉的 电脉冲来测量情绪反应,记录表情的变化,这些变化太轻微,太短暂,观察者无法发现。正如预期的那样,当图片更容易看到时,人们表现出淡淡的微笑和放松的眉头。这似乎是系统1的一个特点,即认知上的轻松与良好的感觉有关。
As expected, easily pronounced words evoke a favorable attitude. Companies with pronounceable names do better than others for the first week after the stock is issued, though the effect disappears over time. Stocks with pronounceable trading symbols (like KAR or LUNMOO) outperform those with tongue-twisting tickers like PXG or RDO—and they appear to retain a small advantage over some time. A study conducted in Switzerland found that investors believe that stocks with fluent names like Emmi, Swissfirst, and Comet will earn higher returns than those with clunky labels like Geberit and Ypsomed.
正如预期的那样,容易发音的词能唤起人们的好感。 在股票发行后的第一周,名字容易发音的公司比其他公司表现更好,尽管这种影响随着时间的推移而消失。具有可读交易符号(如KAR或LUNMOO)的股票表现优于PXG或RDO等绕口的股票,而且它们似乎在一段时间内还能保持 小的优势 。 在瑞士进行的一项研究发现,投资者认为, ,具有Emmi、Swissfirst和Comet等流利名称的股票将比具有Geberit和Ypsomed等 笨重标签 的股票获得更高的回报。
As we saw in figure 5 , repetition induces cognitive ease and a comforting feeling of familiarity. The famed psychologist Robert Zajonc dedicated much of his career to the study of the link between the repetition of an arbitrary stimulus and the mild affection that people eventually have for it. Zajonc called it the mere exposure effect . A demonstration conducted in the student newspapers of the University of Michigan and of Michigan State University is one of my favorite experiments . For a period of some weeks, an ad-like box appeared on the front page of the paper, which contained one of the following Turkish (or Turkish-sounding) words: kadirga , saricik , biwonjni , nansoma , and iktitaf . The frequency with which the words were repeated varied: one of the words was shown only once, the others appeared on two, five, ten, or twenty-five separate occasions. (The words that were presented most often in one of the university papers were the least frequent in the other.) No explanation was offered, and readers’ queries were answered by the statement that “the purchaser of the display wished for anonymity.”
正如我们在 图5 中看到的那样,重复会诱发认知上的轻松和熟悉的舒适感。 著名的心理学家罗伯特-扎琼克(Robert Zajonc)将其职业生涯的大部分时间用于研究任意刺激物的重复与 ,人们最终对其产生温和的喜爱。扎琼克将其称为 单纯的接触效应 。在密歇根大学和密歇根州立大学的学生报纸上进行的一项示范是我 最喜欢的实验 之一。 在几个星期的时间里,报纸的头版出现了一个类似广告的盒子,里面有以下土耳其语(或听起来像土耳其语)的单词之一: kadirga , saricik , biwonjni , nansoma , and iktitaf.这些词被重复的频率各不相同:其中一个词只出现了一次,其他的则分别出现了两次、五次、十次或二十五次。(在其中一份大学论文中出现频率最高的词在另一份论文中出现的频率最低。)没有提供任何解释,读者的询问得到的答复是: "购买展示的人希望匿名"。
When the mysterious series of ads ended, the investigators sent questionnaires to the university communities, asking for impressions of whether each of the words “means something ‘good’ or something ‘bad.’” The results were spectacular: the words that were presented more frequently were rated much more favorably than the words that had been shown only once or twice. The finding has been confirmed in many experiments, using Chinese ideographs, faces, and randomly shaped polygons.
当这一系列神秘的广告结束后,调查人员向大学社区发出调查问卷,询问人们对每个词 "意味着什么'好'或什么'坏'"的印象。结果是惊人的:比起那些只被 、展示过一两次的单词,更频繁展示的单词得到的评价要好得多。这一发现在许多实验中得到了证实,使用的是中文表意文字、面孔和随机形状的多边形。
The mere exposure effect does not depend on the conscious experience of familiarity. In fact, the effect does not depend on consciousness at all: it occurs even when the repeated words or pictures are shown so quickly that the observers never become aware of having seen them. They still end up liking the words or pictures that were presented more frequently. As should be clear by now, System 1 can respond to impressions of events of which System 2 is unaware. Indeed, the mere exposure effect is actually stronger for stimuli that the individual never consciously sees .
,单纯的接触效应并不取决于熟悉的意识体验。事实上,这种效应根本不取决于意识:即使重复的词语或图片显示得如此之快,以至于观察者从未意识到 ,也会发生这种效应。他们最终还是会喜欢那些被更频繁呈现的词语或图片。现在应该很清楚,系统1可以对系统2不知道的事件的印象做出反应。事实上,对于个体 从未有意识地看到的 刺激物,单纯的暴露效应实际上更强。
Zajonc argued that the effect of repetition on liking is a profoundly important biological fact, and that it extends to all animals. To survive in a frequently dangerous world, an organism should react cautiously to a novel stimulus, with withdrawal and fear. Survival prospects are poor for an animal that is not suspicious of novelty. However, it is also adaptive for the initial caution to fade if the stimulus is actually safe. The mere exposure effect occurs, Zajonc claimed, because the repeated exposure of a stimulus is followed by nothing bad. Such a stimulus will eventually become a safety signal, and safety is good. Obviously, this argument is not restricted to humans. To make that point, one of Zajonc’s associates exposed two sets of fertile chicken eggs to different tones. After they hatched, the chicks consistently emitted fewer distress calls when exposed to the tone they had heard while inhabiting the shell .
Zajonc认为,重复对喜好的影响是一个深刻而重要的 生物学事实,它延伸到所有的动物。为了在一个经常有危险的世界中生存,一个有机体应该对新的刺激作出谨慎的反应,有退缩和恐惧。 对于一个不怀疑新事物的动物,生存前景是很差的。然而,如果刺激物实际上是安全的,那么最初的谨慎也是适应性的,会逐渐消失。Zajonc声称,仅仅是暴露效应的发生, ,因为重复暴露一个刺激后没有什么坏处。这样的刺激最终会成为一个安全信号,而安全是好事。显然,这种说法并不限于人类。为了说明这一点,Zajonc的一位同事将两组受精鸡蛋暴露在不同的色调下。孵化后,当雏鸟接触到它们在 壳内时 听到的音调时, ,它们发出的求救信号始终较少。
Zajonc offered an eloquent summary of his program of research:
扎翁克对他的研究计划进行了雄辩的总结。
The consequences of repeated exposures benefit the organism in its relations to the immediate animate and inanimate environment. They allow the organism to distinguish objects and habitats that are safe from those that are not, and they are the most primitive basis of social attachments. Therefore, they form the basis for social organization and cohesion—the basic sources of psychological and social stability.
反复暴露的 后果 有利于有机体与直接的有生命和无生命环境的关系。它们使有机体能够区分安全的物体和栖息地与不安全的物体和栖息地,它们是社会 依恋的最原始的基础。因此,它们构成社会组织和凝聚力的基础--心理和社会稳定的基本来源。
The link between positive emotion and cognitive ease in System 1 has a long evolutionary history.
系统1中积极情绪和认知轻松之间的联系有很长的进化历史。
EASE, MOOD, AND INTUITION
轻松,心情,和直觉
Around 1960, a young psychologist named Sarnoff Mednick thought he had identified the essence of creativity. His idea was as simple as it was powerful: creativity is associative memory that works exceptionally well. He made up a test, called the Remote Association Test (RAT), which is still often used in studies of creativity.
1960年左右,一位名叫萨诺夫-梅德尼克的年轻心理学家认为他已经确定了创造力的本质。他的想法既简单 ,又有力:创造力是效果特别好的关联记忆。他 ,编造了一个测试,称为远程联想测试(RAT),至今仍经常被用于创造力的研究。
For an easy example, consider the following three words:
对于一个简单的例子,请考虑以下三个词。
cottage Swiss cake
山寨瑞士蛋糕
Can you think of a word that is associated with all three? You probably worked out that the answer is cheese . Now try this:
你能想到一个与这三者都有关的词吗?你可能已经想到了,答案是 奶酪 。现在,请尝试 这个。
dive light rocket
潜水轻型火箭
This problem is much harder, but it has a unique correct answer, which every speaker of English recognizes, although less than 20% of a sample of students found it within 15 seconds. The answer is sky . Of course, not every triad of words has a solution. For example, the words dream , ball , book do not have a shared association that everyone will recognize as valid.
这个问题要难得多,但它有一个独特的正确答案,每个讲英语的人都能认出这个答案,尽管只有不到20%的学生在15秒内找到这个答案。答案是 天空 。当然,并不是每一个 三联词 都有解决方案。 例如, 梦 、 球 、 书 这些词并没有一个大家都认可的共同联想。
Several teams of German psychologists that have studied the RAT in recent years have come up with remarkable discoveries about cognitive ease. One of the teams raised two questions: Can people feel that a triad of words has a solution before they know what the solution is? How does mood influence performance in this task? To find out, they first made some of their subjects happy and others sad, by asking them to think for several minutes about happy or sad episodes in their lives. Then they presented these subjects with a series of triads, half of them linked (such as dive , light , rocket ) and half unlinked (such as dream , ball , book ), and instructed them to press one of two keys very quickly to indicate their guess about whether the triad was linked. The time allowed for this guess, 2 seconds, was much too short for the actual solution to come to anyone’s mind.
近年来,几个研究RAT的德国心理学家团队在认知轻松度方面有了显著的发现。其中一个小组提出了两个问题:人们在知道解决方案是什么之前,能否感觉到一个三段式的词语有一个解决方案?心情如何影响这项任务的表现?为了找出答案,他们首先让一些受试者感到高兴,另一些则感到悲伤,方法是: ,让他们思考几分钟他们生活中的快乐或悲伤事件。然后,他们向这些受试者展示了一系列的三要素,其中一半是有联系的(如 潜水 、 光 、 火箭 ),一半是没有联系的( 如梦 、 球 、 书 ),并指示他们快速按下两个键中的一个,以表明他们对三要素是否有联系的猜测。允许这种猜测的时间为2秒, ,这对任何人的实际解决方案来说都太短。
The first surprise is that people’s guesses are much more accurate than they would be by chance. I find this astonishing. A sense of cognitive ease is apparently generated by a very faint signal from the associative machine, which “knows” that the three words are coherent (share an association) long before the association is retrieved . The role of cognitive ease in the judgment was confirmed experimentally by another German team: manipulations that increase cognitive ease (priming, a clear font, pre-exposing words) all increase the tendency to see the words as linked.
第一个惊喜是,人们的猜测比偶然的猜测要准确得多。我觉得这很让人吃惊。认知上的轻松感显然是由联想机发出的一个非常微弱的信号产生的,它在 联想被检索 之前很长时间就 "知道 "这三个词是连贯的(共享一个联想) 。认知上的轻松感在判断中的作用被另一个德国团队在实验中证实: 增加认知轻松感的 操作(打底、清晰的字体、预先曝光的词)都会增加将这些词视为联系的倾向性。
Another remarkable discovery is the powerful effect of mood on this intuitive performance. The experimenters computed an “intuition index” to measure accuracy. They found that putting the participants in a good mood before the test by having them think happy thoughts more than doubled accuracy . An even more striking result is that unhappy subjects were completely incapable of performing the intuitive task accurately; their guesses were no better than random. Mood evidently affects the operation of System 1: when we are uncomfortable and unhappy, we lose touch with our intuition.
另一个引人注目的发现是情绪对这种直觉表现的强大影响。实验者计算了一个 "直觉指数 "来衡量准确性。 他们发现,在测试前让参与者处于良好的 ,让他们思考快乐的想法, 使准确性提高了一倍 以上。一个更惊人的结果是,不快乐的受试者完全没有能力准确地执行直觉任务;他们的猜测并不比随机的好。情绪显然会影响系统1的运作:当我们不舒服和不快乐的时候,我们就会与我们的直觉失去 。
These findings add to the growing evidence that good mood, intuition, creativity, gullibility, and increased reliance on System 1 form a cluster . At the other pole, sadness, vigilance, suspicion, an analytic approach, and increased effort also go together. A happy mood loosens the control of System 2 over performance: when in a good mood, people become more intuitive and more creative but also less vigilant and more prone to logical errors. Here again, as in the mere exposure effect, the connection makes biological sense. A good mood is a signal that things are generally going well, the environment is safe, and it is all right to let one’s guard down. A bad mood indicates that things are not going very well, there may be a threat, and vigilance is required. Cognitive ease is both a cause and a consequence of a pleasant feeling.
这些发现增加了越来越多的证据,即良好的情绪、直觉、创造力、轻信和增加对系统1的依赖 形成一个集群 。在另一极,悲伤、警惕、怀疑、分析的方法和增加的努力也是相辅相成的。快乐的心情会放松系统2对表现的控制:当心情好的时候,人们会变得更有直觉 ,更有创造力,但也不那么警觉,更容易出现逻辑错误。在这里,与单纯的暴露效应一样,这种联系也具有生物学意义。良好的情绪是一个信号,表明事情总体上进展顺利,环境安全,让人放松警惕是完全正确的。糟糕的情绪表明事情不太顺利,可能有威胁,需要警惕。认知 ,轻松既是愉快的感觉的原因,也是结果。
The Remote Association Test has more to tell us about the link between cognitive ease and positive affect. Briefly consider two triads of words:
遥感测试可以告诉我们更多关于认知轻松和积极情绪之间的联系。 简单地考虑两个三段式的单词。
sleep mail switch
睡眠邮件开关
salt deep foam
盐的深层泡沫
You could not know it, of course, but measurements of electrical activity in the muscles of your face would probably have shown a slight smile when you read the second triad, which is coherent ( sea is the solution). This smiling reaction to coherence appears in subjects who are told nothing about common associates; they are merely shown a vertically arranged triad of words and instructed to press the space bar after they have read it. The impression of cognitive ease that comes with the presentation of a coherent triad appears to be mildly pleasurable in itself.
当然,你不可能知道,但对你脸部肌肉电活动的测量可能会显示出轻微的微笑,当 ,你读了第二个三段论,这是连贯的 (海是 解决方案)。这种对连贯性的 微笑反应 出现在那些没有被告知关于共同联想的受试者身上;他们只是被显示了一个垂直排列的三段论,并被指示在他们读完之后按空格键。呈现连贯的三段式所带来的认知上的轻松印象,本身似乎就有轻微的 愉悦感。
The evidence that we have about good feelings, cognitive ease, and the intuition of coherence is, as scientists say, correlational but not necessarily causal. Cognitive ease and smiling occur together, but do the good feelings actually lead to intuitions of coherence? Yes, they do. The proof comes from a clever experimental approach that has become increasingly popular. Some participants were given a cover story that provided an alternative interpretation for their good feeling: they were told about music played in their earphones that “previous research showed that this music influences the emotional reactions of individuals.” This story completely eliminates the intuition of coherence. The finding shows that the brief emotional response that follows the presentation of a triad of words (pleasant if the triad is coherent, unpleasant otherwise) is actually the basis of judgments of coherence. There is nothing here that System 1 cannot do. Emotional changes are now expected, and because they are unsurprising they are not linked causally to the words.
我们所掌握的关于良好感觉、认知轻松和一致性直觉的证据,正如科学家所说,是相关的,但不一定是因果关系。认知上的轻松和微笑是一起发生的,但这种良好的感觉是否真的导致了一致性的直觉?是的,他们是这样做的。证据来自于一种聪明的实验方法,这种方法已经变得越来越流行。 ,一些参与者得到了一个封面故事,为他们的良好感觉提供了另一种解释:他们被告知关于他们的 耳机中播放的音乐, "以前的研究表明,这种音乐影响个人的情绪反应" 。这个故事完全消除了连贯性的直觉。这一发现表明,在呈现 ,一个三联词(如果三联词是连贯的,则是愉快的,否则是不愉快的)之后的短暂情绪反应实际上是连贯性判断的基础。这里没有什么是系统1不能做的。 情绪的变化现在是可以预期的,而且因为它们是不出人意料的,所以与文字没有因果关系。
This is as good as psychological research ever gets, in its combination of experimental techniques and in its results, which are both robust and extremely surprising. We have learned a great deal about the automatic workings of System 1 in the last decades. Much of what we now know would have sounded like science fiction thirty or forty years ago. It was beyond imagining that bad font influences judgments of truth and improves cognitive performance, or that an emotional response to the cognitive ease of a triad of words mediates impressions of coherence. Psychology has come a long way.
这是心理学研究有史以来最好的一次,它结合了各种实验技术和 ,其结果既稳健又极其令人惊讶。 在过去几十年里,我们对系统1的自动运作有了很大的了解。我们现在所知道的很多东西在三四十年前听起来就像是科幻小说。 人们无法想象坏的字体会影响对真理的判断并提高认知能力,也无法想象对认知的轻松程度的情感反应 ,而对连贯性的印象进行调解。心理学已经走过了漫长的道路。
SPEAKING OF COGNITIVE EASE
说到认识上的轻松
“Let’s not dismiss their business plan just because the font makes it hard to read.”
"我们不要因为字体让人难以阅读就否定他们的商业计划。"
“We must be inclined to believe it because it has been repeated so often, but let’s think it through again.”
"我们必须倾向于相信它,因为它已经被重复了很多次,但让我们再仔细想想。"
“Familiarity breeds liking. This is a mere exposure effect.”
"熟悉会滋生喜欢。这仅仅是一种曝光效应。"
“I’m in a very good mood today, and my System 2 is weaker than usual. I should be extra careful.”
"我今天心情非常好 ,我的系统2比平时更弱。我应该格外小心。"
6
6
Norms, Surprises, and Causes
规范、惊奇和原因
The central characteristics and functions of System 1 and System 2 have now been introduced, with a more detailed treatment of System 1. Freely mixing metaphors, we have in our head a remarkably powerful computer, not fast by conventional hardware standards, but able to represent the structure of our world by various types of associative links in a vast network of various types of ideas. The spreading of activation in the associative machine is automatic, but we (System 2) have some ability to control the search of memory, and also to program it so that the detection of an event in the environment can attract attention. We next go into more detail of the wonders and limitation of what System 1 can do.
现在已经介绍了系统1和系统2的核心特征和功能,并对系统1进行了更详细的处理。自由地混合比喻,我们的脑子里有一台非常强大的 ,按照传统的硬件标准,速度并不快,但能够在一个由各种类型的想法组成的巨大网络中通过各种类型的关联链接来表示我们世界的结构。联想机中激活的扩散是自动的,但我们(系统2)有一定的能力来控制记忆的搜索,也可以对其进行编程,使环境中的事件的检测 ,从而引起注意。我们接下来会更详细地介绍系统1所能做的奇迹和限制。
ASSESSING NORMALITY
评估正态性
The main function of System 1 is to maintain and update a model of your personal world, which represents what is normal in it. The model is constructed by associations that link ideas of circumstances, events, actions, and outcomes that co-occur with some regularity, either at the same time or within a relatively short interval. As these links are formed and strengthened, the pattern of associated ideas comes to represent the structure of events in your life, and it determines your interpretation of the present as well as your expectations of the future.
系统1的主要功能是维护和更新你个人世界的模型,该模型代表了其中的正常情况。该模型是由联想构建的,联想将有规律地共同出现的环境、事件、行动和结果的想法联系起来,要么 ,要么在一个相对较短的间隔内。随着这些联系的形成和加强,相关想法的模式代表了你生活中的事件结构,它决定了你对现在的解释以及你对未来的期望。
A capacity for surprise is an essential aspect of our mental life, and surprise itself is the most sensitive indication of how we understand our world and what we expect from it. There are two main varieties of surprise. Some expectations are active and conscious—you know you are waiting for a particular event to happen. When the hour is near, you may be expecting the sound of the door as your child returns from school; when the door opens you expect the sound of a familiar voice. You will be surprised if an actively expected event does not occur. But there is a much larger category of events that you expect passively; you don’t wait for them, but you are not surprised when they happen. These are events that are normal in a situation, though not sufficiently probable to be actively expected.
惊奇的能力是我们精神生活的一个重要方面,惊奇本身是最敏感的 ,表明我们如何理解我们的世界 ,以及我们对它的期望。有两个主要的惊喜品种。有些期望是主动的、有意识的--你知道你在等待某个特定事件的发生。 当时间临近时,你可能在期待孩子从学校回来时的开门声;当门打开时,你会期待一个熟悉的声音。如果一个主动期望的事件没有发生,你会感到惊讶 。但有一类更大的事件是你被动地期待的;你不等待它们,但当它们发生时你并不感到惊讶。 这些事件在某种情况下是正常的,虽然没有足够的可能性被主动期待。
A single incident may make a recurrence less surprising. Some years ago, my wife and I were vacationing in a small island resort on the Great Barrier Reef. There are only forty guest rooms on the island. When we came to dinner, we were surprised to meet an acquaintance, a psychologist named Jon. We greeted each other warmly and commented on the coincidence. Jon left the resort the next day. About two weeks later, we were in a theater in London. A latecomer sat next to me after the lights went down. When the lights came up for the intermission, I saw that my neighbor was Jon. My wife and I commented later that we were simultaneously conscious of two facts: first, this was a more remarkable coincidence than the first meeting; second, we were distinctly less surprised to meet Jon on the second occasion than we had been on the first. Evidently, the first meeting had somehow changed the idea of Jon in our minds. He was now “the psychologist who shows up when we travel abroad.” We (System 2) knew this was a ludicrous idea, but our System 1 had made it seem almost normal to meet Jon in strange places. We would have experienced much more surprise if we had met any acquaintance other than Jon in the next seat of a London theater. By any measure of probability, meeting Jon in the theater was much less likely than meeting any one of our hundreds of acquaintances—yet meeting Jon seemed more normal.
单一事件可能使复发不那么令人惊讶。几年前,我和妻子在大堡礁的一个小岛上度假, 。当我们来吃饭的时候,我们很惊讶地见到了一个熟人,一个叫乔恩的心理学家。我们热情地问候对方,并评论了这一巧合。乔恩第二天就离开了度假村。 大约两个星期后,我们在伦敦的一家剧院里。灯光熄灭后,一个迟到的人坐在我旁边。 当中场休息时灯光亮起,我看到我的邻居是乔恩。我的妻子和我后来评论说,我们同时意识到两个事实:第一,这是一个比第一次见面更显著的巧合;第二,我们在第二次见到乔恩时,明显 没有 第一次时那么惊讶。显而易见,第一次会面在某种程度上改变了我们心中对乔恩的想法 。他现在是 "我们出国旅行时出现的心理学家"。我们(系统2)知道这是一个可笑的想法,但我们的系统1已经使在陌生的地方遇到乔恩显得几乎正常。 如果我们在伦敦剧院的隔壁座位上遇到乔恩以外的任何熟人,我们会经历更多的惊讶。从任何概率上看,在剧院遇到乔恩的可能性 ,比遇到我们几百个熟人中的任何一个要小得多--但遇到乔恩似乎更正常。
Under some conditions, passive expectations quickly turn active, as we found in another coincidence. On a Sunday evening some years ago, we were driving from New York City to Princeton, as we had been doing every week for a long time. We saw an unusual sight: a car on fire by the side of the road. When we reached the same stretch of road the following Sunday, another car was burning there. Here again, we found that we were distinctly less surprised on the second occasion than we had been on the first. This was now “the place where cars catch fire.” Because the circumstances of the recurrence were the same, the second incident was sufficient to create an active expectation: for months, perhaps for years, after the event much less likely than meeting one of our hundreds of acquaintances— we were reminded of burning cars whenever we reached that spot of the road and were quite prepared to see another one (but of course we never did).
在某些条件下,被动的预期很快会变成主动的,正如我们在另一个巧合中发现的那样。几年前的一个星期天晚上,我们从纽约市开车到普林斯顿,这是我们长期以来每周都要做的事情。我们看到了一个不寻常的景象:一辆汽车在路边着火, 。当我们在下一个星期天到达同一路段时,另一辆汽车也在那里燃烧。在这里,我们再次发现,我们在第二个场合的惊讶程度明显低于第一个场合。这里现在是 "汽车起火的地方"。由于再次发生的情况相同,第二次事件足以产生一种积极的预期:在几个月内, ,也许几年内,在事件发生后,比遇到我们数百个熟人中的一个更不可能-- ,每当我们到达道路的那个地方,我们就会想起燃烧的汽车,并非常准备看到另一辆汽车(但当然我们从未看到)。
The psychologist Dale Miller and I wrote an essay in which we attempted to explain how events come to be perceived as normal or abnormal. I will use an example from our description of “norm theory,” although my interpretation of it has changed slightly:
心理学家戴尔-米勒和我写了一篇文章,其中我们试图解释事件如何被视为正常或不正常。我将 ,使用我们对 "规范理论 "的描述中的一个例子,尽管我对它的解释略有改变。
An observer , casually watching the patrons at a neighboring table in a fashionable restaurant, notices that the first guest to taste the soup winces, as if in pain. The normality of a multitude of events will be altered by this incident. It is now unsurprising for the guest who first tasted the soup to startle violently when touched by a waiter; it is also unsurprising for another guest to stifle a cry when tasting soup from the same tureen. These events and many others appear more normal than they would have otherwise, but not necessarily because they confirm advance expectations. Rather, they appear normal because they recruit the original episode, retrieve it from memory, and are interpreted in conjunction with it.
一个观察者 在一家时髦的餐厅里随意观察邻桌的客人,注意到第一个品尝汤的客人抽噎着,似乎很痛苦。 众多事件的正常性将因这一事件而改变。现在,第一次 尝到汤的客人在被服务员碰触时猛然一惊也就不足为奇了;另一位客人在品尝同一个汤锅里的汤时扼腕叹息也就不足为奇了。这些事件和其他许多事件看起来比其他情况下更正常,但不一定是因为它们证实了预先的期望。相反,它们看起来很正常,因为它们招募了原来的情节,从记忆中检索出来,并且 ,与它一起被解释。
Imagine yourself the observer at the restaurant. You were surprised by the first guest’s unusual reaction to the soup, and surprised again by the startled response to the waiter’s touch. However, the second abnormal event will retrieve the first from memory, and both make sense together. The two events fit into a pattern, in which the guest is an exceptionally tense person. On the other hand, if the next thing that happens after the first guest’s grimace is that another customer rejects the soup, these two surprises will be linked and the soup will surely be blamed.
想象自己是餐厅的观察者。你对第一位客人对汤的不寻常反应感到惊讶,对服务员的触摸的惊愕反应再次感到惊讶。然而,第二个异常事件会将第一个事件从记忆中找出来,两者结合起来才有意义。这两个事件符合一个模式,在这个模式中,客人是一个特别 紧张的人。另一方面,如果在第一位客人面无表情之后,接下来发生的事情是另一位客人拒绝喝汤,这两个意外就会被联系起来,汤肯定会受到指责。
“How many animals of each kind did Moses take into the ark?” The number of people who detect what is wrong with this question is so small that it has been dubbed the “Moses illusion.” Moses took no animals into the ark; Noah did. Like the incident of the wincing soup eater, the Moses illusion is readily explained by norm theory. The idea of animals going into the ark sets up a biblical context, and Moses is not abnormal in that context. You did not positively expect him, but the mention of his name is not surprising. It also helps that Moses and Noah have the same vowel sound and number of syllables. As with the triads that produce cognitive ease, you unconsciously detect associative coherence between “Moses” and “ark” and so quickly accept the question. Replace Moses with George W. Bush in this sentence and you will have a poor political joke but no illusion.
"摩西把每种动物带进方舟的数量是多少?"检测出这个问题有什么问题的人非常少,以至于被戏称为 "摩西幻觉"。摩西 ,没有带动物进入方舟;诺亚却带了。就像吃汤的人抽噎的事件一样,摩西的幻觉很容易用规范理论来解释。动物进入方舟的想法设置了一个圣经背景,摩西在这个背景下并不反常。 你没有积极地期待他,但提到他的名字并不令人惊讶。摩西和诺亚有相同的元音和 音节数,这也很有帮助。就像产生认知轻松的三段论一样,你无意识地检测到 "摩西 "和 "方舟 "之间的联想一致性,所以很快就接受了这个问题。在这个句子中用小布什代替摩西,你会有一个糟糕的政治笑话,但没有幻象。
When something cement does not fit into the current context of activated ideas, the system detects an abnormality, as you just experienced. You had no particular idea of what was coming after something , but you knew when the word cement came that it was abnormal in that sentence. Studies of brain responses have shown that violations of normality are detected with astonishing speed and subtlety. In a recent experiment, people heard the sentence “Earth revolves around the trouble every year.” A distinctive pattern was detected in brain activity, starting within two-tenths of a second of the onset of the odd word. Even more remarkable, the same brain response occurs at the same speed when a male voice says, “I believe I am pregnant because I feel sick every morning,” or when an upper-class voice says, “I have a large tattoo on my back .” A vast amount of world knowledge must instantly be brought to bear for the incongruity to be recognized: the voice must be identified as upper-class English and confronted with the generalization that large tattoos are uncommon in the upper class.
当 水泥的东西不符合当前被激活的想法的背景时,系统会检测到一个异常, ,就像你刚刚经历的那样。你没有特别的想法,不知道 什么东西 后面是什么,但当 水泥 这个词出现时,你知道它在这个句子中是不正常的。对大脑反应的研究表明,违反常态的行为会以惊人的速度和微妙的方式被检测出来。 在最近的一个实验中,人们听到了 "地球每年都在围绕麻烦 "的句子。在大脑活动中检测到一种独特的模式 ,在奇数词开始的十分之二秒内开始。更难能可贵的是,当一个男性的声音说:"我相信我怀孕了,因为我每天早上都觉得不舒服",或者当一个上流社会的声音说:"我 的背上 有一个很大的 纹身 "时,大脑会以同样的速度发生反应。必须立即将大量的世界知识用于 ,才能认识到这种不协调:必须将这个声音确定为上流社会的英语,并面对大面积纹身在上流社会不常见的概括。
We are able to communicate with each other because our knowledge of the world and our use of words are largely shared. When I mention a table, without specifying further, you understand that I mean a normal table. You know with certainty that its surface is approximately level and that it has far fewer than 25 legs. We have norms for a vast number of categories, and these norms provide the background for the immediate detection of anomalies such as pregnant men and tattooed aristocrats.
我们能够相互交流,因为我们对世界的认识和对语言的使用在很大程度上是共享的。当我提到一张桌子时,没有进一步说明,你就会明白我指的是一张正常的桌子。你 ,肯定地知道它的表面是近似水平的,而且它的腿远少于25条。 我们有大量类别的 规范 ,这些规范为立即发现异常情况提供了背景,如怀孕的男人和有纹身的贵族。
To appreciate the role of norms in communication, consider the sentence “The large mouse climbed over the trunk of the very small elephant.” I can count on your having norms for the size of mice and elephants that are not too far from mine. The norms specify a typical or average size for these animals, and they also contain information about the range or variability within the category. It is very unlikely that either of us got the image in our mind’s eye of a mouse larger than an elephant striding over an elephant smaller than a mouse. Instead, we each separately but jointly visualized a mouse smaller than a shoe clambering over an elephant larger than a sofa. System 1, which understands language, has access to norms of categories, which specify the range of plausible values as well as the most typical cases.
为了体会规范在交流中的作用,请考虑 "大老鼠爬上了非常 小象的树干 "这句话。我可以指望你对老鼠和大象的大小有规范,与我的规范相差不大。这些规范规定了这些动物的典型或平均尺寸,它们还包含关于该类别内的范围或变异性的信息。我们中的任何一个人都不太可能在脑海中得到这样的画面:一只比大象还大的老鼠大步走过一只比老鼠还小的大象 。相反,我们每个人都分别但共同想象出一只比鞋子还小的老鼠爬上了比沙发还大的大象。懂得语言的系统1可以接触到类别的规范,这些规范规定了合理的数值范围以及最典型的情况。
SEEING CAUSES AND INTENTIONS
看清原因和意图
“Fred’s parents arrived late. The caterers were expected soon. Fred was angry.” You know why Fred was angry, and it is not because the caterers were expected soon. In your network of associations, anger and lack of punctuality are linked as an effect and its possible cause, but there is no such link between anger and the idea of expecting caterers. A coherent story was instantly constructed as you read; you immediately knew the cause of Fred’s anger. Finding such causal connections is part of understanding a story and is an automatic operation of System 1. System 2, your conscious self, was offered the causal interpretation and accepted it.
"弗雷德的父母很晚才到。餐饮业者很快就会来了。弗雷德 ,他很生气。"你知道弗雷德为什么生气,并不是因为餐饮业者很快就会被期待。在你的联想网络中,生气和缺乏 准时性作为一种结果及其可能的原因被联系起来,但在生气和期待餐饮业者的想法之间没有这种联系。当你阅读时,一个连贯的故事立即被构建起来;你立即知道弗雷德愤怒的原因。找到这样的 因果联系是理解一个故事的一部分,是系统1的自动操作。系统2,你的意识自我,被提供了因果解释并接受了它。
A story in Nassim Taleb’s The Black Swan illustrates this automatic search for causality. He reports that bond prices initially rose on the day of Saddam Hussein’s capture in his hiding place in Iraq. Investors were apparently seeking safer assets that morning, and the Bloomberg News service flashed this headline: U.S. TREASURIES RISE; HUSSEIN CAPTURE MAY NOT CURB TERRORISM . Half an hour later, bond prices fell back and the revised headline read: U.S. TREASURIES FALL; HUSSEIN CAPTURE BOOSTS ALLURE OF RISKY ASSETS . Obviously, Hussein’s capture was the major event of the day, and because of the way the automatic search for causes shapes our thinking, that event was destined to be the explanation of whatever happened in the market on that day. The two headlines look superficially like explanations of what happened in the market, but a statement that can explain two contradictory outcomes explains nothing at all. In fact, all the headlines do is satisfy our need for coherence: a large event is supposed to have consequences, and consequences need causes to explain them. We have limited information about what happened on a day, and System 1 is adept at finding a coherent causal story that links the fragments of knowledge at its disposal.
纳西姆-塔勒布的 《黑天鹅》 中的一个故事说明了这种自动寻找因果关系的做法。 他报告说,在萨达姆-侯赛因在伊拉克的藏身之处被抓获的那天,债券价格最初上涨。当天上午,投资者在 ,显然是在寻求更安全的资产,彭博新闻服务部闪现了这样的标题: 美国国债上涨;侯赛因被俘可能不会遏制恐怖活动 。半小时后,债券价格回落,修改后的标题是: 美国国债下跌;侯赛因被俘提高了风险资产的吸引力 。显然,侯赛因被俘是当天的主要事件,由于自动 ,寻找原因的方式塑造了我们的思维,这一事件注定要成为当天市场上发生的任何事件的解释。这两个标题从表面上看像是对市场上发生的事情的解释,但一个能解释两个矛盾结果的声明根本就不能解释什么。 事实上,这些标题所做的只是满足我们对连贯性的需求:一个大事件应该 ,有后果,而后果需要原因来解释。我们对某一天发生的事情信息有限,而系统1善于找到一个连贯的因果故事,把它所掌握的碎片化的知识联系起来。
Read this sentence:
阅读这句话。
After spending a day exploring beautiful sights in the crowded streets of New York, Jane discovered that her wallet was missing.
在纽约拥挤的街道上花了一天时间探索美丽的景点后,简发现她的钱包不见了。
When people who had read this brief story (along with many others) were given a surprise recall test, the word pickpocket was more strongly associated with the story than the word sights , even though the latter was actually in the sentence while the former was not. The rules of associative coherence tell us what happened. The event of a lost wallet could evoke many different causes: the wallet slipped out of a pocket, was left in the restaurant, etc. However, when the ideas of lost wallet, New York, and crowds are juxtaposed, they jointly evoke the explanation that a pickpocket caused the loss. In the story of the startling soup, the outcome—whether another customer wincing at the taste of the soup or the first person’s extreme reaction to the waiter’s touch—brings about an associatively coherent interpretation of the initial surprise, completing a plausible story.
当 读过这个简短故事的人(以及其他许多人)接受惊喜回忆测试时, 扒手 这个词 与这个故事的联系比 景点 这个词更强烈,尽管后者实际上是在句子中,而前者却不是。联想一致性的规则告诉我们发生了什么。 丢失钱包的事件可以唤起许多不同的原因:钱包从口袋里滑落 ,被遗弃在餐馆里,等等。然而,当丢失钱包、纽约和人群的概念并列时,它们共同唤起了扒手造成损失的解释。在惊奇的汤的故事中,结果--不管是另一个顾客对汤的味道感到畏惧,还是第一个人对 服务员的触摸的极端反应--都带来了对最初惊奇的联想连贯的 解释,完成了一个合理的故事。
The aristocratic Belgian psychologist Albert Michotte published a book in 1945 (translated into English in 1963) that overturned centuries of thinking about causality, going back at least to Hume’s examination of the association of ideas. The commonly accepted wisdom was that we infer physical causality from repeated observations of correlations among events. We have had myriad experiences in which we saw one object in motion touching another object, which immediately starts to move, often (but not always) in the same direction. This is what happens when a billiard ball hits another, and it is also what happens when you knock over a vase by brushing against it. Michotte had a different idea: he argued that we see causality, just as directly as we see color. To make his point, he created episodes in which a black square drawn on paper is seen in motion; it comes into contact with another square, which immediately begins to move. The observers know that there is no real physical contact, but they nevertheless have a powerful “illusion of causality.” If the second object starts moving instantly, they describe it as having been “launched” by the first. Experiments have shown that six-month-old infants see the sequence of events as a cause-effect scenario, and they indicate surprise when the sequence is altered. We are evidently ready from birth to have impressions of causality, which do not depend on reasoning about patterns of causation. They are products of System 1.
比利时贵族心理学家Albert Michotte在1945年出版了一本书(1963年被翻译成英文),推翻了几个世纪以来关于因果关系的思考,至少可以追溯到休谟对观念关联的研究。 普遍接受的智慧是,我们从反复观察 ,推断出事件之间的相关性的物理因果关系。我们有无数的经验,我们看到一个运动中的物体接触到另一个物体,后者立即开始运动,往往(但不总是)朝同一方向运动。 这就是台球撞到另一个球时发生的情况,也是你用手拂过花瓶时发生的情况。 Michotte有一个不同的想法:他认为,我们 看到 因果关系, ,就像我们看到颜色一样直接。为了说明他的观点,他创造了一些情节,在这些情节中,人们看到一个画在纸上的黑色方块在运动;它与另一个方块接触,后者立即开始移动。观察者知道没有真正的物理接触,但他们还是有一个强大的 "因果关系的幻觉"。如果第二个物体瞬间开始移动,他们将其描述为: ,被第一个物体 "发射"。实验表明,6个月大的婴儿把事件的顺序看成是因果关系,当顺序被改变时,他们会 表示惊讶 。 显然,我们从出生起就准备好了对因果关系的 印象 ,这并不取决于对因果关系模式的推理。 它们是系统1的产物。
In 1944, at about the same time as Michotte published his demonstrations of physical causality, the psychologists Fritz Heider and Mary-Ann Simmel used a method similar to Michotte’s to demonstrate the perception of intentional causality. They made a film, which lasts all of one minute and forty seconds, in which you see a large triangle, a small triangle, and a circle moving around a shape that looks like a schematic view of a house with an open door. Viewers see an aggressive large triangle bullying a smaller triangle, a terrified circle, the circle and the small triangle joining forces to defeat the bully; they also observe much interaction around a door and then an explosive finale . The perception of intention and emotion is irresistible; only people afflicted by autism do not experience it. All this is entirely in your mind, of course. Your mind is ready and even eager to identify agents, assign them personality traits and specific intentions, and view their actions as expressing individual propensities. Here again, the evidence is that we are born prepared to make intentional attributions: infants under one year old identify bullies and victims , and expect a pursuer to follow the most direct path in attempting to catch whatever it is chasing.
1944年,就在 Michotte发表他的物理因果关系证明的同时,心理学家Fritz Heider和Mary-Ann Simmel用一种与Michotte类似的方法来证明 有意的 因果关系的感知。他们拍了一部电影,持续了1分40秒,在电影中你看到一个大三角形、一个小三角形和一个圆形围绕着一个形状移动,看起来像是一栋 ,门打开的房子的示意图。观众看到一个咄咄逼人的大三角形欺负一个较小的三角形,一个惊恐的圆圈,圆圈和小三角形联手打败了恶霸;他们还观察到在一扇门周围的许多互动,然后是 爆炸性的结局 。对意图和情感的感知是不可抗拒的;只有受自闭症困扰的人没有这种体验。当然,所有这些完全是在你的头脑中 。你的头脑已经准备好,甚至渴望识别代理人,赋予他们人格特征和具体意图,并将他们的行为视为表达个人倾向。 在这里,证据再次表明,我们天生就准备好进行有意的归因:一岁以下的婴儿 ,识别欺凌者和受害者 ,并期望追赶者遵循最直接的路径, ,试图抓住它正在追逐的东西。
The experience of freely willed action is quite separate from physical causality. Although it is your hand that picks up the salt, you do not think of the event in terms of a chain of physical causation. You experience it as caused by a decision that a disembodied you made, because you wanted to add salt to your food. Many people find it natural to describe their soul as the source and the cause of their actions. The psychologist Paul Bloom, writing in The Atlantic in 2005, presented the provocative claim that our inborn readiness to separate physical and intentional causality explains the near universality of religious beliefs. He observes that “we perceive the world of objects as essentially separate from the world of minds, making it possible for us to envision soulless bodies and bodiless souls.” The two modes of causation that we are set to perceive make it natural for us to accept the two central beliefs of many religions: an immaterial divinity is the ultimate cause of the physical world, and immortal souls temporarily control our bodies while we live and leave them behind as we die . In Bloom’s view, the two concepts of causality were shaped separately by evolutionary forces, building the origins of religion into the structure of System 1.
自由意志行动的经验与物理因果关系是完全分开的。 尽管是你的手拿起了盐,但你并没有从物理因果关系链的角度来考虑这一事件。 你的经验是由一个没有实体的 你 所做的决定引起的,因为你想在食物中加盐。许多人发现, ,将自己的灵魂描述为自己行为的来源和原因是很自然的。心理学家保罗-布鲁姆(Paul Bloom)2005年在 《大西洋》杂志 上撰文,提出了一个具有挑衅性的主张,即我们天生准备将身体和意图的因果关系分开,这解释了宗教信仰的几乎普遍性。 他观察到,"我们认为物体的世界与思想的世界基本上是分开的,使我们有可能设想无灵魂的身体和无身体的灵魂 。"我们被设定为感知的两种因果关系模式使我们很自然地接受了许多宗教的两个核心信仰:非物质的神性是物质世界的最终原因,不朽的灵魂在我们活着的时候暂时控制我们的身体, 在我们死亡的时候 离开它们。在布鲁姆看来, 因果关系的两个概念是由进化的力量分别塑造的,将宗教的起源构建在系统1的结构中。
The prominence of causal intuitions is a recurrent theme in this book because people are prone to apply causal thinking inappropriately, to situations that require statistical reasoning. Statistical thinking derives conclusions about individual cases from properties of categories and ensembles. Unfortunately, System 1 does not have the capability for this mode of reasoning; System 2 can learn to think statistically, but few people receive the necessary training.
因果直觉的突出是本书的一个反复出现的主题,因为人们很容易将因果思维不适当地应用于需要统计推理的情况。统计学思维从类别的属性推导出关于个别案例的结论 和集合。不幸的是,系统1没有这种推理模式的能力;系统2可以学习统计学思维,但很少有人接受必要的培训。
The psychology of causality was the basis of my decision to describe psychological processes by metaphors of agency, with little concern for consistency. I sometimes refer to System 1 as an agent with certain traits and preferences, and sometimes as an associative machine that represents reality by a complex pattern of links. The system and the machine are fictions; my reason for using them is that they fit the way we think about causes. Heider’s triangles and circles are not really agents—it is just very easy and natural to think of them that way. It is a matter of mental economy. I assume that you (like me) find it easier to think about the mind if we describe what happens in terms of traits and intentions (the two systems) and sometimes in terms of mechanical regularities (the associative machine). I do not intend to convince you that the systems are real, any more than Heider intended you to believe that the large triangle is really a bully.
因果关系的心理学是我决定用代理的隐喻来描述心理过程的基础,几乎不关心一致性。我有时把系统1称为具有某些 特质和偏好的代理,有时称为通过复杂的联系模式来代表现实的联想机器。系统和机器是虚构的;我使用它们的原因是它们符合我们思考原因的方式。 海德的三角形和圆圈并不是真正的代理人--只是这样思考它们非常容易和自然。 这是一个心理经济问题。我假设你(像 我)发现,如果我们用特征和意图(两个系统),有时用机械规律性(联想机器)来描述所发生的事情,就会更容易思考心灵。我并不打算说服你这些系统是真实的,任何 ,就像海德打算让你相信大三角真的是一个恶棍一样。
SPEAKING OF NORMS AND CAUSES
说到规范和原因
“When the second applicant also turned out to be an old friend of mine, I wasn’t quite as surprised. Very little repetition is needed for a new experience to feel normal!”
"当第二个申请者 ,也变成了我的一个老朋友时,我就不太惊讶了。 很少有重复,新的经验就会感觉正常!"
“When we survey the reaction to these products, let’s make sure we don’t focus exclusively on the average. We should consider the entire range of normal reactions.”
"当我们调查这些产品的反应时,让我们确保不要只关注平均水平。 我们应该考虑整个正常反应的范围。"
“She can’t accept that she was just unlucky; she needs a causal story. She will end up thinking that someone intentionally sabotaged her work.”
"她不能接受她只是不走运;她需要一个因果关系的故事。她最后会 ,认为有人故意破坏她的工作。"
7
7
A Machine for Jumping to Conclusions
妄下结论的机器
The great comedian Danny Kaye had a line that has stayed with me since my adolescence. Speaking of a woman he dislikes, he says, “Her favorite position is beside herself, and her favorite sport is jumping to conclusions.” The line came up, I remember, in the initial conversation with Amos Tversky about the rationality of statistical intuitions, and now I believe it offers an apt description of how System 1 functions. Jumping to conclusions is efficient if the conclusions are likely to be correct and the costs of an occasional mistake acceptable, and if the jump saves much time and effort. Jumping to conclusions is risky when the situation is unfamiliar, the stakes are high, and there is no time to collect more information. These are the circumstances in which intuitive errors are probable, which may be prevented by a deliberate intervention of System 2.
伟大的喜剧演员丹尼-凯(Danny Kaye)有一句台词,从我的青春期开始就一直伴随着我。谈到他不喜欢的一个女人,他说:"她最喜欢的姿势是在自己身边,而她最喜欢的运动 ,就是跳出结论"。我记得这句话是在最初与阿莫斯-特沃斯基关于统计直觉的合理性的谈话中出现的,现在我相信它对系统1的功能提供了一个恰当的描述。 如果结论可能是正确的,偶尔出错的代价是可以接受的,而且如果跳跃可以节省很多时间和精力,那么跳到结论是有效的。 当情况不熟悉,风险很大,而且没有时间收集更多的信息时,贸然下结论是有风险的。 这些情况下,直觉上的错误是可能的,而系统2的有意干预可以防止这种错误。
NEGLECT OF AMBIGUITY AND SUPPRESSION OF DOUBT
忽视模糊性和压制疑问

Figure 6
图6
What do the three exhibits in figure 6 have in common? The answer is that all are ambiguous. You almost certainly read the display on the left as A B C and the one on the right as 12 13 14, but the middle items in both displays are identical. You could just as well have read them as A 13 C or 12 B 14, but you did not. Why not? The same shape is read as a letter in a context of letters and as a number in a context of numbers. The entire context helps determine the interpretation of each element. The shape is ambiguous, but you jump to a conclusion about its identity and do not become aware of the ambiguity that was resolved.
图6 中的三个展品有什么共同点?答案是,所有这些都是模糊的。你几乎可以肯定,左边的显示是A B C ,右边的显示是12 13 14,但两个显示中的中间项目是相同的。你也可以把它们读成A 13 C或12 B 14,但你没有这样做。 为什么不呢?同样的形状在字母 ,在数字的背景下被读作一个字母,在数字的背景下被读作一个数字。整个背景有助于确定每个元素的解释。 形状是模糊的,但你对它的身份下了结论,并没有意识到被解决的模糊性。
As for Ann, you probably imagined a woman with money on her mind, walking toward a building with tellers and secure vaults. But this plausible interpretation is not the only possible one; the sentence is ambiguous. If an earlier sentence had been “They were floating gently down the river,” you would have imagined an altogether different scene. When you have just been thinking of a river, the word bank is not associated with money. In the absence of an explicit context, System 1 generated a likely context on its own. We know that it is System 1 because you were not aware of the choice or of the possibility of another interpretation. Unless you have been canoeing recently, you probably spend more time going to banks than floating on rivers, and you resolved the ambiguity accordingly. When uncertain, System 1 bets on an answer, and the bets are guided by experience. The rules of the betting are intelligent: recent events and the current context have the most weight in determining an interpretation. When no recent event comes to mind, more distant memories govern. Among your earliest and most memorable experiences was singing your ABCs; you did not sing your A13Cs.
至于Ann,你可能想象到一个满脑子都是钱的女人,走向一栋有出纳员和安全金库的大楼。但这种看似合理的解释 ,并不是唯一可能的解释;这句话有歧义。如果先前的句子是 "他们顺着河水轻轻地漂流",你会想象出完全不同的场景。 当你刚刚想到一条河的时候, 银行 这个词就不会和钱联系起来。在没有明确语境的情况下,系统1自行生成了一个可能的语境。我们知道是系统1,因为 ,你没有意识到这个选择或另一种解释的可能性。除非你最近去划独木舟,否则你可能花更多的时间去银行而不是在河流上漂流,你相应地解决了这个模糊性。 当不确定时,系统1押注于一个答案,而押注是由经验指导。赌博的规则是聪明的:最近的事件和当前的背景 ,在决定解释时具有最大的权重。当没有最近的事件出现在脑海中时,更遥远的记忆就会占据主导地位。 在你最早和最难忘的经历中,是唱你的ABC;你并没有唱你的A13Cs。
The most important aspect of both examples is that a definite choice was made, but you did not know it. Only one interpretation came to mind, and you were never aware of the ambiguity. System 1 does not keep track of alternatives that it rejects, or even of the fact that there were alternatives. Conscious doubt is not in the repertoire of System 1; it requires maintaining incompatible interpretations in mind at the same time, which demands mental effort. Uncertainty and doubt are the domain of System 2.
这两个例子中最重要的一点是,已经做出了明确的选择,但你并不知道。只有一种解释出现在脑海中,而你从来没有意识到 的模糊性。系统1不会记录它所拒绝的替代方案,甚至不会记录存在替代方案的事实。有意识的怀疑不在系统1的剧目中;它需要在脑海中同时保持不相容的解释,这需要心理努力。不确定性和怀疑是系统2的领域。
A BIAS TO BELIEVE AND CONFIRM
倾向于相信和确认
The psychologist Daniel Gilbert, widely known as the author of Stumbling on Happiness , once wrote an essay, titled “How Mental Systems Believe,” in which he developed a theory of believing and unbelieving that he traced to the seventeenth-century philosopher Baruch Spinoza. Gilbert proposed that understanding a statement must begin with an attempt to believe it: you must first know what the idea would mean if it were true. Only then can you decide whether or not to unbelieve it. The initial attempt to believe is an automatic operation of System 1, which involves the construction of the best possible interpretation of the situation. Even a nonsensical statement, Gilbert argues, will evoke initial belief. Try his example: “whitefish eat candy.” You probably were aware of vague impressions of fish and candy as an automatic process of associative memory searched for links between the two ideas that would make sense of the nonsense.
心理学家丹尼尔-吉尔伯特( )作为《 幸福的绊脚石 》的作者而广为人知,他曾经写过一篇文章,题目是 "心理系统如何相信",其中他提出了一个关于相信和不相信的理论,他追溯到17世纪的哲学家巴鲁克-斯宾诺莎。吉尔伯特提出 ,理解一个声明必须从试图相信它开始:你必须首先知道如果这个想法是真的,它将意味着什么。只有 ,然后你才能决定是否 不相信 它。相信的最初尝试是系统1的自动操作,它涉及构建对情况的最佳解释。 吉尔伯特认为,即使是一个无意义的声明,也会唤起最初的相信。试试他的例子:"白鱼吃糖"。你可能意识到对鱼和糖果的模糊印象,因为联想记忆的自动 ,在这两个想法之间寻找联系,从而使这些无稽之谈有意义。
Gilbert sees unbelieving as an operation of System 2, and he reported an elegant experiment to make his point. The participants saw nonsensical assertions, such as “a dinca is a flame,” followed after a few seconds by a single word, “true” or “false.” They were later tested for their memory of which sentences had been labeled “true.” In one condition of the experiment subjects were required to hold digits in memory during the task. The disruption of System 2 had a selective effect: it made it difficult for people to “unbelieve” false sentences. In a later test of memory, the depleted participants ended up thinking that many of the false sentences were true. The moral is significant: when System 2 is otherwise engaged, we will believe almost anything. System 1 is gullible and biased to believe, System 2 is in charge of doubting and unbelieving, but System 2 is sometimes busy, and often lazy. Indeed, there is evidence that people are more likely to be influenced by empty persuasive messages, such as commercials, when they are tired and depleted.
吉尔伯特认为不相信是系统2的一种操作,他报告了一个 优雅的实验 来说明他的观点。参与者看到的是无意义的断言,如 "丁卡是一种火焰",几秒钟后是一个词,"真 "或 "假"。后来,他们被测试了 ,看他们对哪些句子被标记为 "真实 "的记忆。在实验的一个条件中,受试者被要求在任务中保持数字的记忆。系统2的中断有一个选择性的影响:它使人们很难 "不相信 "假句子。 在后来的记忆测试中,被耗尽的参与者最终认为许多假句子是真的。这个寓意很重要: ,当系统2以其他方式参与时,我们几乎会相信任何东西。系统1容易受骗,偏向于相信,系统2负责怀疑和不相信,但系统2有时很忙,经常偷懒。事实上,有证据表明,人们在疲惫不堪的时候更容易受到空洞的说服性信息的影响,如广告。
The operations of associative memory contribute to a general confirmation bias . When asked, “Is Sam friendly?” different instances of Sam’s behavior will come to mind than would if you had been asked “Is Sam unfriendly?” A deliberate search for confirming evidence, known as positive test strategy , is also how System 2 tests a hypothesis. Contrary to the rules of philosophers of science, who advise testing hypotheses by trying to refute them, people (and scientists, quite often) seek data that are likely to be compatible with the beliefs they currently hold. The confirmatory bias of System 1 favors uncritical acceptance of suggestions and exaggeration of the likelihood of extreme and improbable events. If you are asked about the probability of a tsunami hitting California within the next thirty years, the images that come to your mind are likely to be images of tsunamis, in the manner Gilbert proposed for nonsense statements such as “whitefish eat candy.” You will be prone to overestimate the probability of a disaster.
联想式 记忆的运作促成了普遍的 确认偏差 。当被问及 "山姆是否友好?"时,与被问及 "山姆是否不友好?"相比,你会想到山姆的不同行为实例。刻意寻找确认的证据,被称为 积极的测试策略 ,也是系统2测试假设的方式。与科学哲学家建议通过尝试 来反驳假设的规则相反,人们(和科学家,相当多的时候)寻求可能与他们当前持有的信念相一致的数据。系统1的确认性偏见倾向于不加批判地接受建议,夸大极端和不可能事件的可能性。如果问你未来三十年内海啸袭击加州的可能性, ,你脑海中浮现的图像很可能是海啸的图像,就像吉尔伯特为 "白鱼吃糖 "等无稽之谈提出的方式。你将容易高估灾难发生的概率。
EXAGGERATED EMOTIONAL COHERENCE (HALO EFFECT)
夸张的情感一致性(光环效应)。
If you like the president’s politics, you probably like his voice and his appearance as well. The tendency to like (or dislike) everything about a person—including things you have not observed—is known as the halo effect. The term has been in use in psychology for a century, but it has not come into wide use in everyday language. This is a pity, because the halo effect is a good name for a common bias that plays a large role in shaping our view of people and situations. It is one of the ways the representation of the world that System 1 generates is simpler and more coherent than the real thing.
如果你喜欢总统的政治,你可能也喜欢他的声音和他的外表。喜欢(或不喜欢) 一个人的一切,包括你没有观察到的东西,这种倾向被称为光环效应。这个术语在心理学中已经使用了一个世纪,但它在日常语言中还没有被广泛使用。这是一个遗憾,因为光环效应是一个常见偏见的好名字,它在塑造我们对人和情况的看法方面起着很大的作用。这是系统1 产生的世界表征比真实事物更简单、更一致的方式之一。
You meet a woman named Joan at a party and find her personable and easy to talk to. Now her name comes up as someone who could be asked to contribute to a charity. What do you know about Joan’s generosity? The correct answer is that you know virtually nothing, because there is little reason to believe that people who are agreeable in social situations are also generous contributors to charities. But you like Joan and you will retrieve the feeling of liking her when you think of her. You also like generosity and generous people. By association, you are now predisposed to believe that Joan is generous. And now that you believe she is generous, you probably like Joan even better than you did earlier, because you have added generosity to her pleasant attributes.
你在一个聚会上遇到一个叫琼的女人,发现她很有个性,很好说话。现在她的名字出现了,是可以被要求向慈善机构捐款的人。 你对琼的慷慨大方了解多少?正确的答案是,你几乎什么都不知道,因为没有什么理由相信在 社交场合表现得合群的人也会对慈善机构慷慨解囊。 但你喜欢琼,当你想到她时,你会找回喜欢她的感觉。你也喜欢慷慨和慷慨的人。 通过联系,你现在倾向于相信琼是慷慨的。而现在你相信她是慷慨的,你可能比之前更喜欢琼,因为你在她令人愉快的属性中增加了慷慨的 。
Real evidence of generosity is missing in the story of Joan, and the gap is filled by a guess that fits one’s emotional response to her. In other situations, evidence accumulates gradually and the interpretation is shaped by the emotion attached to the first impression. In an enduring classic of psychology, Solomon Asch presented descriptions of two people and asked for comments on their personality. What do you think of Alan and Ben?
琼的故事中缺少真正的慷慨证据,而这个空白被符合人们对她的情感反应的猜测所填补。在其他情况下,证据会逐渐积累,而解释则是由第一印象所附带的情感决定的。 在一个经久不衰的心理学经典中,所罗门-阿什提出了 对两个人的描述 ,并要求 ,对他们的个性进行评论。 你怎么看艾伦和本?
Alan: intelligent—industrious—impulsive—critical—stubborn—envious
艾伦:聪明的-工业的-冲动的-批评的-顽固的-嫉妒的
Ben: envious—stubborn—critical—impulsive—industrious—intelligent
本: 明显的-顽固的-批评的-冲动的-工业的-聪明的
If you are like most of us, you viewed Alan much more favorably than Ben. The initial traits in the list change the very meaning of the traits that appear later. The stubbornness of an intelligent person is seen as likely to be justified and may actually evoke respect, but intelligence in an envious and stubborn person makes him more dangerous. The halo effect is also an example of suppressed ambiguity: like the word bank , the adjective stubborn is ambiguous and will be interpreted in a way that makes it coherent with the context.
如果你像我们大多数人一样,你对艾伦的看法要比本好得多。列表中最初的特征改变了后面出现的特征的含义。一个聪明的 者的固执被认为可能是合理的,实际上可能唤起人们的尊重,但聪明的人在一个嫉妒和固执的人身上会使他更加危险。晕轮效应也是一个被抑制的模糊性的例子:像 银行 这个词一样,形容词 stubborn 也是模糊的,会被解释成与上下文一致的方式。
There have been many variations on this research theme. Participants in one study first considered the first three adjectives that describe Alan; then they considered the last three, which belonged, they were told, to another person. When they had imagined the two individuals, the participants were asked if it was plausible for all six adjectives to describe the same person, and most of them thought it was impossible!
,这个研究主题有很多变化。 一项研究的参与者首先考虑描述艾伦的前三个形容词;然后他们考虑后三个形容词,他们被告知这属于另一个人。当他们想象出这两个人后,参与者被问及 所有六个形容词 描述同一个人是否合理,他们中的大多数人认为这是不可能的!
The sequence in which we observe characteristics of a person is often determined by chance. Sequence matters, however, because the halo effect increases the weight of first impressions, sometimes to the point that subsequent information is mostly wasted. Early in my career as a professor, I graded students’ essay exams in the conventional way. I would pick up one test booklet at a time and read all that student’s essays in immediate succession, grading them as I went. I would then compute the total and go on to the next student. I eventually noticed that my evaluations of the essays in each booklet were strikingly homogeneous. I began to suspect that my grading exhibited a halo effect, and that the first question I scored had a disproportionate effect on the overall grade. The mechanism was simple: if I had given a high score to the first essay, I gave the student the benefit of the doubt whenever I encountered a vague or ambiguous statement later on. This seemed reasonable. Surely a student who had done so well on the first essay would not make a foolish mistake in the second one! But there was a serious problem with my way of doing things. If a student had written two essays, one strong and one weak, I would end up with different final grades depending on which essay I read first. I had told the students that the two essays had equal weight, but that was not true: the first one had a much greater impact on the final grade than the second. This was unacceptable.
我们观察一个人的 特征的顺序往往是由机会决定的。然而,顺序很重要,因为光环效应会增加第一印象的分量,有时甚至会导致后续信息的浪费。 在我担任教授的早期,我以传统的方式给学生的作文考试打分。 我会一次拿起一本试卷,立即阅读该学生的所有作文, ,边读边打分。然后我就会计算总数,然后再去找下一个学生。我最终注意到,我对每本小册子中的文章的评价是惊人的同质化。我开始怀疑我的评分表现出了光环效应,我打分的第一个问题对总分的影响不成比例。其机制很简单:如果我给第一篇文章打了高分 ,那么以后每当我遇到含糊不清或模棱两可的陈述时,我就会给该学生以怀疑的好处。当然,一个在第一篇作文中表现出色的学生不会在第二篇作文中犯愚蠢的错误!但我的做事方式有一个严重的问题。如果一个学生写了两篇文章,一篇很强,一篇很弱,我就会 ,最后的成绩也不一样,这取决于我先读哪篇文章。我曾告诉学生,这两篇论文的分量相同,但事实并非如此:第一篇论文对最终成绩的影响比第二篇大得多。这是不可以接受的。
I adopted a new procedure. Instead of reading the booklets in sequence, I read and scored all the students’ answers to the first question, then went on to the next one. I made sure to write all the scores on the inside back page of the booklet so that I would not be biased (even unconsciously) when I read the second essay. Soon after switching to the new method, I made a disconcerting observation: my confidence in my grading was now much lower than it had been. The reason was that I frequently experienced a discomfort that was new to me. When I was disappointed with a student’s second essay and went to the back page of the booklet to enter a poor grade, I occasionally discovered that I had given a top grade to the same student’s first essay. I also noticed that I was tempted to reduce the discrepancy by changing the grade that I had not yet written down, and found it hard to follow the simple rule of never yielding to that temptation. My grades for the essays of a single student often varied over a considerable range. The lack of coherence left me uncertain and frustrated.
我采用了一个新的程序。我没有按顺序阅读小册子,而是阅读了所有学生对第一个问题的回答并打分, ,然后再继续阅读下一个问题。我确保将所有的分数写在小册子的内页背面,这样我在阅读第二篇论文时就不会有偏见(甚至是无意识的)。在改用新方法后不久,我就发现了一个令人不安的现象:我对自己的评分的信心现在比以前低了很多。原因是我经常经历一种对我来说是新的 的不适感。当我对一个学生的第二篇作文感到失望,并在小册子的后面一页输入一个差的分数时,我偶尔会发现,我给同一个学生的第一篇作文打了一个最高分。我还注意到,我很想通过改变尚未写下的成绩来减少差异,并发现很难 ,即永远不要屈服于这种 的诱惑。我对一个学生的作文的成绩经常在相当大的范围内变化。缺乏连贯性使我感到不确定和沮丧。
I was now less happy with and less confident in my grades than I had been earlier, but I recognized that this was a good sign, an indication that the new procedure was superior. The consistency I had enjoyed earlier was spurious; it produced a feeling of cognitive ease, and my System 2 was happy to lazily accept the final grade. By allowing myself to be strongly influenced by the first question in evaluating subsequent ones, I spared myself the dissonance of finding the same student doing very well on some questions and badly on others. The uncomfortable inconsistency that was revealed when I switched to the new procedure was real: it reflected both the inadequacy of any single question as a measure of what the student knew and the unreliability of my own grading.
我现在对自己的成绩没有以前那么满意,也没有那么自信,但我认识到这是个好兆头,说明新的程序很有优势。我之前享受的一致性是虚假的;它产生了 一种认知上的轻松感,我的系统2很乐意懒洋洋地接受最终成绩。通过允许自己在评估后续问题时受到第一个问题的强烈影响,我避免了发现同一个学生在某些问题上做得很好而在其他问题上做得很差的不和谐现象。当我改用新的程序时,那种令人不舒服的不一致是真实存在的:它 ,既反映了任何单一问题作为衡量学生知识的标准的不足,也反映了我自己评分的不可靠。
The procedure I adopted to tame the halo effect conforms to a general principle: decorrelate error! To understand how this principle works, imagine that a large number of observers are shown glass jars containing pennies and are challenged to estimate the number of pennies in each jar. As James Surowiecki explained in his best-selling The Wisdom of Crowds , this is the kind of task in which individuals do very poorly, but pools of individual judgments do remarkably well. Some individuals greatly overestimate the true number, others underestimate it, but when many judgments are averaged, the average tends to be quite accurate. The mechanism is straightforward: all individuals look at the same jar, and all their judgments have a common basis. On the other hand, the errors that individuals make are independent of the errors made by others, and (in the absence of a systematic bias) they tend to average to zero. However, the magic of error reduction works well only when the observations are independent and their errors uncorrelated. If the observers share a bias, the aggregation of judgments will not reduce it. Allowing the observers to influence each other effectively reduces the size of the sample, and with it the precision of the group estimate.
我所采用的驯服光环效应的程序符合一个一般原则:装饰误差!"。为了理解这一原理的作用,想象一下,让大量的观察者看到装有便士的玻璃罐,并让他们估计每个罐子里的便士数量 。正如詹姆斯-苏洛维茨基在他的畅销书 《 人群的智慧 》 中所解释的那样,这是一种个人做得很差的任务,但个人判断的集合体却做得非常好。有些人大大高估了真实的数字,有些人则低估了它,但当许多判断被平均化时,平均数往往是相当准确的。其机制很简单: ,所有的人都看着同一个罐子,他们的所有判断都有一个共同的基础。另一方面,个人所犯的错误与其他人的错误无关,而且(在没有系统性偏见的情况下)它们的平均值趋向于零。然而,只有当观察结果是独立的,且其误差不相关时,减少误差的魔法才会很有效。如果观察者 ,共享一个偏见,那么判断的汇总将不会减少它。允许观察者相互影响,可以有效地减少样本的大小,并随之减少群体估计的精确度。
To derive the most useful information from multiple sources of evidence, you should always try to make these sources independent of each other. This rule is part of good police procedure. When there are multiple witnesses to an event, they are not allowed to discuss it before giving their testimony. The goal is not only to prevent collusion by hostile witnesses, it is also to prevent unbiased witnesses from influencing each other. Witnesses who exchange their experiences will tend to make similar errors in their testimony, reducing the total value of the information they provide. Eliminating redundancy from your sources of information is always a good idea.
为了从多个证据来源中获得最有用的信息,你应该始终努力使这些来源相互独立。当 ,一个事件有多个证人,他们在提供证词之前不允许讨论这个事件。其目的不仅是为了防止敌对证人的串通,也是为了防止无偏见的证人相互影响。交流经验的证人在 ,他们的证词往往会出现类似的错误,从而降低他们所提供信息的总价值。从你的信息来源中消除 冗余,总是一个好主意。
The principle of independent judgments (and decorrelated errors) has immediate applications for the conduct of meetings, an activity in which executives in organizations spend a great deal of their working days. A simple rule can help: before an issue is discussed, all members of the committee should be asked to write a very brief summary of their position. This procedure makes good use of the value of the diversity of knowledge and opinion in the group. The standard practice of open discussion gives too much weight to the opinions of those who speak early and assertively, causing others to line up behind them.
独立判断(和装饰相关的错误)的原则对会议的进行有直接的应用,组织中的高管们在这种活动中花费了大量的工作日。 一个简单的规则可以帮助:在讨论一个问题之前,应该要求委员会的所有成员对他们的立场写一个非常简短的 。这个程序很好地利用了小组中知识和意见多样性的价值。公开讨论的标准做法对那些早期和自信地发言的人的意见给予过多的重视,导致其他人排在他们之后。
WHAT YOU SEE IS ALL THERE IS (WYSIATI)
眼见为实
One of my favorite memories of the early years of working with Amos is a comedy routine he enjoyed performing. In a perfect impersonation of one of the professors with whom he had studied philosophy as an undergraduate, Amos would growl in Hebrew marked by a thick German accent: “You must never forget the Primat of the Is .” What exactly his teacher had meant by that phrase never became clear to me (or to Amos, I believe), but Amos’s jokes always made a point. He was reminded of the old phrase (and eventually I was too) whenever we encountered the remarkable asymmetry between the ways our mind treats information that is currently available and information we do not have.
在早年与阿莫斯合作的过程中,我最喜欢的记忆之一是 他喜欢表演的喜剧节目。阿莫斯完美地模仿了他本科时学习哲学的一位教授,用带有浓重德国口音的希伯来语吼道:"你决不能忘记《 伊斯》的普里马特 。"他的老师对这句话的确切意思我从来没有弄清楚(我相信对阿莫斯来说也是如此),但阿莫斯的笑话总是有意义的。他 ,每当我们遇到我们的头脑对待目前可用的信息和我们没有的信息的方式之间的显著不对称时,就会想起这句老话(最终我也是)。
An essential design feature of the associative machine is that it represents only activated ideas. Information that is not retrieved (even unconsciously) from memory might as well not exist. System 1 excels at constructing the best possible story that incorporates ideas currently activated, but it does not (cannot) allow for information it does not have.
联想机的一个基本设计特点是,它只代表被激活的想法。没有从记忆中检索(甚至是无意识地)的信息可能就像不存在 。系统1擅长于构建尽可能好的故事,将目前激活的想法纳入其中,但它不(不能)允许它没有的信息存在。
The measure of success for System 1 is the coherence of the story it manages to create. The amount and quality of the data on which the story is based are largely irrelevant. When information is scarce, which is a common occurrence, System 1 operates as a machine for jumping to conclusions. Consider the following: “Will Mindik be a good leader? She is intelligent and strong …” An answer quickly came to your mind, and it was yes. You picked the best answer based on the very limited information available, but you jumped the gun. What if the next two adjectives were corrupt and cruel ?
衡量系统1成功的标准是它设法创造的故事的连贯性。当信息匮乏时(这是一种常见的情况), 系统1的运作就像一台跳出结论的机器。 请考虑以下情况:"Mindik会是一个好的领导者吗?她很聪明,很坚强......"一个答案很快出现在你的脑海中,是的。你根据非常有限的信息选择了最好的答案,但你跳过了枪。 如果接下来的两个形容词是 腐败 和 残忍 呢?
Take note of what you did not do as you briefly thought of Mindik as a leader. You did not start by asking, “What would I need to know before I formed an opinion about the quality of someone’s leadership?” System 1 got to work on its own from the first adjective: intelligent is good, intelligent and strong is very good. This is the best story that can be constructed from two adjectives, and System 1 delivered it with great cognitive ease. The story will be revised if new information comes in (such as Mindik is corrupt), but there is no waiting and no subjective discomfort. And there also remains a bias favoring the first impression.
请注意 ,你 没有 做什么,因为你 ,短暂地认为Mindik是一个领导者。你并没有一开始就问:"在我对某人的领导质量形成看法之前,我需要知道什么?"系统1从第一个形容词中得到了自己的工作:智能是好的,智能和强大是非常好的。这是由两个形容词构成的最好的故事,系统1在认知上非常轻松地提供了这个故事。 如果有新的信息(如Mindik是腐败的),故事会被修改,但没有等待,也没有主观上的不舒服。而且,也仍然存在着有利于第一印象的偏见。
The combination of a coherence-seeking System 1 with a lazy System 2 implies that System 2 will endorse many intuitive beliefs, which closely reflect the impressions generated by System 1. Of course, System 2 also is capable of a more systematic and careful approach to evidence, and of following a list of boxes that must be checked before making a decision—think of buying a home, when you deliberately seek information that you don’t have. However, System 1 is expected to influence even the more careful decisions. Its input never ceases.
寻求一致性的系统1与懒惰的系统2的结合,意味着系统2会认可许多直观的信念,这些信念密切反映了系统1所产生的印象。当然, ,系统2也能够对证据采取更系统、更谨慎的方法,并在做决定前遵循必须检查的方框清单--想想买房,你故意寻找你没有的信息。然而,系统1预计会影响甚至更谨慎的决定。 它的输入从未停止。
Jumping to conclusions on the basis of limited evidence is so important to an understanding of intuitive thinking, and comes up so often in this book, that I will use a cumbersome abbreviation for it: WYSIATI, which stands for what you see is all there is. System 1 is radically insensitive to both the quality and the quantity of the information that gives rise to impressions and intuitions.
在有限的证据基础上得出结论 ,这对理解直觉思维非常重要,而且在本书中经常出现,所以我将使用一个繁琐的缩写:WYSIATI,它代表你所看到的就是全部。 系统1对引起印象和直觉的信息的质量和数量都根本不敏感。
Amos, with two of his graduate students at Stanford, reported a study that bears directly on WYSIATI, by observing the reaction of people who are given one-sided evidence and know it. The participants were exposed to legal scenarios such as the following:
阿莫斯与他在斯坦福大学的两名研究生一起,报告了 ,这项研究直接关系到WYSIATI,通过观察那些得到 片面证据 并知道它的人的反应。与会者接触到的法律场景如下。
On September 3, plaintiff David Thornton, a forty-three-year-old union field representative, was present in Thrifty Drug Store #168, performing a routine union visit. Within ten minutes of his arrival, a store manager confronted him and told him he could no longer speak with the union employees on the floor of the store. Instead, he would have to see them in a back room while they were on break. Such a request is allowed by the union contract with Thrifty Drug but had never before been enforced. When Mr. Thornton objected, he was told that he had the choice of conforming to these requirements, leaving the store, or being arrested. At this point, Mr. Thornton indicated to the manager that he had always been allowed to speak to employees on the floor for as much as ten minutes, as long as no business was disrupted, and that he would rather be arrested than change the procedure of his routine visit. The manager then called the police and had Mr. Thornton handcuffed in the store for trespassing. After he was booked and put into a holding cell for a brief time, all charges were dropped. Mr. Thornton is suing Thrifty Drug for false arrest.
9月3日,原告大卫-桑顿(David Thornton),一位四十三岁的工会现场代表,出现在Thrifty Drug Store #168,进行例行的工会访问。在他到达的十分钟内, ,一位商店经理与他对峙,告诉他不能再与商店地板上的工会雇员交谈。相反,他将不得不在他们休息时在一个密室里见他们。这种要求是工会与Thrifty Drug签订的合同所允许的,但以前从未执行过。 当桑顿先生反对时,他被告知,他可以选择符合这些要求, 离开商店,或者被逮捕。此时,桑顿先生向经理表示,只要不干扰 业务,他一直被允许在地板上与员工交谈,时间长达10分钟,他宁愿被逮捕也不愿改变他例行访问的程序。经理随后报了警,并以非法入侵为由将桑顿先生铐在店内。 在他被收监并被关进拘留室的短暂时间内,所有指控被撤销。桑顿先生正在起诉Thrifty Drug公司的虚假逮捕。
In addition to this background material, which all participants read, different groups were exposed to presentations by the lawyers for the two parties. Naturally, the lawyer for the union organizer described the arrest as an intimidation attempt, while the lawyer for the store argued that having the talk in the store was disruptive and that the manager was acting properly. Some participants, like a jury, heard both sides. The lawyers added no useful information that you could not infer from the background story.
除了所有与会者阅读的这些背景材料外,不同的小组还听取了双方律师的介绍。 自然,工会组织者的律师将逮捕描述为一种恐吓企图,而商店的律师 ,认为在商店里进行谈话是破坏性的,经理的行为是正确的。一些与会者,像陪审团一样,听取了双方的意见。律师们没有添加任何你无法从背景故事中推断出的有用信息。
The participants were fully aware of the setup, and those who heard only one side could easily have generated the argument for the other side. Nevertheless, the presentation of one-sided evidence had a very pronounced effect on judgments. Furthermore, participants who saw one-sided evidence were more confident of their judgments than those who saw both sides. This is just what you would expect if the confidence that people experience is determined by the coherence of the story they manage to construct from available information. It is the consistency of the information that matters for a good story, not its completeness. Indeed, you will often find that knowing little makes it easier to fit everything you know into a coherent pattern.
参与者完全知道这个设置,那些只听到一方的人很容易就产生了 另一方的论点。此外,看到单方证据的参与者比看到双方证据的参与者对他们的判断更有信心。如果人们的信心是由他们设法从现有信息中构建的故事的一致性决定的,那么这正是你所期望的。 对于一个好的故事来说,重要的是信息的一致性,而不是它的完整性。事实上,你经常会发现,知道的不多,使你更容易把你知道的一切纳入一个连贯的模式。
WY SIATI facilitates the achievement of coherence and of the cognitive ease that causes us to accept a statement as true. It explains why we can think fast, and how we are able to make sense of partial information in a complex world. Much of the time, the coherent story we put together is close enough to reality to support reasonable action. However, I will also invoke WY SIATI to help explain a long and diverse list of biases of judgment and choice, including the following among many others:
WY SIATI有利于实现连贯性和认知上的轻松,使我们接受一个陈述为真实。它解释了为什么我们能够快速思考,以及我们如何能够在复杂的世界中对部分信息做出 。很多时候,我们拼凑出来的连贯的故事足以支持合理的行动。 然而,我也会引用WY SIATI来帮助解释一长串不同的判断和选择的偏见,包括以下这些。
Overconfidence: As the WY SIATI rule implies, neither the quantity nor the quality of the evidence counts for much in subjective confidence. The confidence that individuals have in their beliefs depends mostly on the quality of the story they can tell about what they see, even if they see little. We often fail to allow for the possibility that evidence that should be critical to our judgment is missing—what we see is all there is. Furthermore, our associative system tends to settle on a coherent pattern of activation and suppresses doubt and ambiguity.
过度自信:正如WY SIATI规则所暗示的那样,证据的数量和质量 ,在主观上都不太重要。个人对其信仰的信心主要取决于他们对所看到的东西所能讲述的故事的质量,即使他们看到的东西很少。我们常常不允许对我们的判断至关重要的证据缺失的可能性--我们所看到的就是全部。此外,我们的联想系统倾向于 ,解决一个连贯的激活模式,抑制怀疑和模糊性。
Framing effects: Different ways of presenting the same information often evoke different emotions. The statement that “the odds of survival one month after surgery are 90%” is more reassuring than the equivalent statement that “mortality within one month of surgery is 10%.” Similarly, cold cuts described as “90% fat-free” are more attractive than when they are described as “10% fat.” The equivalence of the alternative formulations is transparent, but an individual normally sees only one formulation, and what she sees is all there is.
框架效应:以不同的方式呈现相同的信息往往会唤起不同的情感。 "手术后一个月的生存几率为90%"的说法比 "手术后一个月内的死亡率为10%"的同等说法更令人放心。同样,描述为 "90%无脂肪 "的冷盘 ,比描述为 "10%脂肪 "时更有吸引力。替代方案的等价性是透明的,但个人通常只看到一种方案,她所看到的就是全部内容。
Base-rate neglect: Recall Steve, the meek and tidy soul who is often believed to be a librarian. The personality description is salient and vivid, and although you surely know that there are more male farmers than male librarians, that statistical fact almost certainly did not come to your mind when you first considered the question. What you saw was all there was.
基准忽视:回顾一下史蒂夫,这个温顺整洁的灵魂,通常被认为是一个图书管理员。个性描述突出而生动,尽管你肯定知道 ,男性农民比男性图书管理员多,但当你第一次考虑这个问题时,几乎肯定没有想到这个统计事实。 你看到的就是全部。
SPEAKING OF JUMPING TO CONCLUSIONS
说到结论的跳跃性
“She knows nothing about this person’s management skills. All she is going by is the halo effect from a good presentation.”
"她对这个人的管理能力一无所知。她所依据的只是一个好的演讲所带来的光环效应。"
“Let’s decorrelate errors by obtaining separate judgments on the issue before any discussion. We will get more information from independent assessments.”
"在进行任何讨论之前,让我们通过获得关于该问题的独立 判断来装饰错误。 我们将从独立评估中获得更多信息。"
“They made that big decision on the basis of a good report from one consultant. WYSIATI—what you see is all there is. They did not seem to realize how little information they had.”
"他们根据一位顾问的良好报告做出了这一重大决定。 所见即所得,他们似乎没有意识到他们掌握的信息有多么少。"
“They didn’t want more information that might spoil their story. WYSIATI.”
"他们不希望有更多的信息,可能会破坏他们的故事。 WYSIATI。"
8
8
How Judgments Happen
判决是如何发生的
There is no limit to the number of questions you can answer, whether they are questions someone else asks or questions you ask yourself. Nor is there a limit to the number of attributes you can evaluate. You are capable of counting the number of capital letters on this page, comparing the height of the windows of your house to the one across the street, and assessing the political prospects of your senator on a scale from excellent to disastrous. System 2 receives questions or generates them: in either case it directs attention and searches memory to find the answers. System 1 operates differently. It continuously monitors what is going on outside and inside the mind, and continuously generates assessments of various aspects of the situation without specific intention and with little or no effort. These basic assessments play an important role in intuitive judgment, because they are easily substituted for more difficult questions—this is the essential idea of the heuristics and biases approach. Two other features of System 1 also support the substitution of one judgment for another. One is the ability to translate values across dimensions, which you do in answering a question that most people find easy: “If Sam were as tall as he is intelligent, how tall would he be?” Finally, there is the mental shotgun. An intention of System 2 to answer a specific question or evaluate a particular attribute of the situation automatically triggers other computations, including basic assessments.
你能回答的问题数量没有限制,无论是别人问的问题还是你自己问的问题。 你能评估的属性数量也没有限制。你 ,能够计算本页大写字母的数量,比较你的房子和对面的房子窗户的高度,并在从优秀到灾难的范围内评估你的参议员的政治前景。系统2接收问题或产生问题:在任何一种情况下,它都会引导注意力并搜索记忆以找到答案。系统1的运作方式不同。它 ,不断地监测思想外部和内部发生的事情,不断地产生对各方面情况的评估,没有具体的意图,几乎不需要努力。这些 基本评估 在直觉判断中起着重要作用,因为它们很容易被替换成更难的问题--这就是启发式和偏见方法的基本思想。 系统1的另外两个特点也支持用一种判断替换另一种判断。一个是跨维度转换价值的能力,你在回答一个大多数人都觉得很容易的问题时就是这样做的:"如果山姆和他的智慧一样高,他会有多高?"最后,还有精神上的猎枪。系统2的意图是回答一个特定的问题或评估一个特定的 情况的属性,自动触发其他计算,包括基本评估。
BASIC ASSESSMENTS
基本评估
System 1 has been shaped by evolution to provide a continuous assessment of the main problems that an organism must solve to survive: How are things going? Is there a threat or a major opportunity? Is everything normal? Should I approach or avoid? The questions are perhaps less urgent for a human in a city environment than for a gazelle on the savannah, but we have inherited the neural mechanisms that evolved to provide ongoing assessments of threat level, and they have not been turned off. Situations are constantly evaluated as good or bad, requiring escape or permitting approach. Good mood and cognitive ease are the human equivalents of assessments of safety and familiarity.
系统1由进化形成,对生物体为生存而必须解决的主要问题进行持续评估:事情进展如何?是否存在威胁或重大机遇?一切正常吗?我应该接近还是回避? 对于城市环境中的人类来说,这些问题也许没有草原上的羚羊那么紧迫,但我们已经继承了进化出的对威胁程度进行持续评估的神经机制,而且它们并没有被关闭。情况不断被评估为好或坏,需要逃避或允许接近。 良好的情绪和认知上的轻松是人类对安全和熟悉的评估的等效物。
For a specific example of a basic assessment , consider the ability to discriminate friend from foe at a glance. This contributes to one’s chances of survival in a dangerous world, and such a specialized capability has indeed evolved. Alex Todorov, my colleague at Princeton, has explored the biological roots of the rapid judgments of how safe it is to interact with a stranger. He showed that we are endowed with an ability to evaluate, in a single glance at a stranger’s face, two potentially crucial facts about that person: how dominant (and therefore potentially threatening) he is, and how trustworthy he is, whether his intentions are more likely to be friendly or hostile . The shape of the face provides the cues for assessing dominance: a “strong” square chin is one such cue. Facial expression (smile or frown) provides the cues for assessing the stranger’s intentions. The combination of a square chin with a turned-down mouth may spell trouble . The accuracy of face reading is far from perfect: round chins are not a reliable indicator of meekness, and smiles can (to some extent) be faked. Still, even an imperfect ability to assess strangers confers a survival advantage.
关于 基本评估 的一个具体例子,请考虑一下一目了然地辨别敌我的能力。我在普林斯顿大学的同事亚历克斯-托多罗夫(Alex Todorov)探讨了快速判断与陌生人交往的安全程度的 生物学根源 。他表明,我们 ,只要看一眼陌生人的脸,就有能力评估关于这个人的两个潜在的关键事实:他有多大的优势(因此有潜在的威胁),以及他有多大的可信度,他的意图更可能是 友好的还是敌对的 。面部形状提供了评估支配地位的线索:"强壮的 "方下巴就是这样一个线索。面部 表情(微笑或皱眉)提供了评估陌生人意图的线索。方下巴与下垂的嘴结合在一起 可能意味着麻烦 。 面部解读的准确性远非完美:圆下巴不是温顺的可靠指标,微笑可以(在某种程度上)被伪造。 尽管如此,即使评估陌生人的不完美能力也会赋予生存优势。
This ancient mechanism is put to a novel use in the modern world: it has some influence on how people vote. Todorov showed his students pictures of men’s faces, sometimes for as little as one-tenth of a second, and asked them to rate the faces on various attributes, including likability and competence. Observers agreed quite well on those ratings. The faces that Todorov showed were not a random set: they were the campaign portraits of politicians competing for elective office. Todorov then compared the results of the electoral races to the ratings of competence that Princeton students had made, based on brief exposure to photographs and without any political context. In about 70% of the races for senator, congressman, and governor, the election winner was the candidate whose face had earned a higher rating of competence. This striking result was quickly confirmed in national elections in Finland, in zoning board elections in England, and in various electoral contests in Australia, Germany, and Mexico . Surprisingly (at least to me), ratings of competence were far more predictive of voting outcomes in Todorov’s study than ratings of likability.
这种 古老的机制在现代世界被赋予了新奇的用途:它对人们的投票方式有一定的影响。 托多罗夫向他的学生展示了男人的脸部照片,有时只有十分之一秒,并要求他们对这些脸部的各种属性进行评分,包括可亲性和能力。观察员们对这些评级的意见相当一致。托多罗夫展示的面孔不是随机的: ,它们是竞争选举职位的政治家的竞选画像。然后,托多罗夫将选举结果与普林斯顿大学学生的能力评分进行了比较,这些评分是基于对照片的短暂接触,没有任何政治背景。在大约70%的参议员、众议员和州长的竞选中,选举获胜者是其面孔获得 ,能力评分较高的候选人 。这一引人注目的结果很快在芬兰的全国选举、英国的区划委员会选举以及 澳大利亚、德国和墨西哥 的各种选举竞争中得到了证实。令人惊讶的是(至少对我来说),在托多罗夫的研究中,对能力的评价远比对喜好的评价更能预测投票结果。
Todorov has found that people judge competence by combining the two dimensions of strength and trustworthiness. The faces that exude competence combine a strong chin with a slight confident-appearing smile. There is no evidence that these facial features actually predict how well politicians will perform in office. But studies of the brain’s response to winning and losing candidates show that we are biologically predisposed to reject candidates who lack the attributes we value—in this research, losers evoked stronger indications of (negative) emotional response. This is an example of what I will call a judgment heuristic in the following chapters. Voters are attempting to form an impression of how good a candidate will be in office, and they fall back on a simpler assessment that is made quickly and automatically and is available when System 2 must make its decision.
托多罗夫发现,人们通过结合力量和可信度这两个维度来判断 能力。散发出能力的面孔结合了强有力的下巴和略带自信的微笑。没有证据表明,这些面部特征实际上预测了政治家在办公室的表现。 但对大脑对获胜和失败的候选人的反应的研究表明,我们在生物学上倾向于拒绝 ,因为这些候选人缺乏我们所重视的属性--在这项研究中,失败者唤起了更强烈的(负面)情绪反应迹象。 这是一个例子,我将在以下章节中称之为 判断启发式 。选民们正试图形成一个印象,即一个候选人在办公室里会有多好,他们会回落到一个更简单的评估,这个评估是快速和自动做出的,当系统2必须做出决定时,可以 。
Political scientists followed up on Todorov’s initial research by identifying a category of voters for whom the automatic preferences of System 1 are particularly likely to play a large role. They found what they were looking for among politically uninformed voters who watch a great deal of television. As expected, the effect of facial competence on voting is about three times larger for information-poor and TV-prone voters than for others who are better informed and watch less television . Evidently, the relative importance of System 1 in determining voting choices is not the same for all people. We will encounter other examples of such individual differences.
政治学家对托多罗夫的初步研究进行了追踪,确定了一类选民,对他们来说,制度1的自动偏好特别可能发挥很大作用。正如预期的那样,对于信息贫乏和容易看电视的选民来说,面部能力对投票的影响 ,比其他信息较好和 看电视较少的 人要大三倍左右。显然,系统1在决定投票选择方面的相对重要性对所有人都不一样。 我们还会遇到这种个体差异的其他例子。
System 1 understands language, of course, and understanding depends on the basic assessments that are routinely carried out as part of the perception of events and the comprehension of messages. These assessments include computations of similarity and representativeness, attributions of causality, and evaluations of the availability of associations and exemplars. They are performed even in the absence of a specific task set , although the results are used to meet task demands as they arise.
当然,系统1能理解语言,而理解取决于基本的评估 ,这些评估是作为感知事件和理解信息的一部分例行进行的。这些评估包括相似性和代表性的计算、因果关系的归属以及对关联和范例可用性的评估。即使在 没有特定任务集 的情况下,也会执行这些任务,尽管其结果被用来满足出现的任务需求。
The list of basic assessments is long, but not every possible attribute is assessed. For an example, look briefly at figure 7 .
基本评估的清单很长,但并不是每一个可能的属性都要评估。 举个例子,简单看一下 图7 。
A glance provides an immediate impression of many features of the display. You know that the two towers are equally tall and that they are more similar to each other than the tower on the left is to the array of blocks in the middle. However, you do not immediately know that the number of blocks in the left-hand tower is the same as the number of blocks arrayed on the floor, and you have no impression of the height of the tower that you could build from them. To confirm that the numbers are the same, you would need to count the two sets of blocks and compare the results, an activity that only System 2 can carry out.
一眼就能对展示的许多特征有一个直接的印象。 你知道这两座塔同样高,而且它们之间的相似度比左边的塔与中间的积木阵列 。然而,你并不能立即知道 ,左侧塔中的积木数量与排列在地板上的积木数量相同,而且你对用这些积木搭建的塔的高度也没有印象。要确认这些数字是相同的,你需要计算两组积木并比较结果,这种活动只有系统2能够进行。

Figure 7
图7
SETS AND PROTOTYPES
套装和原型
For another example, consider the question: What is the average length of the lines in figure 8 ?
另一个例子,考虑问题: 图8 中线条的平均长度是多少?

Figure 8
图8
This question is easy and System 1 answers it without prompting. Experiments have shown that a fraction of a second is sufficient for people to register the average length of an array of lines with considerable precision. Furthermore, the accuracy of these judgments is not impaired when the observer is cognitively busy with a memory task. They do not necessarily know how to describe the average in inches or centimeters, but they will be very accurate in adjusting the length of another line to match the average. System 2 is not needed to form an impression of the norm of length for an array. System 1 does it, automatically and effortlessly, just as it registers the color of the lines and the fact that they are not parallel. We also can form an immediate impression of the number of objects in an array—precisely if there are four or fewer objects, crudely if there are more.
这个问题很简单,系统1不需要提示就能回答。 实验表明,几分之一秒的时间足以让人们相当精确地记录一列线条的平均长度。 此外,当观察者在认知上忙于记忆任务时,这些判断的准确性不会受到影响。他们不一定知道如何描述 ,以英寸或厘米为单位的平均值,但他们会 ,非常准确地调整另一条线的长度以配合平均值。系统2不需要对一个数组的长度规范形成印象。系统1自动地、毫不费力地做到了这一点,就像它登记了线条的颜色和它们不平行的事实一样。我们也可以对一个数组中的对象的数量形成一个直接的印象 --如果有四个或更少的对象,则是精确的,如果有更多的对象,则是粗糙的。
Now to another question: What is the total length of the lines in figure 8 ? This is a different experience, because System 1 has no suggestions to offer. The only way you can answer this question is by activating System 2, which will laboriously estimate the average, estimate or count the lines, and multiply average length by the number of lines.
现在是另一个问题: 图8 中线条的总长度是多少?这是一种不同的体验,因为系统1没有任何建议可以提供。你回答这个问题的唯一方法是激活系统2,它将费力地估计平均数,估计或计算行数,并 ,用平均长度乘以行数。
The failure of System 1 to compute the total length of a set of lines at a glance may look obvious to you; you never thought you could do it. It is in fact an instance of an important limitation of that system. Because System 1 represents categories by a prototype or a set of typical exemplars, it deals well with averages but poorly with sums. The size of the category, the number of instances it contains, tends to be ignored in judgments of what I will call sum-like variables .
系统1无法计算一组线条的总长度,这对你来说可能是显而易见的;你从未想过你能做到这一点。 事实上,这是该系统的一个重要限制的例子。由于系统1通过一个原型或一组典型范例来表示类别,它对平均数的处理很好,但对总和的处理很差。类别的 ,即它所包含的实例数量,在判断我所说的 总和类变量 时往往被忽略。
Participants in one of the numerous experiments that were prompted by the litigation following the disastrous Exxon Valdez oil spill were asked their willingness to pay for nets to cover oil ponds in which migratory birds often drown. Different groups of participants stated their willingness to pay to save 2,000, 20,000, or 200,000 birds. If saving birds is an economic good it should be a sum-like variable: saving 200,000 birds should be worth much more than saving 2,000 birds. In fact, the average contributions of the three groups were $80, $78, and $88 respectively. The number of birds made very little difference. What the participants reacted to, in all three groups, was a prototype—the awful image of a helpless bird drowning, its feathers soaked in thick oil. The almost complete neglect of quantity in such emotional contexts has been confirmed many times.
在灾难性的 埃克森-瓦尔迪兹 石油泄漏事件后的诉讼中,许多实验的参与者被问及他们是否愿意为覆盖油池的网付费,因为候鸟经常在油池中溺水。不同的参与者群体 ,表示他们愿意为拯救2,000只、20,000只或200,000只鸟而付费。如果拯救鸟类是一种经济利益,它应该是一个类似于总和的变量:拯救20万只鸟应该比拯救2000只鸟更有价值。事实上,三组的平均捐款分别为80美元、78美元和88美元。鸟的数量差别不大。在所有 三组中,参与者的反应是一个原型--一只无助的鸟被淹死的可怕形象,它的羽毛浸泡在浓油中。在这样的情感背景下,几乎完全忽视了数量,这一点已被多次证实。
INTENSITY MATCHING
强度匹配
Questions about your happiness, the president’s popularity, the proper punishment of financial evildoers, and the future prospects of a politician share an important characteristic: they all refer to an underlying dimension of intensity or amount, which permits the use of the word more : more happy, more popular, more severe, or more powerful (for a politician). For example, a candidate’s political future can range from the low of “She will be defeated in the primary” to a high of “She will someday be president of the United States.”
关于你的幸福感、总统的受欢迎程度、对金融恶棍的适当惩罚以及一个政治家的未来前景的问题都有一个重要的特点: ,它们都是指强度或数量的基本维度,这就允许使用 更多 这个词:更幸福、更受欢迎、更严重或更强大(对一个政治家来说)。例如, ,一个候选人的政治前途可以从 "她将在初选中被击败 "的低水平到 "她有一天会成为美国总统 "的高水平。
Here we encounter a new aptitude of System 1. An underlying scale of intensity allows matching across diverse dimensions. If crimes were colors, murder would be a deeper shade of red than theft. If crimes were expressed as music, mass murder would be played fortissimo while accumulating unpaid parking tickets would be a faint pianissimo. And of course you have similar feelings about the intensity of punishments. In classic experiments, people adjusted the loudness of a sound to the severity of crimes; other people adjusted loudness to the severity of legal punishments. If you heard two notes, one for the crime and one for the punishment, you would feel a sense of injustice if one tone was much louder than the other.
在这里,我们遇到了一个新的能力,即系统 1。一个基本的强度等级允许在不同的维度上 进行匹配 。如果犯罪是颜色,谋杀将是比盗窃更深的红色。如果犯罪被表达为音乐,大规模的谋杀将被演奏成强音,而累积未支付的停车票将是微弱的钢琴曲。当然,你对惩罚的强度也有类似的感觉。在经典的实验中, ,人们将声音的响度调整为犯罪的严重程度;其他人则将响度调整为法律惩罚的严重程度。如果你听到两个音符,一个代表犯罪,一个代表惩罚,如果一个音符比另一个响得多,你会感到 不公正的感觉 。
Consider an example that we will encounter again later:
考虑一下我们以后会再次遇到的一个例子。
Julie read fluently when she was four years old.
朱莉在四岁的时候就能流利地阅读。
Now match Julie’s reading prowess as a child to the following intensity scales:
现在将朱莉小时候的阅读能力与下列强度标尺相匹配。
How tall is a man who is as tall as Julie was precocious?
一个和朱莉一样高的人有多早熟?
What do you think of 6 feet? Obviously too little. What about 7 feet? Probably too much. You are looking for a height that is as remarkable as the achievement of reading at age four. Fairly remarkable, but not extraordinary. Reading at fifteen months would be extraordinary, perhaps like a man who is 7′8″.
你对6英尺有什么看法?显然是太少了。7英尺呢?可能是太多了。 你在寻找一个高度,就像四岁时阅读的成就一样显著。相当引人注目,但并不特别。十五个月时的阅读能力将 ,也许就像一个7′8英寸的人一样,是不寻常的。
What level of income in your profession matches Julie’s reading achievement?
在你的职业中,什么水平的收入与朱莉的阅读成绩相匹配?
Which crime is as severe as Julie was precocious?
哪种罪行和朱莉的早熟一样严重?
Which graduating GPA in an Ivy League college matches Julie’s reading?
常春藤盟校中哪所毕业的GPA与朱莉的阅读量相符?
Not very hard, was it? Furthermore, you can be assured that your matches will be quite close to those of other people in your cultural milieu. We will see that when people are asked to predict Julie’s GPA from the information about the age at which she learned to read, they answer by translating from one scale to another and pick the matching GPA. And we will also see why this mode of prediction by matching is statistically wrong—although it is perfectly natural to System 1, and for most people except statisticians it is also acceptable to System 2.
不是很困难,是吗?此外,你可以放心,你的匹配将与你的文化环境中的其他人相当接近。 我们将看到,当人们被要求根据朱莉学会阅读的年龄的信息来预测她的GPA时,他们的回答是从一个量表转换到另一个量表,并选择匹配的GPA。我们还将看到为什么这种通过匹配来预测的模式在统计上是错误的--尽管它对系统1来说是完全自然的,而且对大多数人来说,除了统计学家,它对系统2也是可以接受的 。
THE MENTAL SHOTGUN
精神上的猎枪
System 1 carries out many computations at any one time. Some of these are routine assessments that go on continuously. Whenever your eyes are open, your brain computes a three-dimensional representation of what is in your field of vision, complete with the shape of objects, their position in space, and their identity. No intention is needed to trigger this operation or the continuous monitoring for violated expectations. In contrast to these routine assessments, other computations are undertaken only when needed: you do not maintain a continuous evaluation of how happy or wealthy you are, and even if you are a political addict you do not continuously assess the president’s prospects. The occasional judgments are voluntary. They occur only when you intend them to do so.
系统1在任何时候都会进行许多计算,其中一些是持续进行的常规评估。 每当你的眼睛睁开时,你的大脑就会计算出你视野中的三维表示,包括物体的形状、它们在空间的位置以及它们的身份。不需要任何意图来触发这种操作 ,也不需要持续监测违反预期的情况。与这些常规评估相比,其他计算只有在需要时才会进行:你不会持续评估你有多快乐或多富有,即使你是一个政治上瘾者,你也不会持续评估总统的前景。 偶尔的判断是自愿的。只有当你打算 ,它们才会发生。
You do not automatically count the number of syllables of every word you read, but you can do it if you so choose. However, the control over intended computations is far from precise: we often compute much more than we want or need. I call this excess computation the mental shotgun . It is impossible to aim at a single point with a shotgun because it shoots pellets that scatter, and it seems almost equally difficult for System 1 not to do more than System 2 charges it to do. Two experiments that I read long ago suggested this image.
你不会自动计算你所读的每个单词的音节数,但如果你选择的话,你可以这样做。然而,对预期计算的控制远非精确:我们的计算往往比我们想要或需要的要多得多。 我把这种多余的计算称为 心理枪 。用猎枪瞄准一个点是不可能的,因为它射出的弹丸会散开, ,系统1不做比系统2指控它做的更多的事情,似乎也同样困难。 我很久以前读过的两个实验提出了这个形象的说法。
Participants in one experiment listened to pairs of words, with the instruction to press a key as quickly as possible whenever they detected that the words rhymed . The words rhyme in both these pairs:
在一项实验中,参与者听取了成对的单词,指示他们每当 发现这些单词押韵 时就尽快按下一个键。这两对词都是押韵的。
VOTE—NOTE
投票-NOTE
VOTE—GOAT
投票--GOAT
The difference is obvious to you because you see the two pairs. VOTE and GOAT rhyme, but they are spelled differently. The participants only heard the words, but they were also influenced by the spelling. They were distinctly slower to recognize the words as rhyming if their spelling was discrepant. Although the instructions required only a comparison of sounds, the participants also compared their spelling, and the mismatch on the irrelevant dimension slowed them down. An intention to answer one question evoked another, which was not only superfluous but actually detrimental to the main task.
差异 ,因为你看到的是两对。 VOTE和GOAT是押韵的,但它们的拼写不同。参与者只听到了这些词,但他们也受到了拼写的影响。如果单词的拼写不一致,他们识别出单词押韵的速度明显较慢。虽然指令只要求比较声音,但参与者还比较了他们的拼写, ,不相关维度上的不匹配使他们的速度变慢。回答一个问题的意图唤起了另一个问题,这不仅是多余的,而且实际上不利于主要任务。
In another study, people listened to a series of sentences, with the instruction to press one key as quickly as possible to indicate if the sentence was literally true , and another key if the sentence was not literally true. What are the correct responses for the following sentences?
在另一项研究中,人们听了一系列的句子,指示他们尽快按一个键来表示该 句子 是否 为字面意思 ,如果该句子不是 字面意思,则按另一个键。以下句子的正确回答是什么?
Some roads are snakes.
有些道路是蛇形的。
Some jobs are snakes.
有些工作是蛇形的。
Some jobs are jails.
有些工作是监狱。
All three sentences are literally false. However, you probably noticed that the second sentence is more obviously false than the other two—the reaction times collected in the experiment confirmed a substantial difference. The reason for the difference is that the two difficult sentences can be metaphorically true. Here again, the intention to perform one computation evoked another. And here again, the correct answer prevailed in the conflict, but the conflict with the irrelevant answer disrupted performance. In the next chapter we will see that the combination of a mental shotgun with intensity matching explains why we have intuitive judgments about many things that we know little about.
这三个句子从字面上看都是错误的。然而,你可能注意到,第二句话比其他两句更明显的错误--实验中收集的反应时间证实了一个实质性的差异。差异的原因 ,因为这两个困难的句子可以是比喻性的。在这里,执行一种计算的意图再次唤起了另一种计算。而在这里,正确的答案又在冲突中占了上风,但与不相关的答案的冲突却扰乱了表现。在下一章中,我们将看到,心理猎枪与强度匹配的结合解释了为什么我们对许多我们知之甚少的事物有直观的判断 。
SPEAKING OF JUDGMENT
说到审判
“Evaluating people as attractive or not is a basic assessment. You do that automatically whether or not you want to, and it influences you.”
"评价人们是否有吸引力是一个基本的评估。 无论你是否愿意,你都会自动这样做,而且它影响着你。"
“There are circuits in the brain that evaluate dominance from the shape of the face. He looks the part for a leadership role.”
"大脑中存在着从脸部形状评估主导地位的电路。他看起来很适合担任领导角色"。
“The punishment won’t feel just unless its intensity matches the crime. Just like you can match the loudness of a sound to the brightness of a light.”
"除非惩罚的强度符合 ,否则不会感到公正。就像你可以将声音的响度与灯光的亮度相匹配。"
“This was a clear instance of a mental shotgun. He was asked whether he thought the company was financially sound, but he couldn’t forget that he likes their product.”
"这是一个明显的精神枪击事件。有人问他是否认为该公司财务状况良好,但他不能忘记他喜欢他们的产品。"
9
9
Answering an Easier Question
回答一个更容易的问题
A remarkable aspect of your mental life is that you are rarely stumped. True, you occasionally face a question such as 17 × 24 = ? to which no answer comes immediately to mind, but these dumbfounded moments are rare. The normal state of your mind is that you have intuitive feelings and opinions about almost everything that comes your way. You like or dislike people long before you know much about them; you trust or distrust strangers without knowing why; you feel that an enterprise is bound to succeed without analyzing it. Whether you state them or not, you often have answers to questions that you do not completely understand, relying on evidence that you can neither explain nor defend.
你精神生活的一个显著方面是,你很少被绊倒。诚然,你偶尔会遇到一个问题,如17×24=?,而脑海中却没有答案,但这种茫然的时刻 ,是很少见的。你的正常状态是,你对几乎所有遇到的事情都有直观的感受和看法。你喜欢或不喜欢的人,早在你对他们了解之前;你信任或不信任陌生人,而不知道为什么;你觉得一个企业一定会成功,而不去分析它。无论你是否说明,你经常对 ,你不完全理解的问题有答案,依靠的是你无法解释或辩护的证据。
SUBSTITUTING QUESTIONS
替代问题
I propose a simple account of how we generate intuitive opinions on complex matters. If a satisfactory answer to a hard question is not found quickly, System 1 will find a related question that is easier and will answer it. I call the operation of answering one question in place of another substitution . I also adopt the following terms:
我提出了一个关于我们如何对复杂问题产生直觉意见的简单说明。如果不能很快找到一个令人满意的答案,系统1会找到一个更容易的相关问题,并回答它。我把回答一个问题的操作称为另一个 替换 的地方 。我还采用了以下术语。
The target question is the assessment you intend to produce.
目标问题是你打算制作的评估。
The heuristic question is the simpler question that you answer instead.
启发式问题是你回答的比较简单的问题,而不是。
The technical definition of heuristic is a simple procedure that helps find adequate, though often imperfect, answers to difficult questions. The word comes from the same root as eureka .
启发式的 技术定义是一个简单的程序,帮助找到充分的,尽管往往是不完美的,对困难问题的答案。 这个词与 尤里卡 来自同一个词根。
The idea of substitution came up early in my work with Amos, and it was the core of what became the heuristics and biases approach. We asked ourselves how people manage to make judgments of probability without knowing precisely what probability is. We concluded that people must somehow simplify that impossible task, and we set out to find how they do it. Our answer was that when called upon to judge probability, people actually judge something else and believe they have judged probability. System 1 often makes this move when faced with difficult target questions, if the answer to a related and easier heuristic question comes readily to mind.
在我和阿莫斯的工作中,很早就出现了替代 ,这也是后来启发式方法和偏见方法的核心。我们问自己,人们在不确切知道概率是什么的情况下是如何做出概率判断的。我们得出结论,人们必须以某种方式简化这项不可能完成的任务,我们开始寻找他们是如何做到这一点的。我们的答案是,当被要求判断概率时, ,人们实际上是在判断别的东西,并认为他们已经判断了概率。系统1在面对困难的目标问题时,如果相关的、更容易的启发式问题的答案很容易出现在脑海中,就经常做出这种举动。
Substituting one question for another can be a good strategy for solving difficult problems, and George Pólya included substitution in his classic How to Solve It : “If you can’t solve a problem, then there is an easier problem you can solve: find it.” Pólya’s heuristics are strategic procedures that are deliberately implemented by System 2. But the heuristics that I discuss in this chapter are not chosen; they are a consequence of the mental shotgun, the imprecise control we have over targeting our responses to questions.
用一个问题替代另一个问题可以是解决困难问题的好策略,乔治-波利亚在他的经典著作《 如何解决 》中包含了替代的内容 :"如果你不能解决一个问题,那么有一个更容易的问题你可以解决:找到它"。波利亚的启发式方法是系统2有意实施的战略程序。 但我在本章讨论的启发式方法并不是选择的;它们是心理枪弹的结果,是我们对问题的反应有针对性的不精确控制。
Consider the questions listed in the left-hand column of table 1 . These are difficult questions, and before you can produce a reasoned answer to any of them you must deal with other difficult issues. What is the meaning of happiness? What are the likely political developments in the next six months? What are the standard sentences for other financial crimes? How strong is the competition that the candidate faces? What other environmental or other causes should be considered? Dealing with these questions seriously is completely impractical. But you are not limited to perfectly reasoned answers to questions. There is a heuristic alternative to careful reasoning, which sometimes works fairly well and sometimes leads to serious errors.
请考虑 表1 左边一栏中列出的问题 。这些都是困难的问题,在你对其中任何一个问题做出合理的回答之前,你必须处理其他困难的问题。 幸福的含义是什么? 未来六个月可能的政治发展是什么? 其他金融犯罪的标准刑期是什么?候选人面临的竞争有多激烈? 还应考虑哪些 环境或其他原因?认真处理这些问题是完全不现实的。但你并不局限于完全推理的问题答案。 有一种启发式的方法可以替代仔细推理,有时效果相当好,有时会导致严重错误。
Target Question
目标问题 =
Heuristic Question
启发式问题 =
How much would you contribute to save an endangered species?
为了拯救一个濒临灭绝的 ,你会捐多少钱?
How much emotion do I feel when I think of dying dolphins?
当我想到奄奄一息的海豚时,我有多少感慨?
How happy are you with your life these days?
你对自己这些天的生活有多满意?
What is my mood right now?
我现在的心情是怎样的?
How popular will the president be six months from now?
6个月后,总统会有多受欢迎?
How popular is the president right now?
总统现在有多受欢迎?
How should financial advisers who prey on the elderly be punished?
掠夺老年人的财务顾问应受到怎样的惩罚 ?
How much anger do I feel when I think of financial predators?
当我想到金融掠夺者时,我感到有多愤怒?
This woman is running for the primary. How far will she go in politics?
这个女人正在竞选 ,初选。她在政治上会走多远?
Does this woman look like a political winner?
这个女人看起来像一个政治赢家吗?
Table 1
表1
The mental shotgun makes it easy to generate quick answers to difficult questions without imposing much hard work on your lazy System 2. The right-hand counterpart of each of the left-hand questions is very likely to be evoked and very easily answered. Your feelings about dolphins and financial crooks, your current mood, your impressions of the political skill of the primary candidate, or the current standing of the president will readily come to mind. The heuristic questions provide an off-the-shelf answer to each of the difficult target questions.
心理枪使你很容易对困难的问题产生快速的答案,而不需要对你的懒惰系统2强加许多艰苦的工作。 每一个左手的问题的右手对应物都很有可能被唤起,并且非常容易回答。你对海豚和金融 骗子的感觉,你现在的心情,你对初选候选人的政治技巧的印象,或总统目前的地位会很容易想到。 启发式问题为每个困难的目标问题提供了一个现成的答案。
Something is still missing from this story: the answers need to be fitted to the original questions. For example, my feelings about dying dolphins must be expressed in dollars. Another capability of System 1, intensity matching, is available to solve that problem. Recall that both feelings and contribution dollars are intensity scales. I can feel more or less strongly about dolphins and there is a contribution that matches the intensity of my feelings. The dollar amount that will come to my mind is the matching amount. Similar intensity matches are possible for all the questions. For example, the political skills of a candidate can range from pathetic to extraordinarily impressive, and the scale of political success can range from the low of “She will be defeated in the primary” to a high of “She will someday be president of the United States.”
这个故事中仍然缺少一些东西:答案需要与原来的问题相适应。例如,我对濒临死亡的 海豚的感受必须用美元来表达。系统1的另一项能力,即强度匹配,可用于解决这一问题。回顾一下,感受和贡献值都是强度标尺。 我可以对海豚有更多或更少的感受,有一个与我的感受强度相匹配的贡献值。会出现在我脑海中的美元数额是匹配的数额。类似的强度 ,所有的问题都有可能匹配。例如,候选人的政治能力可以从可怜到超凡脱俗,政治成功的规模可以从 "她将在初选中被击败 "的低水平到 "她有一天会成为美国总统 "的高水平。
The automatic processes of the mental shotgun and intensity matching often make available one or more answers to easy questions that could be mapped onto the target question. On some occasions, substitution will occur and a heuristic answer will be endorsed by System 2. Of course, System 2 has the opportunity to reject this intuitive answer, or to modify it by incorporating other information. However, a lazy System 2 often follows the path of least effort and endorses a heuristic answer without much scrutiny of whether it is truly appropriate. You will not be stumped, you will not have to work very hard, and you may not even notice that you did not answer the question you were asked. Furthermore, you may not realize that the target question was difficult, because an intuitive answer to it came readily to mind .
心理枪和强度匹配的自动过程经常使 ,一个或多个简单问题的答案可以映射到目标问题上。在某些情况下,替代会发生,一个启发式的答案会被系统2认可。当然,系统2有机会拒绝这个直观的答案,或通过纳入其他信息来修改它。然而,懒惰的系统2往往遵循最小努力的路径,认可启发式的 ,而不去仔细研究它是否真正合适。你不会被难住,你不必非常努力,你甚至可能没有注意到你没有回答所问的问题。 此外,你可能没有意识到目标问题是困难的,因为 对它的直觉答案很容易想到 。
THE 3-D HEURISTIC
三维启发式
Have a look at the picture of the three men and answer the question that follows.
请看一下这三个人的图片 ,并回答下面的问题。

Figure 9
图9
As printed on the page, is the figure on the right larger than the figure on the left?
按照页面上的印刷,右边的数字是否比左边的数字大?
The obvious answer comes quickly to mind: the figure on the right is larger. If you take a ruler to the two figures, however, you will discover that in fact the figures are exactly the same size. Your impression of their relative size is dominated by a powerful illusion, which neatly illustrates the process of substitution.
明显的答案很快出现在脑海中:右边的数字更大。然而,如果你用尺子去量这两个数字,你会发现事实上这两个数字的大小是完全一样的。 你对它们的相对大小的印象被一个强大的幻觉所支配,它巧妙地说明了 ,替代的过程。
The corridor in which the figures are seen is drawn in perspective and appears to go into the depth plane. Your perceptual system automatically interprets the picture as a three-dimensional scene, not as an image printed on a flat paper surface. In the 3-D interpretation, the person on the right is both much farther away and much larger than the person on the left. For most of us, this impression of 3-D size is overwhelming. Only visual artists and experienced photographers have developed the skill of seeing the drawing as an object on the page. For the rest of us, substitution occurs: the dominant impression of 3-D size dictates the judgment of 2-D size. The illusion is due to a 3-D heuristic.
看到人物的走廊是用透视法绘制的,似乎进入了深度平面。你的感知系统会自动 ,将图片解释为一个三维的场景,而不是印在平面纸上的图像。在三维解释中,右边的人既比左边的人远得多又大得多。 对于我们大多数人来说,这种三维尺寸的印象是压倒性的。只有视觉艺术家和有经验的摄影师发展了将绘画作为页面上的物体的技能。对我们其他人来说,发生了替代:三维尺寸的主导印象决定了对二维尺寸的判断。这种错觉是由于3-D启发式的作用。
What happens here is a true illusion, not a misunderstanding of the question. You knew that the question was about the size of the figures in the picture, as printed on the page. If you had been asked to estimate the size of the figures, we know from experiments that your answer would have been in inches, not feet. You were not confused about the question, but you were influenced by the answer to a question that you were not asked: “How tall are the three people?”
这里发生的是真正的错觉,而不是对问题的误解 。你知道问题是关于图片中数字的大小,正如印在页面上的那样。如果让你估计数字的大小,我们从实验中知道,你的答案会是英寸,而不是英尺。你并没有对这个问题感到困惑,但你被一个没有问你的问题的答案所影响:"这三个 人有多高?"
The essential step in the heuristic—the substitution of three-dimensional for two-dimensional size—occurred automatically. The picture contains cues that suggest a 3-D interpretation. These cues are irrelevant to the task at hand—the judgment of size of the figure on the page—and you should have ignored them, but you could not. The bias associated with the heuristic is that objects that appear to be more distant also appear to be larger on the page. As this example illustrates, a judgment that is based on substitution will inevitably be biased in predictable ways. In this case, it happens so deep in the perceptual system that you simply cannot help it.
启发式的关键步骤--用三维尺寸替代二维尺寸--是自动发生的。这些线索与手头的工作无关--判断页面上人物的大小--你应该忽略它们,但你不能忽略。与启发式相关的偏见是, ,看起来更遥远的物体在页面上也显得更大。正如这个例子所说明的,基于替代的判断将不可避免地以可预测的方式出现偏差。 在这种情况下,它发生在知觉系统的深处,你根本无法帮助它。
THE MOOD HEURISTIC FOR HAPPINESS
幸福的心情启发式
A survey of German students is one of the best examples of substitution . The survey that the young participants completed included the following two questions:
一项针对德国学生的调查是 替代的最好例子 之一。 ,年轻的参与者完成的调查包括以下两个问题。
How happy are you these days?
这些天你有多开心?
How many dates did you have last month?
上个月你有多少次约会?
The experimenters were interested in the correlation between the two answers. Would the students who reported many dates say that they were happier than those with fewer dates? Surprisingly, no: the correlation between the answers was about zero. Evidently, dating was not what came first to the students’ minds when they were asked to assess their happiness. Another group of students saw the same two questions, but in reverse order:
实验者对这两个答案之间的相关性很感兴趣。 报告约会次数多的学生会不会说他们比约会次数少的学生更快乐?令人惊讶的是,没有:答案之间的相关性约为零。显然, ,当学生被要求评估他们的幸福时,他们首先想到的不是 。 另一组学生看到了同样的两个问题,但顺序相反。
How many dates did you have last month?
上个月你有多少次约会?
How happy are you these days?
这些天你有多开心?
The results this time were completely different. In this sequence, the correlation between the number of dates and reported happiness was about as high as correlations between psychological measures can get. What happened?
这次的结果完全不同。在这个序列中,约会次数和报告的幸福感之间的相关性大约是 ,这是 心理测量之间的相关性 可以达到的高度。 发生了什么?
The explanation is straightforward, and it is a good example of substitution. Dating was apparently not the center of these students’ life (in the first survey, happiness and dating were uncorrelated), but when they were asked to think about their romantic life, they certainly had an emotional reaction. The students who had many dates were reminded of a happy aspect of their life, while those who had none were reminded of loneliness and rejection. The emotion aroused by the dating question was still on everyone’s mind when the query about general happiness came up.
解释很直接,这是一个很好的替代例子。 约会显然不是这些学生生活的中心(在第一次调查中,幸福和约会是不相关的),但是当他们被要求思考他们的浪漫生活时,他们当然会有情绪反应。有很多约会的学生 ,他们被提醒了他们生活中幸福的一面,而那些没有约会的学生则被提醒了孤独和拒绝。 当关于一般幸福的询问出现时,约会问题所引起的情绪仍然在每个人的脑海中。
The psychology of what happened is precisely analogous to the psychology of the size illusion in figure 9 . “Happiness these days” is not a natural or an easy assessment. A good answer requires a fair amount of thinking. However, the students who had just been asked about their dating did not need to think hard because they already had in their mind an answer to a related question: how happy they were with their love life. They substituted the question to which they had a ready-made answer for the question they were asked.
所发生的事情的心理学正是类似 于图9 中大小错觉的心理学。"最近的幸福 "不是一个自然的 ,也不是一个简单的评估。 一个好的答案需要相当多的思考。然而,刚刚被问到约会问题的学生不需要认真思考,因为他们心中已经有了一个相关问题的答案:他们对自己的爱情生活有多满意。 他们用自己有现成答案的问题替代了被问到的问题。
Here again, as we did for the illusion, we can ask: Are the students confused? Do they really think that the two questions—the one they were asked and the one they answer—are synonymous? Of course not. The students do not temporarily lose their ability to distinguish romantic life from life as a whole. If asked about the two concepts, they would say they are different. But they were not asked whether the concepts are different. They were asked how happy they were, and System 1 has a ready answer.
在这里,正如 ,我们为幻觉所做的那样,我们可以问:学生们感到困惑吗?他们真的认为这两个问题--他们被问到的问题和他们回答的问题--是同义的吗?当然不是。学生们并没有暂时失去区分浪漫生活和整体生活的能力。 如果问及这两个概念,他们会说它们是不同的。但他们没有被问及 ,这些概念是否不同。有人问他们有多高兴,系统1有一个现成的答案。
Dating is not unique. The same pattern is found if a question about the students’ relations with their parents or about their finances immediately precedes the question about general happiness. In both cases, satisfaction in the particular domain dominates happiness reports . Any emotionally significant question that alters a person’s mood will have the same effect. WYSIATI. The present state of mind looms very large when people evaluate their happiness .
约会并不是唯一的。 如果在关于一般幸福感的问题之前立即提出关于学生与父母的关系或关于他们的财务状况的问题,也会发现同样的模式。 在这两种情况下,特定领域的满意度 主导了幸福感的报告 。任何改变人的情绪的情感 重要的问题 ,都会产生同样的效果。 WYSIATI.当人们 评价自己的幸福 时,目前的心理状态非常重要。
THE AFFECT HEURISTIC
情感启发式
The dominance of conclusions over arguments is most pronounced where emotions are involved. The psychologist Paul Slovic has proposed an affect heuristic in which people let their likes and dislikes determine their beliefs about the world. Your political preference determines the arguments that you find compelling. If you like the current health policy, you believe its benefits are substantial and its costs more manageable than the costs of alternatives. If you are a hawk in your attitude toward other nations, you probably think they are relatively weak and likely to submit to your country’s will. If you are a dove, you probably think they are strong and will not be easily coerced. Your emotional attitude to such things as irradiated food, red meat, nuclear power, tattoos, or motorcycles drives your beliefs about their benefits and their risks. If you dislike any of these things, you probably believe that its risks are high and its benefits negligible.
在涉及情感的地方,结论对论据的支配作用最为明显。 心理学家保罗-斯洛维奇提出了一种 情感启发式 ,即人们让自己的喜怒哀乐决定他们对 。 你的政治偏好决定了你认为有说服力的论据。如果你喜欢目前的卫生政策,你相信它的好处是巨大的,而且它的成本比替代方案的成本更容易控制。 如果你对其他国家的态度是鹰派的,你可能认为他们相对软弱,可能会服从你的国家的意志。如果你是鸽子,你可能 ,认为他们很坚强,不会轻易被胁迫。你对诸如辐照食品、红肉、核电、纹身或摩托车的情感态度促使你相信它们的好处和风险。 如果你不喜欢这些东西,你可能认为它的风险很高,好处可以忽略不计。
The primacy of conclusions does not mean that your mind is completely closed and that your opinions are wholly immune to information and sensible reasoning. Your beliefs, and even your emotional attitude, may change (at least a little) when you learn that the risk of an activity you disliked is smaller than you thought. However, the information about lower risks will also change your view of the benefits (for the better) even if nothing was said about benefits in the information you received.
结论至上并不意味着你的思想完全 ,你的观点完全不受信息和理智推理的影响。当你了解到你不喜欢的活动的风险比你想象的要小时,你的信念,甚至你的情感态度都可能改变(至少是一点点)。然而,关于较低风险的信息也会改变你 对益处的看法 (向好的方面发展),即使在 你收到的信息中没有说到益处。
We see here a new side of the “personality” of System 2. Until now I have mostly described it as a more or less acquiescent monitor, which allows considerable leeway to System 1. I have also presented System 2 as active in deliberate memory search, complex computations, comparisons, planning, and choice. In the bat-and-ball problem and in many other examples of the interplay between the two systems, it appeared that System 2 is ultimately in charge, with the ability to resist the suggestions of System 1, slow things down, and impose logical analysis. Self-criticism is one of the functions of System 2. In the context of attitudes, however, System 2 is more of an apologist for the emotions of System 1 than a critic of those emotions—an endorser rather than an enforcer. Its search for information and arguments is mostly constrained to information that is consistent with existing beliefs, not with an intention to examine them. An active, coherence-seeking System 1 suggests solutions to an undemanding System 2.
我们在这里看到了系统2的 "个性 "的新一面。 到目前为止,我大多把它描述为一个或多或少默许的监控者,它允许系统1有相当大的回旋余地。 我也把系统2描述为在故意的记忆搜索、复杂的计算、比较、计划和选择中的活跃者。 在球棒和球的问题中,以及在其他许多关于两个系统之间相互作用的例子中,似乎系统2最终负责,有能力抵制系统1的建议,放慢速度,并强加逻辑分析。 自我批评是系统2的功能之一。 然而,在态度方面,系统2更像是系统1情绪的辩护人,而不是这些情绪的批评者,是认可者而不是 执行者。它对信息和论据的搜索大多局限于与现有信念相一致的信息, ,而不是有意去研究它们。一个积极的、寻求一致性的系统1向一个没有要求的系统2提出解决方案。
SPEAKING OF SUBSTITUTION AND HEURISTICS
说到替代和启发式方法
“Do we still remember the question we are trying to answer? Or have we substituted an easier one?”
"我们还记得我们要回答的问题吗?还是我们用一个更简单的替代了?"
“The question we face is whether this candidate can succeed. The question we seem to answer is whether she interviews well. Let’s not substitute.”
"我们面临的问题是,这位候选人是否能够成功。我们似乎要回答的问题是她的面试是否顺利。我们不要替代。"
“He likes the project, so he thinks its costs are low and its benefits are high. Nice example of the affect heuristic.”
"他喜欢这个项目,所以他认为它的成本很低,效益很高。影响启发式的好例子"。
“We are using last year’s performance as a heuristic to predict the value of the firm several years from now. Is this heuristic good enough? What other information do we need?”
"我们用去年的业绩作为启发式预测几年后公司的价值。这个启发式 ,是否足够好?我们还需要什么其他信息?"
The table below contains a list of features and activities that have been attributed to System 1. Each of the active sentences replaces a statement, technically more accurate but harder to understand, to the effect that a mental event occurs automatically and fast. My hope is that the list of traits will help you develop an intuitive sense of the “personality” of the fictitious System 1. As happens with other characters you know, you will have hunches about what System 1 would do under different circumstances, and most of your hunches will be correct.
下面的 表中列出了被归结为系统1的特征和活动。每个主动句子都取代了一个陈述,技术上更准确但更难理解,大意是一个心理事件自动快速发生。我希望这些特征清单能帮助你培养一种直觉, 虚构的系统1的 "个性"。就像你认识的其他人物一样,你会对系统1在不同情况下会做什么有预感,而且你的预感大多是正确的。
Characteristics of System 1
系统1的特点
generates impressions, feelings, and inclinations; when endorsed by System 2 these become beliefs, attitudes, and intentions
产生印象、感觉和倾向;当被系统2认可时,这些就变成了信念、态度和意图。
operates automatically and quickly, with little or no effort, and no sense of voluntary control
自动操作, ,速度快,几乎不费吹灰之力,也没有自愿控制的感觉
can be programmed by System 2 to mobilize attention when a particular pattern is detected (search)
可以由系统2编程,在检测到特定模式时调动注意力(搜索)。
executes skilled responses and generates skilled intuitions, after adequate training
在经过充分的培训后,执行熟练的反应并产生熟练的直觉
creates a coherent pattern of activated ideas in associative memory
在联想记忆中形成一个连贯的激活思想的模式
links a sense of cognitive ease to illusions of truth, pleasant feelings, and reduced vigilance
将认知上的轻松感与真理的幻觉、愉快的感觉和 ,降低了警惕性。
distinguishes the surprising from the normal
区别于普通人的惊奇
infers and invents causes and intentions
推断和编造原因和意图
neglects ambiguity and suppresses doubt
忽视模糊性,压制疑问
is biased to believe and confirm
是偏向于相信和确认
exaggerates emotional consistency (halo effect)
夸大情感的一致性(光环效应)。
focuses on existing evidence and ignores absent evidence (WYSIATI)
专注于现有的证据,忽略了没有的证据(WYSIATI)。
generates a limited set of basic assessments
产生一套有限的基本评估
represents sets by norms and prototypes, does not integrate
用规范和原型来代表集合,没有整合
matches intensities across scales (e.g., size to loudness)
匹配不同尺度的强度(例如,大小和响度)。
computes more than intended (mental shotgun)
计算超过预期(心理枪)。
sometimes substitutes an easier question for a difficult one (heuristics)
有时用较容易的问题代替困难的问题(启发式)。
is more sensitive to changes than to states (prospect theory) *
对变化比对状态更敏感(前景理论 )* 。
overweights low probabilities *
过分重视低概率*。
shows diminishing sensitivity to quantity (psychophysics) *
显示出对数量的敏感性越来越低(心理物理学 )* 。
responds more strongly to losses than to gains (loss aversion) *
对损失的反应比对收益的反应更强烈(损失厌恶 )* 。
frames decision problems narrowly, in isolation from one another *
将 决策问题狭义地、孤立地放在一起*。
* Feature introduced in detail in part 4 .
* 第四部分 详细介绍了这一 特点 。
10
10
The Law of Small Numbers
小数法则
A study of the incidence of kidney cancer in the 3,141 counties of the United States reveals a remarkable pattern. The counties in which the incidence of kidney cancer is lowest are mostly rural, sparsely populated, and located in traditionally Republican states in the Midwest, the South, and the West. What do you make of this?
对美国3,141个县的肾癌发病率的研究显示了一个显著的模式。肾癌发病率最低的县大多是农村,人口稀少, ,并且位于中西部、南部和西部传统上的共和党州。你对此有何看法?
Your mind has been very active in the last few seconds, and it was mainly a System 2 operation. You deliberately searched memory and formulated hypotheses. Some effort was involved; your pupils dilated, and your heart rate increased measurably. But System 1 was not idle: the operation of System 2 depended on the facts and suggestions retrieved from associative memory. You probably rejected the idea that Republican politics provide protection against kidney cancer. Very likely, you ended up focusing on the fact that the counties with low incidence of cancer are mostly rural. The witty statisticians Howard Wainer and Harris Zwerling, from whom I learned this example, commented, “It is both easy and tempting to infer that their low cancer rates are directly due to the clean living of the rural lifestyle—no air pollution, no water pollution, access to fresh food without additives.” This makes perfect sense.
在过去的几秒钟里,你的思维非常活跃,而且主要是系统2的操作。 你故意搜索记忆并提出假设。 这涉及到一些努力;你的瞳孔扩大,你的心率明显增加。但系统1并没有闲着: ,系统2的操作取决于从联想记忆中检索的事实和建议。 你可能拒绝共和党政治提供保护以防止肾癌的想法。很有可能,你最后关注的是癌症发病率低的县城大多是农村。诙谐的统计学家霍华德-怀纳和哈里斯-兹沃林,我从他们那里学到了这个例子, 。 " 推断他们的低癌症发病率是直接由于农村生活方式的清洁生活--没有空气污染,没有水污染,可以获得没有添加剂的新鲜食物,这既容易又很诱人。"这很有意义。
Now consider the counties in which the incidence of kidney cancer is highest. These ailing counties tend to be mostly rural, sparsely populated, and located in traditionally Republican states in the Midwest, the South, and the West. Tongue-in-cheek, Wainer and Zwerling comment: “It is easy to infer that their high cancer rates might be directly due to the poverty of the rural lifestyle—no access to good medical care, a high-fat diet, and too much alcohol, too much tobacco.” Something is wrong, of course. The rural lifestyle cannot explain both very high and very low incidence of kidney cancer.
现在考虑一下那些肾癌发病率最高的县城。这些生病的县往往大多是农村,人口稀少,位于中西部、南部和西部的传统共和党州 。讽刺的是,Wainer和Zwerling评论说:"很容易推断出他们的高癌症发病率可能是直接由于农村 生活方式的贫困--无法获得良好的医疗服务,高脂肪饮食,以及过多的酒精、过多的烟草"。当然,有些事情是不对的。农村的生活方式不能解释肾癌发病率非常 ,也不能解释肾癌发病率非常低。
The key factor is not that the counties were rural or predominantly Republican. It is that rural counties have small populations. And the main lesson to be learned is not about epidemiology, it is about the difficult relationship between our mind and statistics. System 1 is highly adept in one form of thinking—it automatically and effortlessly identifies causal connections between events, sometimes even when the connection is spurious. When told about the high-incidence counties, you immediately assumed that these counties are different from other counties for a reason, that there must be a cause that explains this difference. As we shall see, however, System 1 is inept when faced with “merely statistical” facts, which change the probability of outcomes but do not cause them to happen.
关键因素不是这些县是农村或以共和党人为主。而要学习的主要课程不是关于流行病学,而是关于我们的思想和统计数据之间的困难关系。系统1非常善于一种思维形式--它自动地、毫不费力地识别出 ,有时甚至是在联系是虚假的情况下。当被告知高发县的情况时,你立即认为这些县与其他县不同是有原因的,一定有一个原因可以解释这种不同。然而,正如我们将看到的,系统1在面对 "仅仅是统计学上的 "事实时是无能的,这些事实改变了结果的概率 ,但并没有导致它们发生。
A random event, by definition, does not lend itself to explanation, but collections of random events do behave in a highly regular fashion. Imagine a large urn filled with marbles. Half the marbles are red, half are white. Next, imagine a very patient person (or a robot) who blindly draws 4 marbles from the urn, records the number of red balls in the sample, throws the balls back into the urn, and then does it all again, many times. If you summarize the results, you will find that the outcome “2 red, 2 white” occurs (almost exactly) 6 times as often as the outcome “4 red” or “4 white.” This relationship is a mathematical fact. You can predict the outcome of repeated sampling from an urn just as confidently as you can predict what will happen if you hit an egg with a hammer. You cannot predict every detail of how the shell will shatter, but you can be sure of the general idea. There is a difference: the satisfying sense of causation that you experience when thinking of a hammer hitting an egg is altogether absent when you think about sampling.
根据定义,随机事件不适合解释,但随机事件的集合确实表现得非常有规律。 想象一个装满弹珠的大瓮,一半弹珠是红色,一半是白色。接下来,想象一个非常有耐心的人(或一个机器人)从瓮中盲目地抽出4颗弹珠,记录 样本中红球的数量,把球扔回瓮中,然后再做一遍,多次。如果你总结一下结果,你会发现,"2红2白 "的结果出现的频率(几乎完全一样)是 "4红 "或 "4白 "的6倍。这种关系是一个数学事实。 你可以预测从骨灰盒中反复取样的结果,就像你可以预测用锤子敲打鸡蛋会发生什么一样 。你无法预测外壳如何破碎的每一个细节,但你可以确定大致的想法。这其中有一个区别:当你想到锤子敲打鸡蛋时,你所体验到的令人满意的因果关系感,在你想到取样时完全没有。
A related statistical fact is relevant to the cancer example. From the same urn, two very patient marble counters take turns. Jack draws 4 marbles on each trial, Jill draws 7. They both record each time they observe a homogeneous sample—all white or all red. If they go on long enough, Jack will observe such extreme outcomes more often than Jill—by a factor of 8 (the expected percentages are 12.5% and 1.56%). Again, no hammer, no causation, but a mathematical fact: samples of 4 marbles yield extreme results more often than samples of 7 marbles do.
一个相关的统计事实与癌症的例子有关。 从同一个瓮中,两个非常 的病人弹珠计数器轮流进行。杰克在每次试验中抽出4颗弹珠,吉尔抽出7颗。他们都记录下每次观察到的同质样本--全部是白色或全部是红色。如果他们持续的时间足够长,杰克将比吉尔更经常地观察到这种极端的结果--8倍(预期的百分比是12.5%和1.56%)。 同样,没有锤子,没有因果关系,但有一个数学事实:4颗弹珠的样本 ,比7颗弹珠的样本更经常地产生极端的结果。
Now imagine the population of the United States as marbles in a giant urn. Some marbles are marked KC, for kidney cancer. You draw samples of marbles and populate each county in turn. Rural samples are smaller than other samples. Just as in the game of Jack and Jill, extreme outcomes (very high and/or very low cancer rates) are most likely to be found in sparsely populated counties. This is all there is to the story.
现在把美国的人口想象成一个巨大瓮中的弹珠。有些弹珠上标有KC,代表肾癌。你抽出弹珠样本,依次填充到每个县。农村样本比 其他样本小。就像在杰克和吉尔的游戏中,极端的结果(非常高和/或非常低的癌症发病率)最 可能出现在人口稀少的县。 这就是故事的全部内容。
We started from a fact that calls for a cause: the incidence of kidney cancer varies widely across counties and the differences are systematic. The explanation I offered is statistical: extreme outcomes (both high and low) are more likely to be found in small than in large samples. This explanation is not causal. The small population of a county neither causes nor prevents cancer; it merely allows the incidence of cancer to be much higher (or much lower) than it is in the larger population. The deeper truth is that there is nothing to explain. The incidence of cancer is not truly lower or higher than normal in a county with a small population, it just appears to be so in a particular year because of an accident of sampling. If we repeat the analysis next year, we will observe the same general pattern of extreme results in the small samples, but the counties where cancer was common last year will not necessarily have a high incidence this year. If this is the case, the differences between dense and rural counties do not really count as facts: they are what scientists call artifacts, observations that are produced entirely by some aspect of the method of research—in this case, by differences in sample size.
我们从一个事实出发,呼唤一个原因:各县的肾癌发病率差异很大,而且这种差异是系统性的。我提供的解释是统计学上的:极端结果(包括高和低)在小样本中比在大样本中更容易发现。这种解释不是因果关系。 一个县的人口少,既不会导致也不会预防癌症;它只是让癌症的发病率比人口多的地方高得多(或低得多)。 更深层次的真相是,没有什么可以解释。在一个人口较少的县,癌症的发病率并不真正低于或高于正常水平,它只是在某一年显得如此,因为 ,一个抽样的意外。如果我们明年重复分析,我们将观察到小样本中同样的极端结果的一般模式,但去年癌症常见的县今年不一定会有高发病率。 如果是这种情况,密集县和农村县之间的差异并不能真正算作事实:它们是科学家所谓的人工制品,观察 ,完全由研究方法的某些方面产生--在这种情况下,由样本大小的差异。
The story I have told may have surprised you, but it was not a revelation. You have long known that the results of large samples deserve more trust than smaller samples, and even people who are innocent of statistical knowledge have heard about this law of large numbers. But “knowing” is not a yes-no affair and you may find that the following statements apply to you:
我讲的这个故事可能让你感到惊讶,但这并不是一个启示。 你早就知道大样本的结果比小样本更值得信任,甚至那些对统计知识一无所知的人也听说过这个大数定律。但 "认识 " ,并不是一件 "是"-"非 "的事情,你可能会发现以下说法对你适用。
The feature “sparsely populated” did not immediately stand out as relevant when you read the epidemiological story.
当你阅读流行病学故事时,"人口稀少 "这一特征并没有立即突出其相关性。
You were at least mildly surprised by the size of the difference between samples of 4 and samples of 7.
你至少对4个样本和7个样本之间的差异大小感到轻微的惊讶。
Even now, you must exert some mental effort to see that the following two statements mean exactly the same thing:
即使是现在,你也必须付出一定的心理努力,才能看到以下两句话的意思 ,完全一样。
Large samples are more precise than small samples.
大样本比小样本更精确。
Small samples yield extreme results more often than large samples do.
小样本比大样本更经常产生极端结果。
The first statement has a clear ring of truth, but until the second version makes intuitive sense, you have not truly understood the first.
第一句话有明显的真理性,但在第二句话有直观的意义之前,你还没有真正理解第一句话。
The bottom line: yes, you did know that the results of large samples are more precise, but you may now realize that you did not know it very well. You are not alone. The first study that Amos and I did together showed that even sophisticated researchers have poor intuitions and a wobbly understanding of sampling effects.
底线:是的,你确实知道大样本的结果更精确,但你现在可能意识到 ,你并没有很好地了解它。 你并不孤单。阿莫斯和我一起做的第一个研究表明,即使是成熟的研究人员也有糟糕的直觉,对抽样效应的理解也是摇摆不定的。
THE LAW OF SMALL NUMBERS
小数法则
My collaboration with Amos in the early 1970s began with a discussion of the claim that people who have had no training in statistics are good “intuitive statisticians.” He told my seminar and me of researchers at the University of Michigan who were generally optimistic about intuitive statistics. I had strong feelings about that claim, which I took personally: I had recently discovered that I was not a good intuitive statistician, and I did not believe that I was worse than others.
我与阿莫斯在1970年代初的合作始于讨论这样的说法:没有受过统计学训练的人是优秀的 "直觉统计学家"。 他告诉我和我的研讨会,密歇根大学的研究人员普遍对直观的统计数据持乐观态度。我对这种说法有强烈的感受,我个人认为:我最近发现我不是一个好的直觉统计学家,我不相信我比别人差。
For a research psychologist, sampling variation is not a curiosity; it is a nuisance and a costly obstacle, which turns the undertaking of every research project into a gamble. Suppose that you wish to confirm the hypothesis that the vocabulary of the average six-year-old girl is larger than the vocabulary of an average boy of the same age. The hypothesis is true in the population; the average vocabulary of girls is indeed larger. Girls and boys vary a great deal, however, and by the luck of the draw you could select a sample in which the difference is inconclusive, or even one in which boys actually score higher. If you are the researcher, this outcome is costly to you because you have wasted time and effort, and failed to confirm a hypothesis that was in fact true. Using a sufficiently large sample is the only way to reduce the risk. Researchers who pick too small a sample leave themselves at the mercy of sampling luck.
对研究心理学家来说,抽样变化不是一种好奇心;它是一种讨厌的 ,也是一种昂贵的障碍,它使每一个研究项目的进行变成了一场赌博。假设你想证实这样一个假设:平均六岁女孩的词汇量比同龄男孩的词汇量大。该假设在人群中是真实的;女孩的平均词汇量确实较大。然而,女孩和男孩的差异很大, ,凭着抽签的运气,你可以选择一个差异不确定的样本,甚至是一个男孩实际上得分更高的样本。如果你是研究者,这个结果对你来说是有代价的,因为你浪费了时间和精力,而且没能证实一个事实上是真的假设。使用足够大的样本是减少风险的唯一方法。研究人员如果挑选太小的 ,就会让自己受制于取样运气。
The risk of error can be estimated for any given sample size by a fairly simple procedure. Traditionally, however, psychologists do not use calculations to decide on a sample size. They use their judgment, which is commonly flawed. An article I had read shortly before the debate with Amos demonstrated the mistake that researchers made (they still do) by a dramatic observation. The author pointed out that psychologists commonly chose samples so small that they exposed themselves to a 50% risk of failing to confirm their true hypotheses! No researcher in his right mind would accept such a risk. A plausible explanation was that psychologists’ decisions about sample size reflected prevalent intuitive misconceptions of the extent of sampling variation.
对于任何给定的样本量,可以通过一个相当简单的程序来估计错误的风险。然而,传统上,心理学家并不使用计算方法来决定样本量。在与阿莫斯辩论前不久,我读了一篇文章,通过一个戏剧性的观察,证明了研究人员所犯的错误(他们 ,现在仍然如此)。作者指出,心理学家通常选择的样本太小,以至于他们将自己暴露在 50%的风险中,无法 证实他们的真实假设!一个合理的解释是,心理学家关于样本大小的决定反映了对抽样程度普遍存在的直觉误解, 变化。
The article shocked me, because it explained some troubles I had had in my own research. Like most research psychologists, I had routinely chosen samples that were too small and had often obtained results that made no sense. Now I knew why: the odd results were actually artifacts of my research method. My mistake was particularly embarrassing because I taught statistics and knew how to compute the sample size that would reduce the risk of failure to an acceptable level. But I had never chosen a sample size by computation. Like my colleagues, I had trusted tradition and my intuition in planning my experiments and had never thought seriously about the issue. When Amos visited the seminar, I had already reached the conclusion that my intuitions were deficient, and in the course of the seminar we quickly agreed that the Michigan optimists were wrong.
这篇文章让我感到震惊,因为它解释了我在自己的研究中遇到的一些问题。 像大多数研究型心理学家一样,我经常选择太小的样本,而且经常得到毫无意义的结果。现在我知道了原因:这些奇怪的结果实际上是我的研究 方法的伪装。我的错误特别令人尴尬,因为我是教统计学的,知道如何 ,以计算出能将失败风险降低到可接受水平的样本量。但我从未通过计算选择过样本量。和我的同事一样,我在计划实验时相信传统和我的直觉,从未认真思考过这个问题。 当阿莫斯访问研讨会时,我已经得出结论,我的直觉是有缺陷的。在研讨会的课程 ,我们很快同意,密歇根乐观主义者是错误的。
Amos and I set out to examine whether I was the only fool or a member of a majority of fools, by testing whether researchers selected for mathematical expertise would make similar mistakes. We developed a questionnaire that described realistic research situations, including replications of successful experiments. It asked the researchers to choose sample sizes, to assess the risks of failure to which their decisions exposed them, and to provide advice to hypothetical graduate students planning their research. Amos collected the responses of a group of sophisticated participants (including authors of two statistical textbooks) at a meeting of the Society of Mathematical Psychology. The results were straightforward: I was not the only fool. Every one of the mistakes I had made was shared by a large majority of our respondents. It was evident that even the experts paid insufficient attention to sample size.
阿莫斯和我着手研究我是唯一的傻瓜还是大多数傻瓜中的一员,方法是测试因数学专长而被选中的研究人员是否会犯类似的错误。 我们制定了一份描述现实研究情况的调查问卷,包括成功实验的复制。它要求 ,研究人员选择样本大小,评估他们的决定使他们面临的失败风险,并向假想的研究生提供规划研究的建议。阿莫斯在数学心理学会的一次会议上收集了一组复杂的参与者(包括两本统计学教科书的作者)的回答。结果很直接: 我不是唯一的傻瓜。我所犯的每一个错误都被大多数受访者所认同。 很明显,即使是专家也没有对样本量给予足够的重视。
Amos and I called our first joint article “Belief in the Law of Small Numbers.” We explained, tongue-in-cheek, that “intuitions about random sampling appear to satisfy the law of small numbers, which asserts that the law of large numbers applies to small numbers as well.” We also included a strongly worded recommendation that researchers regard their “statistical intuitions with proper suspicion and replace impression formation by computation whenever possible.”
阿莫斯和我把我们的第一篇联合文章称为 "相信小数法则"。 我们口若悬河地解释说,"关于随机抽样的直觉似乎满足了小数定律,它断言 ,大数定律也适用于小数。"我们还包括一项措辞强烈的建议,即研究人员应以 适当的怀疑态度 看待他们的 "统计直觉,并尽可能以计算取代印象形成"。
A BIAS OF CONFIDENCE OVER DOUBT
信心大于怀疑的偏向
In a telephone poll of 300 seniors, 60% support the president.
在一项针对300名老年人的电话调查中,60%的人支持总统。
If you had to summarize the message of this sentence in exactly three words, what would they be? Almost certainly you would choose “elderly support president.” These words provide the gist of the story. The omitted details of the poll, that it was done on the phone with a sample of 300, are of no interest in themselves; they provide background information that attracts little attention. Your summary would be the same if the sample size had been different. Of course, a completely absurd number would draw your attention (“a telephone poll of 6 [or 60 million] elderly voters …”). Unless you are a professional, however, you may not react very differently to a sample of 150 and to a sample of 3,000. That is the meaning of the statement that “people are not adequately sensitive to sample size.”
如果你必须用三个词来概括 这个句子的信息,那会是什么?几乎可以肯定,你会选择 "老人支持总统"。这些话提供了故事的要点。 遗漏的民调细节,即民调是在电话中进行的,有300个样本,本身并不令人感兴趣;它们提供的背景信息很少引起注意。如果样本量 ,你的总结也会一样。当然,一个完全荒谬的数字会引起你 ("对6[或6000万]老年选民的电话调查......")。不过,除非你是专业人士,否则你对150个样本和3000个样本的反应可能不会有很大不同。这就是 "人们对样本量不够敏感 "这一说法的含义。
The message about the poll contains information of two kinds: the story and the source of the story. Naturally, you focus on the story rather than on the reliability of the results. When the reliability is obviously low, however, the message will be discredited. If you are told that “a partisan group has conducted a flawed and biased poll to show that the elderly support the president …” you will of course reject the findings of the poll, and they will not become part of what you believe. Instead, the partisan poll and its false results will become a new story about political lies. You can choose to disbelieve a message in such clear-cut cases. But do you discriminate sufficiently between “I read in The New York Times …” and “I heard at the watercooler …”? Can your System 1 distinguish degrees of belief? The principle of WY SIATI suggests that it cannot.
关于投票的信息包含 ,有两种信息:故事和故事的来源。自然,你关注的是故事,而不是结果的可靠性。 然而,当可靠性明显偏低时,信息就会失去信誉。如果有人告诉你,"一个党派团体进行了一项有缺陷和有偏见的民意调查,显示老人支持总统...... "你当然会拒绝 ,他们不会成为你所相信的一部分。相反,这个党派民意调查及其虚假的结果将成为一个关于政治谎言的新故事。 在这种明确的情况下,你可以选择不相信一个消息。 但你是否对 "我在 《纽约时报》 上看到...... "和 "我在饮水机上听到...... "有足够的分辨能力?你的系统1能区分信仰的程度吗? WY SIATI的原则表明,它不能。
As I described earlier, System 1 is not prone to doubt. It suppresses ambiguity and spontaneously constructs stories that are as coherent as possible. Unless the message is immediately negated, the associations that it evokes will spread as if the message were true. System 2 is capable of doubt, because it can maintain incompatible possibilities at the same time. However, sustaining doubt is harder work than sliding into certainty. The law of small numbers is a manifestation of a general bias that favors certainty over doubt, which will turn up in many guises in following chapters.
正如我前面所描述的,系统1不容易产生怀疑。 它抑制模糊性,自发地构建尽可能连贯的故事。 除非信息立即被否定,否则它所唤起的联想会像信息是真的一样传播。系统2有能力进行怀疑,因为它可以同时维持不相容的可能性。 然而,维持怀疑是比滑向确定性更难的工作。小数法则是一种普遍偏见的表现,这种偏见倾向于确定性而不是怀疑,它将在以下各章中以多种形式出现。
The strong bias toward believing that small samples closely resemble the population from which they are drawn is also part of a larger story: we are prone to exaggerate the consistency and coherence of what we see. The exaggerated faith of researchers in what can be learned from a few observations is closely related to the halo effect, the sense we often get that we know and understand a person about whom we actually know very little. System 1 runs ahead of the facts in constructing a rich image on the basis of scraps of evidence. A machine for jumping to conclusions will act as if it believed in the law of small numbers. More generally, it will produce a representation of reality that makes too much sense.
相信小样本与所抽取的人群密切相关的强烈偏见也是一个更大的故事的一部分:我们容易夸大我们所看到的 的一致性和协调性。研究人员对可以从少数观察中了解到的东西的夸大信仰与光环效应密切相关,即我们经常感到我们知道和了解一个人,而实际上我们对他所知甚少。系统1在根据零星的证据构建丰富的形象时跑在了事实的前面。一个跳出结论的机器 ,会表现得好像它相信小数法则一样。更广泛地说,它将产生一种对现实的表述,使之太有意义。
CAUSE AND CHANCE
因缘
The associative machinery seeks causes. The difficulty we have with statistical regularities is that they call for a different approach. Instead of focusing on how the event at hand came to be, the statistical view relates it to what could have happened instead. Nothing in particular caused it to be what it is—chance selected it from among its alternatives.
联想机制寻找原因。 我们对统计规律性的困难在于,它们需要一种不同的方法。统计学的观点不是把重点放在手头的事件是如何发生的 ,而是把它与可能发生的 。没有什么特别的原因导致它成为现在的样子--机会从它的替代品中选择了它。
Our predilection for causal thinking exposes us to serious mistakes in evaluating the randomness of truly random events. For an example, take the sex of six babies born in sequence at a hospital. The sequence of boys and girls is obviously random; the events are independent of each other, and the number of boys and girls who were born in the hospital in the last few hours has no effect whatsoever on the sex of the next baby. Now consider three possible sequences:
我们对因果思维的偏爱使我们在评估真正的随机事件的随机性时犯了严重的错误。 例如,以在一家医院依次出生的六个婴儿的性别为例。男孩和女孩的顺序显然是随机的;这些事件是相互独立的 ,过去几小时内在医院出生的男孩和女孩的数量对下一个婴儿的性别没有任何影响。现在考虑三种可能的序列。
BBBGGG
BBBGG
GGGGGG
GGGGG
BGBBGB
BGBBGB
Are the sequences equally likely? The intuitive answer—“of course not!”—is false. Because the events are independent and because the outcomes B and G are (approximately) equally likely, then any possible sequence of six births is as likely as any other. Even now that you know this conclusion is true, it remains counterintuitive, because only the third sequence appears random. As expected, B GBBGB is judged much more likely than the other two sequences. We are pattern seekers, believers in a coherent world, in which regularities (such as a sequence of six girls) appear not by accident but as a result of mechanical causality or of someone’s intention. We do not expect to see regularity produced by a random process, and when we detect what appears to be a rule, we quickly reject the idea that the process is truly random. Random processes produce many sequences that convince people that the process is not random after all. You can see why assuming causality could have had evolutionary advantages. It is part of the general vigilance that we have inherited from ancestors. We are automatically on the lookout for the possibility that the environment has changed. Lions may appear on the plain at random times, but it would be safer to notice and respond to an apparent increase in the rate of appearance of prides of lions, even if it is actually due to the fluctuations of a random process.
这些序列的可能性相同吗?直观的答案--"当然不是!"--是错误的。因为事件是独立的,而且因为结果B和G的可能性(大约)相同 ,那么任何可能的六次出生的序列都和其他的一样可能。即使现在你知道这个结论是真的,它仍然是反直觉的,因为只有第三个序列是随机的。正如预期的那样,B GBBGB被判断为比其他两个序列更有可能。 我们是模式寻求者,相信一个连贯的世界,其中规律性(如六个女孩的序列)的出现不是 ,而是机械因果关系或某人的意图的结果。我们不期望看到由随机过程产生的规律性,当我们发现似乎是一个规则时,我们很快拒绝这个过程是真正随机的想法。随机过程产生了许多序列,使人们相信这个过程毕竟不是随机的。你可以看到为什么假设因果关系可能有 ,具有进化的优势。这是我们从祖先那里继承的普遍警惕性的一部分。狮子可能在随机的时间出现在平原上,但注意到狮子群出现的速度明显增加并作出反应会更安全,即使它实际上是由于随机过程的波动 。
The widespread misunderstanding of randomness sometimes has significant consequences. In our article on representativeness, Amos and I cited the statistician William Feller, who illustrated the ease with which people see patterns where none exists. During the intensive rocket bombing of London in World War II, it was generally believed that the bombing could not be random because a map of the hits revealed conspicuous gaps. Some suspected that German spies were located in the unharmed areas. A careful statistical analysis revealed that the distribution of hits was typical of a random process—and typical as well in evoking a strong impression that it was not random. “To the untrained eye,” Feller remarks, “randomness appears as regularity or tendency to cluster.”
对随机性的普遍误解有时会产生重大的后果。 在我们关于代表性的文章中,阿莫斯和我引用了统计学家威廉-费勒的话,他说明了人们很容易看到不存在的模式。 在第二次世界大战中对伦敦进行密集的火箭轰炸期间,人们普遍认为轰炸 ,不可能是随机的 ,因为命中的地图显示了明显的差距。仔细的统计分析表明,命中率的分布是典型的随机过程--而且在唤起人们对它不是随机的强烈印象方面也很典型。"对于未经训练的眼睛来说,"费勒说,"随机性看起来是有规律的,或有集群的倾向。"
I soon had an occasion to apply what I had learned from Feller. The Yom Kippur War broke out in 1973, and my only significant contribution to the war effort was to advise high officers in the Israeli Air Force to stop an investigation. The air war initially went quite badly for Israel, because of the unexpectedly good performance of Egyptian ground-to-air missiles. Losses were high, and they appeared to be unevenly distributed. I was told of two squadrons flying from the same base, one of which had lost four planes while the other had lost none. An inquiry was initiated in the hope of learning what it was that the unfortunate squadron was doing wrong. There was no prior reason to believe that one of the squadrons was more effective than the other, and no operational differences were found, but of course the lives of the pilots differed in many random ways, including, as I recall, how often they went home between missions and something about the conduct of debriefings. My advice was that the command should accept that the different outcomes were due to blind luck, and that the interviewing of the pilots should stop. I reasoned that luck was the most likely answer, that a random search for a nonobvious cause was hopeless, and that in the meantime the pilots in the squadron that had sustained losses did not need the extra burden of being made to feel that they and their dead friends were at fault.
我很快就有机会应用我从费勒那里学到的东西。赎罪日战争在1973年爆发,我对战争的唯一重大贡献是建议以色列空军的高级官员停止调查。 空战最初对以色列相当不利,因为埃及的地对空导弹的性能出乎意料地好。损失很高,而且它们似乎 ,分布不均。我被告知有两个中队从同一个基地起飞,其中一个损失了四架飞机,而另一个却没有损失。一项调查被启动,希望了解这个不幸的中队到底做错了什么。事先没有理由相信其中一个中队比另一个中队更有效,也没有发现行动上的差异, ,但当然,飞行员的生活在许多随机的方面是不同的,包括,我记得,他们在任务之间回家的频率以及关于进行汇报的东西。我的建议是,指挥部应该接受不同的结果是由盲目的运气造成的,而对飞行员的采访应该停止。我推断,运气是最有可能的答案,随机 ,寻找一个不明显的原因是没有希望的,同时,中队中遭受损失的飞行员不需要有额外的负担,让他们觉得他们和他们死去的朋友都有错。
Some years later, Amos and his students Tom Gilovich and Robert Vallone caused a stir with their study of misperceptions of randomness in basketball . The “fact” that players occasionally acquire a hot hand is generally accepted by players, coaches, and fans. The inference is irresistible: a player sinks three or four baskets in a row and you cannot help forming the causal judgment that this player is now hot, with a temporarily increased propensity to score. Players on both teams adapt to this judgment—teammates are more likely to pass to the hot scorer and the defense is more likely to double-team. Analysis of thousands of sequences of shots led to a disappointing conclusion: there is no such thing as a hot hand in professional basketball, either in shooting from the field or scoring from the foul line. Of course, some players are more accurate than others, but the sequence of successes and missed shots satisfies all tests of randomness. The hot hand is entirely in the eye of the beholders, who are consistently too quick to perceive order and causality in randomness. The hot hand is a massive and widespread cognitive illusion.
几年后,阿莫斯和他的学生汤姆-吉洛维奇(Tom Gilovich)和罗伯特-瓦隆(Robert Vallone)对 篮球运动中随机性 的错误认识的研究引起了轰动。球员 ,偶尔会获得一手好牌,这个 "事实 "被球员、教练和球迷普遍接受。这种推论是不可抗拒的:一个球员连续投进三或四个篮子,你不能不形成因果判断,即这个球员现在很热,得分的倾向暂时增加。两队的球员都会适应这种判断--队友们更有可能传给热门的得分手,而防守方 ,更有可能进行双人防守。对数以千计的投篮序列的分析得出了一个令人失望的结论:在职业篮球中没有所谓的热手,无论是在外地投篮还是在罚球线上得分。当然,有些球员比其他球员更准确,但成功和失误的顺序满足所有的随机性测试。热手完全是 ,在 ,他们总是太快地感知到随机性中的秩序和因果关系。热手是一个大规模和广泛的认知错觉。
The public reaction to this research is part of the story. The finding was picked up by the press because of its surprising conclusion, and the general response was disbelief. When the celebrated coach of the Boston Celtics, Red Auerbach, heard of Gilovich and his study, he responded, “Who is this guy? So he makes a study. I couldn’t care less.” The tendency to see patterns in randomness is overwhelming—certainly more impressive than a guy making a study.
公众对这项研究的反应是故事的一部分。 这项发现因为其令人惊讶的结论而被媒体报道,普遍的反应是不相信。当波士顿凯尔特人队的著名教练,红 ,奥尔巴赫听说了吉洛维奇和他的研究,他回应说:"这家伙是谁?所以他做了一个研究。我不可能不在乎。"在随机性中看到模式的趋势是压倒性的--肯定比一个人做研究更令人印象深刻。
The illusion of pattern affects our lives in many ways off the basketball court. How many good years should you wait before concluding that an investment adviser is unusually skilled? How many successful acquisitions should be needed for a board of directors to believe that the CEO has extraordinary flair for such deals? The simple answer to these questions is that if you follow your intuition, you will more often than not err by misclassifying a random event as systematic. We are far too willing to reject the belief that much of what we see in life is random.
模式的错觉在篮球场以外的许多方面影响着我们的生活。在断定一个投资顾问是 异常熟练之前,你应该等待多少个好年头?需要多少次成功的收购才能让董事会相信首席执行官在此类交易中具有非凡的天赋?对这些问题的简单回答是,如果你跟随你的直觉,你会更经常地犯错,把随机事件错误地归类为系统性事件。 我们太愿意拒绝相信我们在生活中看到的许多东西是随机的。
I began this chapter with the example of cancer incidence across the United States. The example appears in a book intended for statistics teachers, but I learned about it from an amusing article by the two statisticians I quoted earlier, Howard Wainer and Harris Zwerling. Their essay focused on a large investment, some $1.7 billion, which the Gates Foundation made to follow up intriguing findings on the characteristics of the most successful schools. Many researchers have sought the secret of successful education by identifying the most successful schools in the hope of discovering what distinguishes them from others. One of the conclusions of this research is that the most successful schools, on average, are small. In a survey of 1,662 schools in Pennsylvania, for instance, 6 of the top 50 were small, which is an overrepresentation by a factor of 4. These data encouraged the Gates Foundation to make a substantial investment in the creation of small schools, sometimes by splitting large schools into smaller units. At least half a dozen other prominent institutions, such as the Annenberg Foundation and the Pew Charitable Trust, joined the effort, as did the U.S. Department of Education’s Smaller Learning Communities Program.
我以全美癌症发病率的例子开始这一章。 这个例子出现在一本为统计学教师准备的书中,但我是从我之前引用的两位统计学家Howard Wainer和Harris Zwerling的一篇有趣的文章中得知的。他们的文章集中在一项巨大的投资上,大约17亿美元,盖茨基金会为跟进耐人寻味的发现, 关于最成功学校的特点。许多研究人员通过识别最成功的学校来寻找成功教育的秘密,希望发现它们与其他学校的区别。这项研究的结论之一是,最成功的学校,平均来说,是小规模的。例如,在对宾夕法尼亚州1662所学校的调查中,排名前50位的 ,其中6所是小型学校,占比高达4倍。这些数据鼓励盖茨基金会为创建小型学校进行大量投资,有时将大型学校拆分成更小的单位。至少有半打其他知名机构,如安纳伯格基金会和皮尤慈善基金会,以及美国教育部的 小型学习社区计划,也加入了这一努力。
This probably makes intuitive sense to you. It is easy to construct a causal story that explains how small schools are able to provide superior education and thus produce high-achieving scholars by giving them more personal attention and encouragement than they could get in larger schools. Unfortunately, the causal analysis is pointless because the facts are wrong. If the statisticians who reported to the Gates Foundation had asked about the characteristics of the worst schools, they would have found that bad schools also tend to be smaller than average. The truth is that small schools are not better on average; they are simply more variable. If anything, say Wainer and Zwerling, large schools tend to produce better results, especially in higher grades where a variety of curricular options is valuable.
这对你来说可能有直观的意义。很容易构建一个因果关系的故事,解释小学校如何能够提供优越的教育,从而通过给予他们比大学校更多的个人关注和鼓励,培养出成绩优异的学者。 ,不幸的是,因果分析是毫无意义的,因为事实 是错误的。如果向盖茨基金会报告的统计学家询问了最差学校的特点,他们会发现,坏学校也往往比平均水平小。事实是,小学校平均来说并不是更好;它们只是更有变化。 如果说有什么变化的话,韦纳和兹沃林说,大学校往往能产生更好的结果,特别是在高年级 ,那里的各种课程选择很有价值。
Thanks to recent advances in cognitive psychology, we can now see clearly what Amos and I could only glimpse: the law of small numbers is part of two larger stories about the workings of the mind.
由于认知心理学的最新进展,我们现在可以清楚地看到阿莫斯和我只能瞥见的东西:小数法则是关于心灵运作的两个更大故事的一部分。
The exaggerated faith in small samples is only one example of a more general illusion—we pay more attention to the content of messages than to information about their reliability, and as a result end up with a view of the world around us that is simpler and more coherent than the data justify. Jumping to conclusions is a safer sport in the world of our imagination than it is in reality.
对小样本的夸大信仰只是一个更普遍的幻觉的例子--我们更关注信息的内容,而不是关于其可靠性的信息 ,结果是我们对周围世界的看法比数据更简单、更一致。在我们的想象世界中,跳出结论是一项比现实中更安全的运动。
Statistics produce many observations that appear to beg for causal explanations but do not lend themselves to such explanations. Many facts of the world are due to chance, including accidents of sampling. Causal explanations of chance events are inevitably wrong.
统计学产生了许多似乎需要因果解释的观察结果,但并不适合这种解释。世界上的许多事实是由于 ,包括抽样的意外。 对偶然事件的因果解释不可避免地是错误的。
SPEAKING OF THE LAW OF SMALL NUMBERS
说到小数法则
“Yes, the studio has had three successful films since the new CEO took over. But it is too early to declare he has a hot hand.”
"是的,自新任CEO接手以来,工作室已经有三部成功的电影。但现在就宣布他手感火热还为时过早。"
“I won’t believe that the new trader is a genius before consulting a statistician who could estimate the likelihood of his streak being a chance event.”
"在咨询统计学家之前,我不会相信新的交易员是个天才,因为统计学家可以估计他的连胜 ,是一个偶然事件的可能性。"
“The sample of observations is too small to make any inferences. Let’s not follow the law of small numbers.”
"观察的样本太小,无法做出任何推论。 我们不要遵循小数法则。"
“I plan to keep the results of the experiment secret until we have a sufficiently large sample. Otherwise we will face pressure to reach a conclusion prematurely.”
"我计划对实验结果保密,直到我们有足够大的样本。否则,我们将面临过早达成结论的压力。"
11
11
Anchors
锚点
Amos and I once rigged a wheel of fortune. It was marked from 0 to 100, but we had it built so that it would stop only at 10 or 65. We recruited students of the University of Oregon as participants in our experiment. One of us would stand in front of a small group, spin the wheel, and ask them to write down the number on which the wheel stopped, which of course was either 10 or 65. We then asked them two questions:
阿莫斯和我曾经操纵了一个幸运轮。它的标记从0到100,但我们把它造得只停在10或65。 我们招募了俄勒冈大学的学生作为我们实验的参与者。我们中的一个 ,站在一个小组面前,转动轮子,并要求他们写下轮子停在的数字,当然是10或65。 然后我们问他们两个问题。
Is the percentage of African nations among UN members larger or smaller than the number you just wrote?
非洲国家在联合国成员中的比例是比你刚才写的数字大还是小?
What is your best guess of the percentage of African nations in the UN?
你对非洲国家在联合国中的比例的最佳猜测是什么?
The spin of a wheel of fortune—even one that is not rigged—cannot possibly yield useful information about anything, and the participants in our experiment should simply have ignored it. But they did not ignore it. The average estimates of those who saw 10 and 65 were 25% and 45%, respectively.
转轮 ,即使是没有被操纵的,也不可能产生任何有用的信息,我们实验中的参与者应该直接忽略它。但他们并没有忽视它。看到10和65的人的平均估计分别为25%和45%。
The phenomenon we were studying is so common and so important in the everyday world that you should know its name: it is an anchoring effect . It occurs when people consider a particular value for an unknown quantity before estimating that quantity. What happens is one of the most reliable and robust results of experimental psychology: the estimates stay close to the number that people considered—hence the image of an anchor. If you are asked whether Gandhi was more than 114 years old when he died you will end up with a much higher estimate of his age at death than you would if the anchoring question referred to death at 35. If you consider how much you should pay for a house, you will be influenced by the asking price. The same house will appear more valuable if its listing price is high than if it is low, even if you are determined to resist the influence of this number; and so on—the list of anchoring effects is endless. Any number that you are asked to consider as a possible solution to an estimation problem will induce an anchoring effect.
我们所研究的现象在日常生活中是如此普遍和重要,以至于你应该知道它的名字:它是 锚定效应 。它发生在人们在估计一个未知的数量之前考虑一个特定的数值。所发生的是实验心理学中最可靠和稳健的结果之一:估计值保持在接近人们考虑的数字--因此有了锚定的形象。 如果你被问及甘地死时是否超过114岁,你最终会 ,对其死亡年龄的估计要比锚定问题中提到的35岁死亡高得多。如果你考虑应该为一个房子支付多少钱,你会受到要价的影响。同样的房子,如果它的挂牌价高,就会显得比它的挂牌价低更有价值,即使你决心抵制这个数字的影响;等等--锚定 影响的清单是无穷的。任何要求你考虑作为估计问题的可能解决方案的数字都会引起锚定效应。
We were not the first to observe the effects of anchors, but our experiment was the first demonstration of its absurdity: people’s judgments were influenced by an obviously uninformative number. There was no way to describe the anchoring effect of a wheel of fortune as reasonable. Amos and I published the experiment in our Science paper, and it is one of the best known of the findings we reported there.
我们不是第一个观察到锚的影响的人,但我们的实验是第一个证明其荒谬性的实验:人们的判断被一个明显无信息的数字所影响。没有办法把幸运轮 的锚定效应描述得那么合理。阿莫斯和我在《 科学》 杂志上发表了这个实验,这是我们在那里报告的最著名的发现之一。
There was only one trouble: Amos and I did not fully agree on the psychology of the anchoring effect. He supported one interpretation, I liked another, and we never found a way to settle the argument. The problem was finally solved decades later by the efforts of numerous investigators. It is now clear that Amos and I were both right. Two different mechanisms produce anchoring effects—one for each system. There is a form of anchoring that occurs in a deliberate process of adjustment, an operation of System 2. And there is anchoring that occurs by a priming effect, an automatic manifestation of System 1.
只有一个问题:阿莫斯和我在锚定效应的心理学上并不完全一致。 他支持一种解释,我喜欢另一种解释,我们从未找到解决争论的方法。几十年后,在众多调查人员的努力下,这个问题终于得到了解决 。 现在很清楚,阿莫斯和我都是对的。两种不同的机制产生锚定效应--每个系统都有。有一种锚定发生在一个有意的调整过程中,是系统2的操作。 还有一种锚定是通过引爆效应发生的,是系统1的自动表现形式。
ANCHORING AS ADJUSTMENT
锚定即调整
Amos liked the idea of an adjust-and-anchor heuristic as a strategy for estimating uncertain quantities: start from an anchoring number, assess whether it is too high or too low, and gradually adjust your estimate by mentally “moving” from the anchor. The adjustment typically ends prematurely, because people stop when they are no longer certain that they should move farther. Decades after our disagreement, and years after Amos’s death, convincing evidence of such a process was offered independently by two psychologists who had worked closely with Amos early in their careers: Eldar Shafir and Tom Gilovich together with their own students—Amos’s intellectual grandchildren!
Amos喜欢 "调整-锚定 "启发式的想法,作为估计不确定数量的策略:从一个锚定的数字开始,评估它是否过高或过低,并通过心理上的 "移动 "来逐渐调整你的估计。调整通常会过早结束,因为人们在不再确定自己应该进一步发展时就会停止。在我们的分歧数十年后, ,在阿莫斯去世多年后,两位在职业生涯早期与阿莫斯密切合作的心理学家独立提供了这种过程的令人信服的证据:埃尔德-沙菲尔和汤姆-吉洛维奇与他们自己的学生--阿莫斯的智力孙子一起!这就是阿莫斯。
To get the idea, take a sheet of paper and draw a 2½-inch line going up, starting at the bottom of the page—without a ruler. Now take another sheet, and start at the top and draw a line going down until it is 2½ inches from the bottom. Compare the lines. There is a good chance that your first estimate of 2½ inches was shorter than the second. The reason is that you do not know exactly what such a line looks like; there is a range of uncertainty. You stop near the bottom of the region of uncertainty when you start from the bottom of the page and near the top of the region when you start from the top. Robyn Le Boeuf and Shafir found many examples of that mechanism in daily experience. Insufficient adjustment neatly explains why you are likely to drive too fast when you come off the highway onto city streets—especially if you are talking with someone as you drive. Insufficient adjustment is also a source of tension between exasperated parents and teenagers who enjoy loud music in their room. Le Boeuf and Shafir note that a “well-intentioned child who turns down exceptionally loud music to meet a parent’s demand that it be played at a ‘reasonable’ volume may fail to adjust sufficiently from a high anchor, and may feel that genuine attempts at compromise are being overlooked.” The driver and the child both deliberately adjust down, and both fail to adjust enough.
要了解这个想法,可以拿一张纸,从页面底部开始,画一条2.5英寸的线,向上延伸--不用尺子。 现在再拿一张纸,从顶部开始,往下画一条线,直到离底部2.5英寸 。比较一下这些线条。很有可能你第一次估计的2.5英寸比第二次短。原因是你不知道这样的线到底是什么样子的;有一个不确定性的范围。当你从 页面底部开始时,你在不确定区域的底部附近停止,而当你从顶部开始时,你在该区域的顶部附近停止。罗宾-勒布夫和沙菲尔在日常经验中发现了许多这种机制的例子。调整不足很好地解释了为什么当你从高速公路进入城市街道时,你很可能开得太快--特别是如果你在开车时与人交谈。调整不足也是紧张的根源, ,在气急败坏的父母和喜欢在房间里大声音乐的青少年之间。Le Boeuf和Shafir指出,"一个善意的孩子如果为了满足父母的要求,把特别大的音乐音量调低到 '合理'的音量 ,可能无法从一个高的锚上充分调整,并可能感到真正的妥协尝试被忽视了。"司机和孩子都 ,故意往下调,而且都没有调够。
Now consider these questions:
现在考虑这些问题。
When did George Washington become president?
乔治-华盛顿何时成为总统?
What is the boiling temperature of water at the top of Mount Everest?
珠穆朗玛峰顶的水的沸腾温度是多少?
The first thing that happens when you consider each of these questions is that an anchor comes to your mind, and you know both that it is wrong and the direction of the correct answer. You know immediately that George Washington became president after 1776, and you also know that the boiling temperature of water at the top of Mount Everest is lower than 100°C. You have to adjust in the appropriate direction by finding arguments to move away from the anchor. As in the case of the lines, you are likely to stop when you are no longer sure you should go farther—at the near edge of the region of uncertainty.
当你考虑这些问题的时候,首先会有一个锚出现在你的脑海中,你既知道它是错误的,也知道正确答案的方向。你 ,立即知道乔治-华盛顿在1776年后成为总统,你还知道珠穆朗玛峰顶的水的沸腾温度低于100℃。你必须通过寻找论据来远离锚,向适当的方向调整。就像线条的情况一样,当你不再确定你应该走得更远时,你很可能会停下来--在 不确定区域的近边缘。
Nick Epley and Tom Gilovich found evidence that adjustment is a deliberate attempt to find reasons to move away from the anchor: people who are instructed to shake their head when they hear the anchor, as if they rejected it, move farther from the anchor, and people who nod their head show enhanced anchoring. Epley and Gilovich also confirmed that adjustment is an effortful operation. People adjust less ( stay closer to the anchor ) when their mental resources are depleted, either because their memory is loaded with digits or because they are slightly drunk. Insufficient adjustment is a failure of a weak or lazy System 2.
尼克-埃普利和汤姆-吉洛维奇发现有证据表明,调整是一种刻意寻找远离锚点的理由:当听到锚点时,被指示摇头的人,就像拒绝它一样,会离锚点更远,而 点头 的人则表现出加强锚点。 当人们的精神资源被耗尽时,他们的调整就会减少 (更接近锚点 ),这是因为他们的记忆中充满了 数字,或者是因为他们略微喝醉了。调整不充分是软弱或懒惰的系统2的失败。
So we now know that Amos was right for at least some cases of anchoring, which involve a deliberate System 2 adjustment in a specified direction from an anchor.
因此,我们现在知道,至少在某些锚定的情况下,阿莫斯是正确的,这涉及到在指定的方向上故意进行系统2的调整, ,从一个锚。
ANCHORING AS PRIMING EFFECT
锚定作为启动效应
When Amos and I debated anchoring, I agreed that adjustment sometimes occurs, but I was uneasy. Adjustment is a deliberate and conscious activity, but in most cases of anchoring there is no corresponding subjective experience. Consider these two questions:
当阿莫斯和我辩论抛锚时,我同意有时会发生调整,但我感到不安。调整是一种有意的、有意识的活动,但在大多数情况下,锚定没有相应的主观体验。 考虑这两个问题。
Was Gandhi more or less than 144 years old when he died?
甘地去世时,他的年龄是大于还是小于144岁?
How old was Gandhi when he died?
甘地去世时是多少岁?
Did you produce your estimate by adjusting down from 144? Probably not, but the absurdly high number still affected your estimate. My hunch was that anchoring is a case of suggestion. This is the word we use when someone causes us to see, hear, or feel something by merely bringing it to mind. For example, the question “Do you now feel a slight numbness in your left leg?” always prompts quite a few people to report that their left leg does indeed feel a little strange.
你的估计是通过从144岁开始向下调整而得出的吗?可能不是,但这个荒唐的数字还是影响了你的估计。我的直觉是,锚定是一种暗示的情况。 这是我们使用的一个词,当有人通过仅仅使我们看到、听到或感觉到某些东西时。 例如,问题 "你现在觉得你的左腿有点麻木吗?"总是促使相当多的 ,报告他们的左腿确实感觉有点奇怪。
Amos was more conservative than I was about hunches, and he correctly pointed out that appealing to suggestion did not help us understand anchoring, because we did not know how to explain suggestion. I had to agree that he was right, but I never became enthusiastic about the idea of insufficient adjustment as the sole cause of anchoring effects. We conducted many inconclusive experiments in an effort to understand anchoring, but we failed and eventually gave up the idea of writing more about it.
阿莫斯对预感比我更保守,他正确地指出,诉诸暗示并不能帮助我们理解锚定,因为我们不知道如何解释暗示。我不得不同意他是对的,但我从未热衷于把调整不足作为锚定效应的唯一原因 。 我们进行了许多没有结果的实验,努力理解锚定,但我们失败了,最终放弃了写更多关于它的想法。
The puzzle that defeated us is now solved, because the concept of suggestion is no longer obscure: suggestion is a priming effect, which selectively evokes compatible evidence. You did not believe for a moment that Gandhi lived for 144 years, but your associative machinery surely generated an impression of a very ancient person. System 1 understands sentences by trying to make them true, and the selective activation of compatible thoughts produces a family of systematic errors that make us gullible and prone to believe too strongly whatever we believe. We can now see why Amos and I did not realize that there were two types of anchoring: the research techniques and theoretical ideas we needed did not yet exist. They were developed, much later, by other people. A process that resembles suggestion is indeed at work in many situations: System 1 tries its best to construct a world in which the anchor is the true number. This is one of the manifestations of associative coherence that I described in the first part of the book.
打败我们的难题现在解决了,因为暗示的概念不再晦涩难懂:暗示是一种启动效应,它有选择地唤起相容的证据。你没有一刻相信甘地 ,活了144年,但你的联想机器肯定产生了一个非常古老的人的印象。系统1通过试图使句子成为真实来理解句子,而兼容思想的选择性激活产生了一系列系统性错误,使我们容易轻信,容易过于相信我们所相信的东西。我们现在可以看到为什么阿莫斯和我没有意识到有 两种锚定:我们需要的研究技术 和理论思想还不存在。一个类似于暗示的过程确实在许多情况下起作用:系统1尽力构建一个世界,在这个世界里,锚是真正的数字。这是我在本书的第一部分 中描述的 联想一致性 的表现之一。
The German psychologists Thomas Mussweiler and Fritz Strack offered the most compelling demonstrations of the role of associative coherence in anchoring. In one experiment, they asked an anchoring question about temperature: “Is the annual mean temperature in Germany higher or lower than 20°C (68°F)?” or “Is the annual mean temperature in Germany higher or lower than 5°C (41°F)?”
德国心理学家Thomas Mussweiler和Fritz Strack对联想一致性在锚定中的作用进行了最令人信服的证明。在一个实验中,他们提出了一个关于温度的锚定问题:"德国的年平均温度是高于还是低于20°C(68°F)?"或者 "德国的年平均温度是高于还是低于5°C(41°F)?"
All participants were then briefly shown words that they were asked to identify. The researchers found that 68°F made it easier to recognize summer words (like sun and beach ), and 40°F facilitated winter words (like frost and ski ). The selective activation of compatible memories explains anchoring: the high and the low numbers activate different sets of ideas in memory. The estimates of annual temperature draw on these biased samples of ideas and are therefore biased as well. In another elegant study in the same vein, participants were asked about the average price of German cars. A high anchor selectively primed the names of luxury brands (Mercedes, Audi), whereas the low anchor primed brands associated with mass-market cars (Volkswagen). We saw earlier that any prime will tend to evoke information that is compatible with it. Suggestion and anchoring are both explained by the same automatic operation of System 1. Although I did not know how to prove it at the time, my hunch about the link between anchoring and suggestion turned out to be correct.
然后,所有参与者都被简要地展示了他们被要求识别的单词。研究人员发现,68华氏度使人们更容易识别夏季词汇(如 太阳 和 海滩 ),而40华氏度则有利于识别冬季词汇(如 霜冻 和 滑雪 )。 兼容记忆的选择性激活解释了锚定:高数字和低数字激活了记忆中的不同概念集。 年度温度的估计值借鉴了这些有偏见的想法样本,因此也是有偏见的。在另一项同样的优雅研究中,参与者被问及德国汽车的平均价格。一个高锚点选择性地激发了豪华品牌的名称(奔驰、奥迪),而低锚点则激发了与大众汽车相关的品牌(大众)。我们在前面看到,任何锚点都会倾向于唤起与之相匹配的 。 暗示和锚点都是由系统1的相同自动操作来解释的。虽然我当时不知道如何证明,但我关于锚点和暗示之间联系的预感证明是正确的。
THE ANCHORING INDEX
锚定指数
Many psychological phenomena can be demonstrated experimentally, but few can actually be measured. The effect of anchors is an exception. Anchoring can be measured, and it is an impressively large effect. Some visitors at the San Francisco Exploratorium were asked the following two questions:
许多心理现象可以通过实验来证明,但很少有人能真正测量出来。锚定的效果 是一个例外。锚定可以被测量,而且是一个令人印象深刻的大效果。 在 旧金山探索馆 的一些游客被问到以下两个问题。
Is the height of the tallest redwood more or less than 1,200 feet?
最高的红木的高度是超过还是低于1200英尺?
What is your best guess about the height of the tallest redwood?
你对最高的红木的高度的最佳猜测是什么?
The “high anchor” in this experiment was 1,200 feet. For other participants, the first question referred to a “low anchor” of 180 feet. The difference between the two anchors was 1,020 feet.
这个实验中的 "高锚 "是1200英尺。对于其他参与者, ,第一个问题是指180英尺的 "低锚"。两个锚之间的差距是1020英尺。
As expected, the two groups produced very different mean estimates: 844 and 282 feet. The difference between them was 562 feet. The anchoring index is simply the ratio of the two differences (562/1,020) expressed as a percentage: 55%. The anchoring measure would be 100% for people who slavishly adopt the anchor as an estimate, and zero for people who are able to ignore the anchor altogether. The value of 55% that was observed in this example is typical. Similar values have been observed in numerous other problems.
正如 ,两组产生了非常不同的平均估计数:844和282英尺。他们之间的差距是562英尺。锚定指数只是两个差异的比率(562/1,020),以百分比表示:55%。锚定措施对于那些一味采用锚定作为估计的人 ,将是100%,而对于那些能够完全忽略锚定的人,则是零。本例中观察到的55%的数值是典型的。在许多其他问题中也观察到类似的数值。
The anchoring effect is not a laboratory curiosity; it can be just as strong in the real world. In an experiment conducted some years ago, real-estate agents were given an opportunity to assess the value of a house that was actually on the market. They visited the house and studied a comprehensive booklet of information that included an asking price. Half the agents saw an asking price that was substantially higher than the listed price of the house; the other half saw an asking price that was substantially lower . Each agent gave her opinion about a reasonable buying price for the house and the lowest price at which she would agree to sell the house if she owned it. The agents were then asked about the factors that had affected their judgment. Remarkably, the asking price was not one of these factors; the agents took pride in their ability to ignore it. They insisted that the listing price had no effect on their responses, but they were wrong: the anchoring effect was 41%. Indeed, the professionals were almost as susceptible to anchoring effects as business school students with no real-estate experience, whose anchoring index was 48%. The only difference between the two groups was that the students conceded that they were influenced by the anchor, while the professionals denied that influence.
锚定效应并不是实验室里的好奇心;它在现实世界中也同样强大。在几年前进行的一项实验中,房地产经纪人 ,有机会评估一栋实际在市场上的房子的价值。他们参观了这栋房子,研究了包括要价在内的综合信息手册。一半中介看到的要价大大高于房屋的挂牌价;另一半中介看到的要价 大大低于 挂牌价。每个中介都给出了自己的意见, ,这是该房屋的合理买价,也是如果她拥有该房屋她会同意出售的最低价格。值得注意的是,要价并不是这些因素之一;经纪人为自己能够忽略它而感到自豪。他们坚持认为上市价格对他们的反应没有影响,但他们错了: ,锚定效应是41%。事实上,专业人士几乎和没有房地产经验的商学院学生一样容易受到锚定效应的影响,他们的锚定指数为48%。 两组之间的唯一区别是,学生承认他们受到了锚定的影响,而专业人士则否认这种影响。
Powerful anchoring effects are found in decisions that people make about money, such as when they choose how much to contribute to a cause. To demonstrate this effect, we told participants in the Exploratorium study about the environmental damage caused by oil tankers in the Pacific Ocean and asked about their willingness to make an annual contribution “to save 50,000 offshore Pacific Coast seabirds from small offshore oil spills, until ways are found to prevent spills or require tanker owners to pay for the operation.” This question requires intensity matching: the respondents are asked, in effect, to find the dollar amount of a contribution that matches the intensity of their feelings about the plight of the seabirds. Some of the visitors were first asked an anchoring question, such as, “Would you be willing to pay $5 …,” before the point-blank question of how much they would contribute.
强大的锚定效应在人们关于金钱的决定中被发现,例如当他们选择为某项事业捐款多少时。 为了证明这种效应,我们告诉探索馆研究的参与者关于油轮在太平洋造成的环境破坏,并询问他们是否愿意每年捐款,"以拯救50,000只太平洋沿岸的近海海鸟,使其免受小型近海石油泄漏的影响,直到找到防止泄漏的方法或要求油轮所有者支付操作费用"。这个问题需要强度匹配:实际上是要求受访者找到与他们对海鸟困境的感受强度相匹配的捐款金额。一些来访者首先被问到一个锚定问题,如 "你是否愿意支付5美元......",然后 ,问他们会捐多少钱。
When no anchor was mentioned, the visitors at the Exploratorium—generally an environmentally sensitive crowd—said they were willing to pay $64, on average. When the anchoring amount was only $5, contributions averaged $20. When the anchor was a rather extravagant $400, the willingness to pay rose to an average of $143.
当 ,没有提到锚时,探索馆的游客--通常是对环境敏感的人群--说他们平均愿意支付64美元。当锚的金额只有5美元时,捐款平均为20美元。当锚是相当奢侈的400美元时,支付意愿上升到平均143美元。
The difference between the high-anchor and low-anchor groups was $123. The anchoring effect was above 30%, indicating that increasing the initial request by $100 brought a return of $30 in average willingness to pay.
高锚定组和低锚定组之间的差异为123美元。锚定效应超过30%,表明增加100美元的初始要求带来了30美元的平均支付意愿的回报。
Similar or even larger anchoring effects have been obtained in numerous studies of estimates and of willingness to pay. For example, French residents of the heavily polluted Marseilles region were asked what increase in living costs they would accept if they could live in a less polluted region. The anchoring effect was over 50% in that study. Anchoring effects are easily observed in online trading, where the same item is often offered at different “buy now” prices. The “estimate” in fine-art auctions is also an anchor that influences the first bid.
在许多关于估计和支付意愿的研究中,也得到了类似甚至更大的锚定效应。例如,法国马赛地区污染严重的居民被 ,问他们如果能生活在污染较少的地区,他们会接受多少生活费用的增加。 在该研究中,锚定效应超过50%。锚定效应在网上交易中很容易观察到,同一物品经常以不同的 "立即购买 "价格出售。 精美艺术品拍卖中的 "估价 "也是影响第一次出价的一个锚定因素。
There are situations in which anchoring appears reasonable. After all, it is not surprising that people who are asked difficult questions clutch at straws, and the anchor is a plausible straw. If you know next to nothing about the trees of California and are asked whether a redwood can be taller than 1,200 feet, you might infer that this number is not too far from the truth. Somebody who knows the true height thought up that question, so the anchor may be a valuable hint. However, a key finding of anchoring research is that anchors that are obviously random can be just as effective as potentially informative anchors. When we used a wheel of fortune to anchor estimates of the proportion of African nations in the UN, the anchoring index was 44%, well within the range of effects observed with anchors that could plausibly be taken as hints. Anchoring effects of similar size have been observed in experiments in which the last few digits of the respondent’s Social Security number was used as the anchor (e.g., for estimating the number of physicians in their city). The conclusion is clear: anchors do not have their effects because people believe they are informative.
在有些情况下,锚定 似乎是合理的。毕竟,被问到难题的人抓住稻草并不奇怪,而锚定就是一根看似合理的稻草。如果你对加州的树木几乎一无所知,并被问及红杉是否能超过1200英尺高,你可能会推断出这个数字与事实相差不大。有知道真实高度的人想出了这个问题, ,所以,这个锚可能是一个有价值的提示。然而,锚定研究的一个重要发现是,明显随机的锚定与潜在的信息锚定一样有效。 当我们使用幸运轮来锚定联合国中非洲国家的比例时,锚定指数为44%,远在可被视为暗示的锚定的效果范围之内, 。在实验中也观察到类似规模的锚定效果,其中被调查者的社会安全号码的最后几个数字被用作锚定(例如:。结论是明确的:锚的效果不是因为人们相信它们是有信息的。
The power of random anchors has been demonstrated in some unsettling ways. German judges with an average of more than fifteen years of experience on the bench first read a description of a woman who had been caught shoplifting, then rolled a pair of dice that were loaded so every roll resulted in either a 3 or a 9. As soon as the dice came to a stop, the judges were asked whether they would sentence the woman to a term in prison greater or lesser, in months, than the number showing on the dice. Finally, the judges were instructed to specify the exact prison sentence they would give to the shoplifter. On average, those who had rolled a 9 said they would sentence her to 8 months; those who rolled a 3 said they would sentence her to 5 months; the anchoring effect was 50%.
随机锚的力量已经在一些 ,令人不安。平均有15年以上法官经验的德国法官首先阅读了一个被抓获的妇女的描述,然后 掷出一对骰子 ,每一次掷出的结果不是3就是9。一旦骰子停了下来,法官们被问及他们是否会判处该妇女的刑期, ,比骰子上的数字大或小, 。最后,法官们被要求具体说明他们将对入店行窃者作出的确切监禁判决。平均而言,那些掷出9分的人说他们会判她8个月;那些掷出3分的人说他们会判她5个月;锚定效应为50%。
USES AND ABUSES OF ANCHORS
锚的使用和滥用
By now you should be convinced that anchoring effects—sometimes due to priming, sometimes to insufficient adjustment—are everywhere. The psychological mechanisms that produce anchoring make us far more suggestible than most of us would want to be. And of course there are quite a few people who are willing and able to exploit our gullibility.
现在你应该相信,锚定 ,有时是由于打底,有时是由于调整不足,这种效应无处不在。产生锚定的心理机制使我们的暗示性远远超过我们大多数人所希望的。 当然,也有相当多的人愿意并能够利用我们的轻信。
Anchoring effects explain why, for example, arbitrary rationing is an effective marketing ploy. A few years ago, supermarket shoppers in Sioux City, Iowa, encountered a sales promotion for Campbell’s soup at about 10% off the regular price. On some days, a sign on the shelf said LIMIT OF 12 PER PERSON . On other days, the sign said NO LIMIT PER PERSON . Shoppers purchased an average of 7 cans when the limit was in force, twice as many as they bought when the limit was removed. Anchoring is not the sole explanation. Rationing also implies that the goods are flying off the shelves, and shoppers should feel some urgency about stocking up. But we also know that the mention of 12 cans as a possible purchase would produce anchoring even if the number were produced by a roulette wheel.
锚定效应解释了为什么,例如,任意配给是一种有效的营销策略。A 几年前,爱荷华州苏城的超市购物者遇到了金宝汤的促销活动,其价格约为正常价格的10%。在某些日子里,架子上的牌子上写着 每人限购12只 。在其他日子里,牌子上写着 每人不限次数 。在实施限制时,购物者平均购买了7个罐头,是取消限制时的两倍。 配给制也意味着货物从货架上飞走,购物者应该对囤积货物感到一些紧迫性。但我们也知道,提到12个罐子作为一种可能的购买,即使这个数字是由轮盘产生的,也会产生锚定作用。
We see the same strategy at work in the negotiation over the price of a home, when the seller makes the first move by setting the list price. As in many other games, moving first is an advantage in single-issue negotiations—for example, when price is the only issue to be settled between a buyer and a seller. As you may have experienced when negotiating for the first time in a bazaar, the initial anchor has a powerful effect. My advice to students when I taught negotiations was that if you think the other side has made an outrageous proposal, you should not come back with an equally outrageous counteroffer, creating a gap that will be difficult to bridge in further negotiations. Instead you should make a scene, storm out or threaten to do so, and make it clear—to yourself as well as to the other side—that you will not continue the negotiation with that number on the table.
在房屋价格的谈判中,我们看到同样的策略在起作用,当卖家通过确定清单价格而做出 第一个动作。正如其他许多游戏一样,在单一问题的谈判中,先行一步是一种优势--例如,当价格是买家和卖家之间唯一需要解决的问题时。正如你第一次在集市上谈判时可能经历的那样,最初的锚具有强大的作用。我在教谈判时给学生的建议是,如果你认为 ,对方提出了一个离谱的建议,你不应该回以同样离谱的还价,造成一个在进一步谈判中难以弥补的差距。相反,你应该大吵大闹,冲出去或威胁这样做,并向自己和对方明确表示,你不会带着这个数字继续谈判。
The psychologists Adam Galinsky and Thomas Mussweiler proposed more subtle ways to resist the anchoring effect in negotiations. They instructed negotiators to focus their attention and search their memory for arguments against the anchor. The instruction to activate System 2 was successful. For example, the anchoring effect is reduced or eliminated when the second mover focuses his attention on the minimal offer that the opponent would accept, or on the costs to the opponent of failing to reach an agreement. In general, a strategy of deliberately “thinking the opposite” may be a good defense against anchoring effects, because it negates the biased recruitment of thoughts that produces these effects.
心理学家 Adam Galinsky和Thomas Mussweiler提出了在谈判中 抵制锚定效应 的更微妙的方法。他们指示谈判者集中注意力,在记忆中搜索反对锚定的 论据。例如,当第二推动者把注意力集中在对手会接受的最低报价 ,或集中在对手未能达成协议的代价上时,锚定效应就会减少或消除。一般来说,故意 "反过来想 "的策略可能是对锚定效应的良好防御,因为它否定了产生这些效应的有偏见的思想招募。
Finally, try your hand at working out the effect of anchoring on a problem of public policy: the size of damages in personal injury cases. These awards are sometimes very large. Businesses that are frequent targets of such lawsuits, such as hospitals and chemical companies, have lobbied to set a cap on the awards. Before you read this chapter you might have thought that capping awards is certainly good for potential defendants, but now you should not be so sure. Consider the effect of capping awards at $1 million. This rule would eliminate all larger awards, but the anchor would also pull up the size of many awards that would otherwise be much smaller . It would almost certainly benefit serious offenders and large firms much more than small ones.
最后,请试着计算一下锚定对公共政策问题的影响:人身伤害案件中损害赔偿的大小 。 这些赔偿金有时非常大。经常成为此类诉讼目标的企业,如医院和化学公司,已经游说为赔偿金设定上限。在你阅读本章之前,你可能认为对赔偿金设置上限对潜在的被告肯定有好处,但现在你不应该这么肯定。考虑将赔偿金上限定为100万美元的影响, 。这一规则将消除所有较大的赔偿金,但锚定也将拉高许多本来会 小得多的 赔偿金的规模。几乎可以肯定的是,严重犯罪者和大公司比小公司受益更多。
ANCHORING AND THE TWO SYSTEMS
锚定和两个系统
The effects of random anchors have much to tell us about the relationship between System 1 and System 2. Anchoring effects have always been studied in tasks of judgment and choice that are ultimately completed by System 2. However, System 2 works on data that is retrieved from memory, in an automatic and involuntary operation of System 1. System 2 is therefore susceptible to the biasing influence of anchors that make some information easier to retrieve. Furthermore, System 2 has no control over the effect and no knowledge of it. The participants who have been exposed to random or absurd anchors (such as Gandhi’s death at age 144) confidently deny that this obviously useless information could have influenced their estimate, and they are wrong.
随机锚定的效果对我们了解系统1和系统2之间的关系有很大帮助。锚定 效果一直是在最终由系统2完成的判断和选择任务中研究的。然而,系统2的工作是在系统1的自动和非自愿操作中从记忆中检索的数据。因此,系统2很容易受到使某些信息更容易被检索到的锚的偏向性影响。此外,系统2对这种影响没有控制权, ,对它一无所知。那些被暴露在随机或荒谬的锚(如甘地在144岁去世)的参与者自信地否认这些明显无用的信息可能影响他们的估计,他们是错误的。
We saw in the discussion of the law of small numbers that a message, unless it is immediately rejected as a lie, will have the same effect on the associative system regardless of its reliability. The gist of the message is the story, which is based on whatever information is available, even if the quantity of the information is slight and its quality is poor: WYSIATI. When you read a story about the heroic rescue of a wounded mountain climber, its effect on your associative memory is much the same if it is a news report or the synopsis of a film. Anchoring results from this associative activation. Whether the story is true, or believable, matters little, if at all. The powerful effect of random anchors is an extreme case of this phenomenon, because a random anchor obviously provides no information at all.
我们在讨论小数法则时看到,一个信息,除非立即被拒绝为谎言,否则无论其可靠性如何,都会对联想 系统产生相同的影响。信息的要点是故事,它基于任何可用的信息,即使信息的数量很少,质量很差:WYSIATI。当你读到一个关于英雄拯救受伤登山者的故事时,如果它是 新闻报道或电影的概要,它对你的联想记忆的影响是差不多的。 随机锚的强大作用是这种现象的一个极端例子,因为随机锚显然根本没有提供任何信息。
Earlier I discussed the bewildering variety of priming effects, in which your thoughts and behavior may be influenced by stimuli to which you pay no attention at all, and even by stimuli of which you are completely unaware. The main moral of priming research is that our thoughts and our behavior are influenced, much more than we know or want, by the environment of the moment. Many people find the priming results unbelievable, because they do not correspond to subjective experience. Many others find the results upsetting, because they threaten the subjective sense of agency and autonomy. If the content of a screen saver on an irrelevant computer can affect your willingness to help strangers without your being aware of it, how free are you? Anchoring effects are threatening in a similar way. You are always aware of the anchor and even pay attention to it, but you do not know how it guides and constrains your thinking, because you cannot imagine how you would have thought if the anchor had been different (or absent). However, you should assume that any number that is on the table has had an anchoring effect on you, and if the stakes are high you should mobilize yourself (your System 2) to combat the effect.
前面我讨论了令人困惑的各种引爆效应,在这种情况下,你的思想和行为可能被你 ,甚至被你完全不知道的刺激物所影响。引导研究的主要寓意是,我们的思想和行为受当时环境的影响,比我们知道的或想要的要大得多。 许多人觉得引导的结果不可思议,因为它们与主观经验不一致。还有许多人认为结果令人不安,因为它们威胁到 ,主观的代理和自主意识。如果无关紧要的电脑上的屏幕保护程序的内容可以影响你帮助陌生人的意愿,而你却没有意识到,你有多自由?锚定效应具有类似的威胁性。 你总是意识到锚定,甚至注意到它,但你不知道它是如何引导和制约你的思维的,因为你 ,无法想象如果锚定不同(或没有),你会如何思考。 然而,你应该假设任何摆在桌面上的数字对你有锚定效应,如果风险很大,你应该动员自己(你的系统2)来对抗这种效应。
SPEAKING OF ANCHORS
说到锚
“The firm we want to acquire sent us their business plan, with the revenue they expect. We shouldn’t let that number influence our thinking. Set it aside.”
"我们想要收购的公司给我们发送了他们的商业计划,其中有他们预期的收入。我们不应该 ,让这个数字影响我们的思考。把它放在一边。"
“Plans are best-case scenarios. Let’s avoid anchoring on plans when we forecast actual outcomes. Thinking about ways the plan could go wrong is one way to do it.”
"计划是最好的情况。 当我们预测实际结果时,让我们避免固定在计划上。 思考计划可能出错的方式是一种方法。"
“Our aim in the negotiation is to get them anchored on this number.”
"我们在谈判中的目标是让他们固定在这个数字上。"
“Let’s make it clear that if that is their proposal, the negotiations are over. We do not want to start there.”
"让我们清楚地表明,如果这就是他们的提议,谈判就结束了。我们不希望从那里开始。"
“The defendant’s lawyers put in a frivolous reference in which they mentioned a ridiculously low amount of damages , and they got the judge anchored on it!”
"被告的律师提出了一个无意义的参考意见,他们在其中提到了一个低得可笑的 赔偿金额 ,他们让法官锚定了这一点!"
12
12
The Science of Availability
可用性的科学
Amos and I had our most productive year in 1971–72, which we spent in Eugene, Oregon. We were the guests of the Oregon Research Institute, which housed several future stars of all the fields in which we worked—judgment, decision making, and intuitive prediction. Our main host was Paul Slovic, who had been Amos’s classmate at Ann Arbor and remained a lifelong friend. Paul was on his way to becoming the leading psychologist among scholars of risk, a position he has held for decades, collecting many honors along the way. Paul and his wife, Roz, introduced us to life in Eugene, and soon we were doing what people in Eugene do—jogging, barbecuing, and taking children to basketball games. We also worked very hard, running dozens of experiments and writing our articles on judgment heuristics. At night I wrote Attention and Effort . It was a busy year.
1971-72年,阿莫斯和我在俄勒冈州尤金市度过了我们最富有成效的一年。我们是俄勒冈州研究所的客人,该研究所容纳了我们所从事的所有领域--判断、决策和直觉预测--的几位未来之星。我们的主要客人是保罗-斯洛维奇,他是阿莫斯在安阿伯的同学,并一直是他的终身朋友。保罗正在成为风险学者中领先的心理学家,他在这个位置上工作了几十年,沿途获得了许多荣誉。 保罗和他的妻子罗兹向我们介绍了尤金的生活,很快我们就在做 ,就像尤金人做的那样--慢跑、烧烤和带孩子去看篮球比赛。我们也非常努力,做了几十个实验,写了我们关于判断启发式的文章。 晚上我写了《 注意力和努力 》。这是一个忙碌的一年。
One of our projects was the study of what we called the availability heuristic . We thought of that heuristic when we asked ourselves what people actually do when they wish to estimate the frequency of a category, such as “people who divorce after the age of 60” or “dangerous plants.” The answer was straightforward: instances of the class will be retrieved from memory, and if retrieval is easy and fluent, the category will be judged to be large. We defined the availability heuristic as the process of judging frequency by “ the ease with which instances come to mind.” The statement seemed clear when we formulated it, but the concept of availability has been refined since then. The two-system approach had not yet been developed when we studied availability, and we did not attempt to determine whether this heuristic is a deliberate problem-solving strategy or an automatic operation. We now know that both systems are involved.
我们的一个项目是研究我们所谓的 可用性启发式 。当我们问自己,当人们 ,当他们希望估计一个类别的频率时,例如 "60岁以后离婚的人 "或 "危险的植物 "时,我们想到了这个启发式。答案很简单:类的实例会被从记忆中检索出来,如果检索很容易,很流畅,那么这个类别就会被判断为很大。 我们把可用性启发式定义为通过 " 实例出现在脑海中的 容易程度 "来判断频率的过程。在我们制定这个声明时,这个声明似乎很清楚,但从那时起,可用性的概念已经得到完善。在我们研究可用性的时候,双系统方法还没有被开发出来,我们也没有试图确定这种启发式方法是一种 刻意的问题解决策略还是一种自动操作。我们现在知道,这两个系统都参与其中。
A question we considered early was how many instances must be retrieved to get an impression of the ease with which they come to mind. We now know the answer: none. For an example, think of the number of words that can be constructed from the two sets of letters below.
一个问题 ,我们早期考虑的是,必须检索多少个实例才能对它们想起的难易程度有一个印象。 我们现在知道答案:没有。举个例子,想一想从下面两组字母中可以构建多少个单词。
XUZONLCJM
XUZONLCJM
TAPCERHOB
TAPCERHOB
You knew almost immediately, without generating any instances, that one set offers far more possibilities than the other, probably by a factor of 10 or more. Similarly, you do not need to retrieve specific news stories to have a good idea of the relative frequency with which different countries have appeared in the news during the past year (Belgium, China, France, Congo, Nicaragua, Romania …).
你几乎立刻就知道,不用产生任何实例,一个集合提供的可能性远远多于另一个 ,可能是10倍以上。同样,你不需要检索具体的新闻故事,就可以很好地了解过去一年中不同国家出现在新闻中的相对频率(比利时、中国、法国、刚果、尼加拉瓜、罗马尼亚......)。
The availability heuristic, like other heuristics of judgment, substitutes one question for another: you wish to estimate the size of a category or the frequency of an event, but you report an impression of the ease with which instances come to mind. Substitution of questions inevitably produces systematic errors. You can discover how the heuristic leads to biases by following a simple procedure: list factors other than frequency that make it easy to come up with instances. Each factor in your list will be a potential source of bias. Here are some examples:
可用性启发式,就像其他判断的启发式一样,用一个问题代替另一个问题:你希望 ,估计一个类别的大小或一个事件的频率,但你报告的是对实例是否容易想到的印象。问题的替代不可避免地产生系统性错误。 你可以通过一个简单的程序发现启发式如何导致偏见:列出除频率以外的使实例容易想到的因素。你列表中的每个因素将被 ,成为偏见的潜在来源。这里有一些例子。
A salient event that attracts your attention will be easily retrieved from memory. Divorces among Hollywood celebrities and sex scandals among politicians attract much attention, and instances will come easily to mind. You are therefore likely to exaggerate the frequency of both Hollywood divorces and political sex scandals.
吸引你注意力的突出事件会很容易从记忆中检索出来。好莱坞名人的离婚和政治家的性丑闻吸引了很多人的注意,并且很容易想到一些例子。 因此,你很可能夸大了好莱坞离婚和政治性丑闻的频率。
A dramatic event temporarily increases the availability of its category.
一个戏剧性的事件 ,暂时增加其类别的可用性。
A plane crash that attracts media coverage will temporarily alter your feelings about the safety of flying. Accidents are on your mind, for a while, after you see a car burning at the side of the road, and the world is for a while a more dangerous place.
一场吸引媒体报道的飞机失事会暂时改变你对飞行安全的感受。 在你看到一辆汽车在路边燃烧后,事故会在你的脑海中出现一段时间,世界在一段时间内变得更加危险。
Personal experiences, pictures, and vivid examples are more available than incidents that happened to others, or mere words, or statistics. A judicial error that affects you will undermine your faith in the justice system more than a similar incident you read about in a newspaper.
个人经历、图片和生动的例子比发生在别人身上的事件 ,或仅仅是文字,或统计数据更容易获得。影响到你的司法错误比你在报纸上看到的类似事件更能破坏你对司法系统的信心。
Resisting this large collection of potential availability biases is possible, but tiresome. You must make the effort to reconsider your impressions and intuitions by asking such questions as, “Is our belief that thefts by teenagers are a major problem due to a few recent instances in our neighborhood?” or “Could it be that I feel no need to get a flu shot because none of my acquaintances got the flu last year?” Maintaining one’s vigilance against biases is a chore—but the chance to avoid a costly mistake is sometimes worth the effort.
抵制 ,这一大批潜在的可用性偏差是可能的,但令人厌烦。你必须努力重新考虑你的印象和直觉,提出这样的问题:"我们 ,认为青少年偷窃是一个大问题,是不是因为我们附近最近发生的几起案件?"或者 "会不会因为我的熟人去年都没有得流感,所以我觉得没有必要打流感疫苗?"保持对偏见的警惕是一件苦差事--但避免代价高昂的错误的机会有时是值得努力的。
One of the best-known studies of availability suggests that awareness of your own biases can contribute to peace in marriages, and probably in other joint projects. In a famous study, spouses were asked, “How large was your personal contribution to keeping the place tidy, in percentages?” They also answered similar questions about “taking out the garbage,” “initiating social engagements,” etc. Would the self-estimated contributions add up to 100%, or more, or less? As expected, the self-assessed contributions added up to more than 100%. The explanation is a simple availability bias : both spouses remember their own individual efforts and contributions much more clearly than those of the other, and the difference in availability leads to a difference in judged frequency. The bias is not necessarily self-serving: spouses also overestimated their contribution to causing quarrels, although to a smaller extent than their contributions to more desirable outcomes. The same bias contributes to the common observation that many members of a collaborative team feel they have done more than their share and also feel that the others are not adequately grateful for their individual contributions.
一项最著名的可用性研究表明, ,对自己的偏见的认识可以促进婚姻中的和平,可能在其他联合项目中也是如此。 在一项著名的研究中,配偶被问到:"你个人对保持地方整洁的贡献有多大,按百分比计算?"他们还回答了关于 "倒垃圾"、"发起社交活动 "等类似问题。自我估计的贡献加起来是100%,还是 ,还是更少?正如预期的那样, 自我评估的贡献 加起来超过了100%。解释是一个简单的 可得性偏差 :配偶双方对自己的个人努力和贡献的记忆要比对方清楚得多,而可得性的差异导致了判断频率的差异。这种偏见不一定是为自己服务的:配偶也高估了他们 ,导致争吵的贡献,尽管程度比他们对更理想的结果的贡献要小。 同样的偏见有助于常见的观察,即合作团队的许多成员觉得他们做得比自己的份额多,也觉得其他人对他们的个人贡献没有充分的感谢。
I am generally not optimistic about the potential for personal control of biases, but this is an exception. The opportunity for successful debiasing exists because the circumstances in which issues of credit allocation come up are easy to identify, the more so because tensions often arise when several people at once feel that their efforts are not adequately recognized. The mere observation that there is usually more than 100% credit to go around is sometimes sufficient to defuse the situation. In any event, it is a good thing for every individual to remember. You will occasionally do more than your share, but it is useful to know that you are likely to have that feeling even when each member of the team feels the same way.
我一般不看好个人控制偏见的潜力 ,但这是个例外。成功去伪存真的机会存在,因为信用分配问题出现的情况很容易识别,越是这样,当几个人同时感到他们的努力没有得到充分的认可时,往往会出现紧张。仅仅观察到通常有超过100%的功劳 ,有时就足以化解这种情况。无论如何,每个人都要记住这一点。你偶尔会做得比你的份额多,但知道你有可能有这种感觉是很有用的,即使团队的每个成员都有同样的感觉。
THE PSYCHOLOGY OF AVAILABILITY
可用性的心理学
A major advance in the understanding of the availability heuristic occurred in the early 1990s, when a group of German psychologists led by Norbert Schwarz raised an intriguing question: How will people’s impressions of the frequency of a category be affected by a requirement to list a specified number of instances? Imagine yourself a subject in that experiment:
对可得性启发式理解的 一个重大进展 发生在 20世纪90年代初,当时由诺伯特- 施瓦茨领导的一组德国心理学家提出了一个有趣的问题:人们对一个类别的频率的印象会受到列出特定数量的实例的要求的影响吗?想象一下你自己是这个实验的主体。
First, list six instances in which you behaved assertively.
首先,列出六个你表现得很自信的例子。
Next, evaluate how assertive you are.
接下来,评估你的自信程度。
Imagine that you had been asked for twelve instances of assertive behavior (a number most people find difficult). Would your view of your own assertiveness be different?
想象一下, ,你被要求提供12个自信的行为实例(大多数人都觉得这个数字很难)。你对自己的自信的看法会不会有所不同?
Schwarz and his colleagues observed that the task of listing instances may enhance the judgments of the trait by two different routes:
Schwarz和他的同事观察到,列举实例的任务可能通过两种不同的途径增强对特质的判断。
the number of instances retrieved
检索到的实例数量
the ease with which they come to mind
记忆中的轻松
The request to list twelve instances pits the two determinants against each other. On the one hand, you have just retrieved an impressive number of cases in which you were assertive. On the other hand, while the first three or four instances of your own assertiveness probably came easily to you, you almost certainly struggled to come up with the last few to complete a set of twelve; fluency was low. Which will count more—the amount retrieved or the ease and fluency of the retrieval?
要求列出12个实例 ,使两个决定因素相互对立。一方面,你刚刚检索到了数量可观的你自信的案例。 另一方面,虽然你自己自信的前三或四个例子可能很容易想到,但你几乎肯定要努力想出最后几个例子来完成一组12个案例;流畅度很低。哪个更重要-- 检索的数量或检索的轻松和流畅度?
The contest yielded a clear-cut winner: people who had just listed twelve instances rated themselves as less assertive than people who had listed only six. Furthermore, participants who had been asked to list twelve cases in which they had not behaved assertively ended up thinking of themselves as quite assertive! If you cannot easily come up with instances of meek behavior, you are likely to conclude that you are not meek at all. Self-ratings were dominated by the ease with which examples had come to mind. The experience of fluent retrieval of instances trumped the number retrieved.
比赛产生了一个明显的赢家:刚刚列出12个例子的人比只列出6个例子的人对自己的评价要低。此外,那些被要求列出12个他们 没有 自信行为的案例的参与者,最后都认为自己是相当自信的!这就是为什么他们会认为自己的行为是自信的。如果你不能轻易地 ,想出温顺行为的例子,你很可能会得出结论,你根本就不是温顺的。 自我评价是由想到例子的难易程度来决定的。 流畅地检索例子的经验胜过检索的数量。
An even more direct demonstration of the role of fluency was offered by other psychologists in the same group. All the participants in their experiment listed six instances of assertive (or nonassertive) behavior, while maintaining a specified facial expression. “Smilers” were instructed to contract the zygomaticus muscle, which produces a light smile; “frowners” were required to furrow their brow. As you already know, frowning normally accompanies cognitive strain and the effect is symmetric: when people are instructed to frown while doing a task, they actually try harder and experience greater cognitive strain. The researchers anticipated that the frowners would have more difficulty retrieving examples of assertive behavior and would therefore rate themselves as relatively lacking in assertiveness. And so it was.
同一小组的其他心理学家对 流畅性的作用 做了更直接的证明。在他们的 实验中,所有参与者都列出了六种自信(或不自信)的行为,同时保持特定的面部表情。"微笑者 "被要求收缩颧骨肌肉,从而产生一个浅笑;"皱眉者 "被要求皱起眉毛。正如你已经知道的那样,皱眉通常伴随着认知紧张,而且效果是对称的:当人们被指示在做一项任务时皱眉 ,他们实际上更努力 ,并经历更大的认知紧张。研究人员预计,皱眉者会更难检索出自信行为的例子,因此会把自己评为相对缺乏自信。 事实也是如此。
Psychologists enjoy experiments that yield paradoxical results, and they have applied Schwarz’s discovery with gusto. For example, people:
心理学家喜欢产生矛盾结果的实验,他们将施瓦茨的发现应用于 ,并大肆宣传。比如说,人。
believe that they use their bicycles less often after recalling many rather than few instances
在回忆了许多而不是少数的例子后,相信他们会减少使用自行车的次数
are less confident in a choice when they are asked to produce more arguments to support it
当他们被要求提出更多的论据来支持他们的选择时,他们对自己的选择不太自信
are less confident that an event was avoidable after listing more ways it could have been avoided
在列举了更多可以避免的方式后,就不太相信事件是可以避免的。
are less impressed by a car after listing many of its advantages
在列举了汽车的许多优点后,对其印象不深
A professor at UCLA found an ingenious way to exploit the availability bias. He asked different groups of students to list ways to improve the course, and he varied the required number of improvements. As expected, the students who listed more ways to improve the class rated it higher!
加州大学洛杉矶分校的一位教授 ,找到了一个巧妙的方法来利用可用性偏差。他要求不同组别的学生列出改进课程的方法,并改变了所要求的改进数量。 正如预期的那样,列出更多改进方法的学生对这门课的评价较高!这也是为什么他的学生对这门课的评价较高。
Perhaps the most interesting finding of this paradoxical research is that the paradox is not always found: people sometimes go by content rather than by ease of retrieval. The proof that you truly understand a pattern of behavior is that you know how to reverse it. Schwarz and his colleagues took on this challenge of discovering the conditions under which this reversal would take place.
也许这个矛盾的研究最有趣的发现是,矛盾并不总是被发现:人们有时会根据 内容而不是根据检索的难易程度。你真正理解一种行为模式的证明是你知道如何扭转它。施瓦茨和他的同事们接受了这一挑战,发现了这种逆转发生的条件。
The ease with which instances of assertiveness come to the subject’s mind changes during the task. The first few instances are easy, but retrieval soon becomes much harder. Of course, the subject also expects fluency to drop gradually, but the drop of fluency between six and twelve instances appears to be steeper than the participant expected. The results suggest that the participants make an inference: if I am having so much more trouble than expected coming up with instances of my assertiveness, then I can’t be very assertive. Note that this inference rests on a surprise—fluency being worse than expected. The availability heuristic that the subjects apply is better described as an “unexplained unavailability” heuristic.
在任务过程中,被试者脑海中出现自信的事例的难易程度会发生变化。最初的几个例子很容易,但 检索很快就变得更加困难。当然,被试也期望流畅度逐渐下降,但六到十二个实例之间的流畅度下降似乎比被试预期的要陡峭。 结果表明,被试做出了一个推断:如果我在想出我的自信的实例时比预期的要困难得多,那么我就不可能很自信。 请注意,这种推断是建立在一个意外的情况下,即流畅性比预期的要差。 受试者应用的可用性启发式被描述为 "无法解释的不可用性 "启发式更好。
Schwarz and his colleagues reasoned that they could disrupt the heuristic by providing the subjects with an explanation for the fluency of retrieval that they experienced. They told the participants they would hear background music while recalling instances and that the music would affect performance in the memory task. Some subjects were told that the music would help, others were told to expect diminished fluency. As predicted, participants whose experience of fluency was “explained” did not use it as a heuristic; the subjects who were told that music would make retrieval more difficult rated themselves as equally assertive when they retrieved twelve instances as when they retrieved six. Other cover stories have been used with the same result: judgments are no longer influenced by ease of retrieval when the experience of fluency is given a spurious explanation by the presence of curved or straight text boxes, by the background color of the screen, or by other irrelevant factors that the experimenters dreamed up .
Schwarz和他的同事推断,他们可以通过向受试者解释他们所经历的 检索的流畅性来破坏启发式。他们告诉受试者 ,他们在回忆事例时将听到背景音乐,而且音乐将影响记忆任务的表现。正如预测的那样,那些对流畅性的体验得到 "解释 "的参与者并没有把它作为一种启发式的方法;那些被告知音乐会使检索更加困难的受试者在检索到12个实例时和检索到6个实例时对 自己的评价是一样的自信。其他的封面故事也有同样的结果:当流畅性的体验被弯曲或笔直的文本框的存在、屏幕的背景颜色或其他不相关的因素( 实验者梦想出来的 )给予虚假的解释时,判断不再受到检索的难易程度的影响。
As I have described it, the process that leads to judgment by availability appears to involve a complex chain of reasoning. The subjects have an experience of diminishing fluency as they produce instances. They evidently have expectations about the rate at which fluency decreases, and those expectations are wrong: the difficulty of coming up with new instances increases more rapidly than they expect. It is the unexpectedly low fluency that causes people who were asked for twelve instances to describe themselves as unassertive. When the surprise is eliminated, low fluency no longer influences the judgment. The process appears to consist of a sophisticated set of inferences. Is the automatic System 1 capable of it?
正如我所描述的,导致可得性判断的过程似乎涉及一个复杂的推理链。受试者在产生实例时有一种流畅性下降的体验。 他们显然对流畅性下降的速度有预期,而这些预期是错误的:想出新实例的难度比他们预期的要快 。正是意外的低流畅性导致被要求提供12个实例的人将自己描述为不自信。当惊喜消除后,低流畅性不再影响判断。 这个过程似乎由一套复杂的推理组成。 自动系统1是否能够做到?
The answer is that in fact no complex reasoning is needed. Among the basic features of System 1 is its ability to set expectations and to be surprised when these expectations are violated. The system also retrieves possible causes of a surprise, usually by finding a possible cause among recent surprises. Furthermore, System 2 can reset the expectations of System 1 on the fly, so that an event that would normally be surprising is now almost normal. Suppose you are told that the three-year-old boy who lives next door frequently wears a top hat in his stroller. You will be far less surprised when you actually see him with his top hat than you would have been without the warning. In Schwarz’s experiment, the background music has been mentioned as a possible cause of retrieval problems. The difficulty of retrieving twelve instances is no longer a surprise and therefore is less likely to be evoked by the task of judging assertiveness.
答案是,事实上不需要复杂的推理 。在系统1的基本特征中,它能够设定预期,并在这些预期被违反时感到惊讶。该系统还能检索出惊讶的可能原因,通常是在最近的惊讶中找到一个可能的原因。 此外,系统2能够在飞行中重置系统1的预期,因此,一个通常会感到惊讶的事件现在几乎是正常的。 假设有人告诉你,住在隔壁的三岁男孩经常在婴儿车里戴着高帽。当你真正看到他戴着高帽时,你的惊讶程度会远远低于没有警告的情况下。在施瓦茨的实验中,背景音乐被提到是导致检索问题的一个可能原因。 检索12个实例的困难不再是 ,因此不太可能被判断自信的任务所唤起。
Schwarz and his colleagues discovered that people who are personally involved in the judgment are more likely to consider the number of instances they retrieve from memory and less likely to go by fluency. They recruited two groups of students for a study of risks to cardiac health. Half the students had a family history of cardiac disease and were expected to take the task more seriously than the others, who had no such history. All were asked to recall either three or eight behaviors in their routine that could affect their cardiac health (some were asked for risky behaviors, others for protective behaviors). Students with no family history of heart disease were casual about the task and followed the availability heuristic. Students who found it difficult to find eight instances of risky behavior felt themselves relatively safe, and those who struggled to retrieve examples of safe behaviors felt themselves at risk. The students with a family history of heart disease showed the opposite pattern—they felt safer when they retrieved many instances of safe behavior and felt greater danger when they retrieved many instances of risky behavior. They were also more likely to feel that their future behavior would be affected by the experience of evaluating their risk.
Schwarz和他的同事发现,亲自参与判断的人更有可能考虑他们从记忆中检索到的实例的数量,而不太可能按照流利程度来判断。 他们招募了两组学生进行一项关于心脏健康风险的研究。一半 ,这些学生有心脏疾病的家族史,预计他们会比其他没有这种历史的人更认真地对待这项任务。所有的人都被要求回忆他们日常工作中可能 影响他们心脏健康的 三种或八种行为(有些被要求回忆危险行为,有些被要求回忆保护行为)。没有心脏病家族史的学生对任务很随意, 遵循可得性启发式。那些发现很难找到8个危险行为实例的学生觉得自己相对安全,而那些努力检索安全行为实例的学生觉得自己有风险。 有心脏病家族史的学生表现出相反的模式--当他们检索到许多安全行为的实例时,他们觉得更安全,而当他们检索到许多危险行为的实例时,觉得更危险 。他们也更有可能认为他们未来的行为会受到评估其风险的经验的影响。
The conclusion is that the ease with which instances come to mind is a System 1 heuristic, which is replaced by a focus on content when System 2 is more engaged. Multiple lines of evidence converge on the conclusion that people who let themselves be guided by System 1 are more strongly susceptible to availability biases than others who are in a state of higher vigilance. The following are some conditions in which people “go with the flow” and are affected more strongly by ease of retrieval than by the content they retrieved:
结论是,实例容易出现在脑海中是系统1的启发式思维,当系统2更多的参与时,它被对内容的关注所取代。多条证据汇集在这样的结论上: ,让自己被系统1引导的人比处于更高的警惕状态的人更容易受到可用性偏见的影响。以下是一些情况,在这些情况下,人们 "随波逐流",受检索的难易程度比检索的内容影响更强烈。
when they are engaged in another effortful task at the same time
当他们同时从事另一项 努力的任务 时
when they are in a good mood because they just thought of a happy episode in their life
当他们心情好的时候, ,因为他们刚刚想到了 他们生活中 的一个 快乐的事件 。
if they score low on a depression scale
如果他们 在抑郁症量表上 得分 较低
if they are knowledgeable novices on the topic of the task, in contrast to true experts
如果他们在任务的主题上是 知识渊博的新手 ,与 真正的专家 相反
when they score high on a scale of faith in intuition
当他们在 对直觉的信任 程度上得到高分时
if they are (or are made to feel) powerful
如果他们是(或被认为是)有权力的人
I find the last finding particularly intriguing. The authors introduce their article with a famous quote: “I don’t spend a lot of time taking polls around the world to tell me what I think is the right way to act. I’ve just got to know how I feel” (George W. Bush, November 2002). They go on to show that reliance on intuition is only in part a personality trait. Merely reminding people of a time when they had power increases their apparent trust in their own intuition.
我发现最后一个发现特别耐人寻味。作者用一句名言来介绍他们的文章: "我不会花很多时间在世界各地进行民意调查,告诉我什么是正确的行为方式。 我只需要知道我的感觉"(乔治-W-布什,2002年11月)。他们继续表明,对直觉的依赖只是部分的人格特征。仅仅是提醒人们注意他们拥有权力的时候,就能增加他们对自己直觉的明显信任。
SPEAKING OF AVAILABILITY
说到 的可用性
“Because of the coincidence of two planes crashing last month, she now prefers to take the train. That’s silly. The risk hasn’t really changed; it is an availability bias.”
"因为 ,上个月发生了两架飞机坠毁的巧合,她现在更喜欢坐火车。这很傻。风险并没有真正改变;它是一种可用性偏差。"
“He underestimates the risks of indoor pollution because there are few media stories on them. That’s an availability effect. He should look at the statistics.”
"他低估了室内污染的风险,因为媒体对它们的报道很少。 这是一种可得性效应。 他应该看一下统计数据。"
“She has been watching too many spy movies recently, so she’s seeing conspiracies everywhere.”
"她最近看了太多的间谍电影,所以她到处看到阴谋 。"
“The CEO has had several successes in a row, so failure doesn’t come easily to her mind. The availability bias is making her overconfident.”
"首席执行官已经连续取得了几次成功,所以失败在她的脑海中并不容易。 可得性偏见使她过度自信。"
13
13
Availability, Emotion, and Risk
可用性、情感和风险
Students of risk were quick to see that the idea of availability was relevant to their concerns. Even before our work was published, the economist Howard Kunreuther, who was then in the early stages of a career that he has devoted to the study of risk and insurance, noticed that availability effects help explain the pattern of insurance purchase and protective action after disasters. Victims and near victims are very concerned after a disaster. After each significant earthquake, Californians are for a while diligent in purchasing insurance and adopting measures of protection and mitigation. They tie down their boiler to reduce quake damage, seal their basement doors against floods, and maintain emergency supplies in good order. However, the memories of the disaster dim over time, and so do worry and diligence. The dynamics of memory help explain the recurrent cycles of disaster, concern, and growing complacency that are familiar to students of large-scale emergencies.
研究风险的学生很快就发现,可得性的概念与他们所关注的问题有关。甚至在我们的工作发表之前,经济学家Howard Kunreuther(他当时正处于致力于研究风险和保险的职业生涯的早期阶段 )就注意到,可得性效应有助于解释购买保险和灾后保护行动的模式。灾难发生后,受害者和接近受害者的人非常担心。每次大地震发生后,加州人都会在一段时间内勤奋地购买保险,并采取保护和减灾措施。 ,他们把锅炉绑起来,以减少地震的破坏,把地下室的门封起来,防止水灾,并保持应急物资的良好状态。然而,对灾难的记忆会随着时间的推移而变淡,担心和勤奋也是如此。记忆的动力有助于解释大规模紧急情况的学生所熟悉的灾难、担忧和日益增长的自满情绪的反复循环。
Kunreuther also observed that protective actions, whether by individuals or governments, are usually designed to be adequate to the worst disaster actually experienced. As long ago as pharaonic Egypt, societies have tracked the high-water mark of rivers that periodically flood—and have always prepared accordingly, apparently assuming that floods will not rise higher than the existing high-water mark. Images of a worse disaster do not come easily to mind.
Kunreuther 还注意到,无论是个人还是政府的保护行动,通常都是为了足以应对实际发生的最严重的灾害。 早在法老时代的埃及,社会就已经追踪定期发生洪水的河流的高水位线--而且总是做相应的准备,显然是假设洪水不会高于现有的高水位线。图片 ,更糟糕的灾难不容易出现在脑海中。
AVAILABILITY AND AFFECT
可用性和影响
The most influential studies of availability biases were carried out by our friends in Eugene, where Paul Slovic and his longtime collaborator Sarah Lichtenstein were joined by our former student Baruch Fischhoff. They carried out groundbreaking research on public perceptions of risks, including a survey that has become the standard example of an availability bias. They asked participants in their survey to consider pairs of causes of death: diabetes and asthma, or stroke and accidents. For each pair, the subjects indicated the more frequent cause and estimated the ratio of the two frequencies. The judgments were compared to health statistics of the time. Here’s a sample of their findings:
对可用性偏差最有影响力的研究是由我们在尤金的朋友进行的,在那里,保罗-斯洛维奇和他的长期合作者萨拉-利希滕斯坦与我们以前的学生巴鲁克-菲施霍夫一起。他们对公众的风险认知进行了开创性的研究,包括一项调查,该调查已成为 ,成为可用性偏差的标准范例。他们在调查中要求参与者考虑成对的死亡原因:糖尿病和哮喘,或中风和意外。 对于每一对,受试者指出更频繁的原因,并估计这两个频率的比率。 判断结果与当时的健康统计数据进行了比较。 以下是他们的调查结果的一个样本。
Strokes cause almost twice as many deaths as all accidents combined, but 80% of respondents judged accidental death to be more likely.
中风造成的死亡人数几乎是 ,是所有事故的两倍,但80%的受访者判断意外死亡的可能性更大。
Tornadoes were seen as more frequent killers than asthma, although the latter cause 20 times more deaths.
龙卷风被认为是比哮喘更频繁的杀手,尽管后者造成的死亡人数是哮喘的20倍。
Death by lightning was judged less likely than death from botulism even though it is 52 times more frequent.
被雷电击死的可能性被判定为低于肉毒杆菌死亡,尽管它的发生率是52倍。
Death by disease is 18 times as likely as accidental death, but the two were judged about equally likely.
死于疾病的可能性是意外死亡的18倍,但 ,两者被判定的可能性大致相同。
Death by accidents was judged to be more than 300 times more likely than death by diabetes, but the true ratio is 1:4.
据判断,死于事故的可能性是死于糖尿病的300多倍,但真正的比例是1:4。
The lesson is clear: estimates of causes of death are warped by media coverage. The coverage is itself biased toward novelty and poignancy. The media do not just shape what the public is interested in, but also are shaped by it. Editors cannot ignore the public’s demands that certain topics and viewpoints receive extensive coverage. Unusual events (such as botulism) attract disproportionate attention and are consequently perceived as less unusual than they really are. The world in our heads is not a precise replica of reality; our expectations about the frequency of events are distorted by the prevalence and emotional intensity of the messages to which we are exposed.
教训很明显:对死亡原因的估计被媒体报道扭曲了。报道本身就偏向于新奇和悲壮。媒体不仅塑造了公众感兴趣的东西,而且也被公众所塑造。编辑们不能 ,忽视公众对某些话题和观点得到广泛报道的要求。不寻常的事件(如肉毒杆菌)吸引了不相称的注意力,因此被认为比实际情况更不寻常。 我们头脑中的世界不是现实的精确复制品;我们对事件发生频率的预期被我们所接触到的信息的普遍性和情感强度所扭曲, 。
The estimates of causes of death are an almost direct representation of the activation of ideas in associative memory, and are a good example of substitution. But Slovic and his colleagues were led to a deeper insight: they saw that the ease with which ideas of various risks come to mind and the emotional reactions to these risks are inextricably linked. Frightening thoughts and images occur to us with particular ease, and thoughts of danger that are fluent and vivid exacerbate fear.
对死亡原因的估计几乎直接体现了联想记忆中观念的激活,是替代的一个很好的例子。但斯洛维奇和他的同事被引导到一个更深的见解:他们看到,各种风险的观念在脑海中出现的难易程度和对这些风险的情绪反应是密不可分的。 令人恐惧的 思想和图像特别容易发生在我们身上,而对危险的想法如果流畅而生动,就会加剧恐惧。
As mentioned earlier, Slovic eventually developed the notion of an affect heuristic, in which people make judgments and decisions by consulting their emotions: Do I like it? Do I hate it? How strongly do I feel about it? In many domains of life, Slovic said, people form opinions and make choices that directly express their feelings and their basic tendency to approach or avoid, often without knowing that they are doing so. The affect heuristic is an instance of substitution, in which the answer to an easy question (How do I feel about it?) serves as an answer to a much harder question (What do I think about it?). Slovic and his colleagues related their views to the work of the neuroscientist Antonio Damasio, who had proposed that people’s emotional evaluations of outcomes, and the bodily states and the approach and avoidance tendencies associated with them, all play a central role in guiding decision making. Damasio and his colleagues have observed that people who do not display the appropriate emotions before they decide, sometimes because of brain damage , also have an impaired ability to make good decisions. An inability to be guided by a “healthy fear” of bad consequences is a disastrous flaw.
如前所述,斯洛维奇最终提出了情感启发式的概念,即人们通过咨询他们的情感来做出判断和决定:我喜欢它吗?我讨厌它吗?我对它的感觉有多强烈?斯洛维奇说,在生活的许多领域,人们 ,形成意见并做出选择,直接表达他们的感受和他们接近或回避的基本倾向,往往不知道他们正在这样做。情感启发式是一个替代的例子,在这个例子中,对一个简单问题(我对它的感觉如何?)的回答可以作为对一个更难的问题(我对它的看法如何?) 斯洛维奇和他的同事将他们的观点与神经科学家安东尼奥-达马西奥(Antonio Damasio)的工作联系起来,他提出人们对结果的情感评价,以及与之相关的身体状态和接近与回避倾向,都在指导决策中发挥着核心作用。达马西奥和他的同事们观察到,那些在做决定前没有表现出适当情绪的人,有时 是因为大脑受损 ,也有做出良好决定的能力受损。无法被对不良后果的 "健康恐惧 "引导是一个灾难性的缺陷。
In a compelling demonstration of the workings of the affect heuristic, Slovic’s research team surveyed opinions about various technologies, including water fluoridation, chemical plants, food preservatives, and cars, and asked their respondents to list both the benefits and the risks of each technology . They observed an implausibly high negative correlation between two estimates that their respondents made: the level of benefit and the level of risk that they attributed to the technologies. When people were favorably disposed toward a technology, they rated it as offering large benefits and imposing little risk; when they disliked a technology, they could think only of its disadvantages, and few advantages came to mind. Because the technologies lined up neatly from good to bad, no painful tradeoffs needed to be faced. Estimates of risk and benefit corresponded even more closely when people rated risks and benefits under time pressure. Remarkably, members of the British Toxicology Society responded similarly: they found little benefit in substances or technologies that they thought risky, and vice versa. Consistent affect is a central element of what I have called associative coherence.
在一个令人信服的影响启发式运作的演示中,斯洛维奇的研究小组调查了关于各种技术的意见,包括氟化水、化工厂、食品防腐剂和汽车,并要求他们的受访者列出 每项技术的 好处和 风险 。他们观察到受访者做出的两个估计之间存在令人难以置信的高度负相关:他们认为这些技术的好处和风险水平。当人们对某项技术有好感时,他们认为它能提供巨大的利益,而且风险很小;当他们不喜欢某项技术时, ,他们只能想到它的缺点,而很少想到优点。因为这些技术从好到坏整齐地排列着,不需要面对痛苦的权衡。当人们在时间压力下对风险和益处进行评价时,风险和益处的估计甚至更加接近。 值得注意的是, 英国毒理学会 的成员也做出了类似的反应:他们发现在他们认为有风险的物质或技术中几乎没有益处 ,反之亦然。一致的情感是我所称的联想一致性的一个核心要素。
The best part of the experiment came next. After completing the initial survey, the respondents read brief passages with arguments in favor of various technologies. Some were given arguments that focused on the numerous benefits of a technology; others, arguments that stressed the low risks. These messages were effective in changing the emotional appeal of the technologies. The striking finding was that people who had received a message extolling the benefits of a technology also changed their beliefs about its risks. Although they had received no relevant evidence, the technology they now liked more than before was also perceived as less risky. Similarly, respondents who were told only that the risks of a technology were mild developed a more favorable view of its benefits. The implication is clear: as the psychologist Jonathan Haidt said in another context, “The emotional tail wags the rational dog .” The affect heuristic simplifies our lives by creating a world that is much tidier than reality. Good technologies have few costs in the imaginary world we inhabit, bad technologies have no benefits, and all decisions are easy. In the real world, of course, we often face painful tradeoffs between benefits and costs.
实验最精彩的部分在接下来。 在完成最初的调查后,受访者阅读了支持各种技术的简短论据。一些人得到的论据侧重于某项技术的众多好处; ,其他的论据则强调 的低风险。这些信息有效地改变了技术的情感吸引力。 引人注目的发现是,那些收到赞美技术好处的信息的人也改变了他们对技术风险的看法。 尽管他们没有收到相关的证据,但他们现在比以前更喜欢的技术也被认为是 ,风险更低。同样,那些只被告知某项技术的风险较小的受访者,对其利益产生了更有利的看法。 其含义很清楚:正如心理学家乔纳森-海特在另一种情况下所说的,"情感的尾巴 摇动着理性的狗 。"影响启发式通过创造一个比现实更整洁的世界来简化我们的生活。在我们居住的假想世界中,好的技术几乎没有 ,坏的技术没有好处,所有的决定都很容易。当然,在现实世界中,我们经常面临利益和成本之间痛苦的权衡。
THE PUBLIC AND THE EXPERTS
公众和专家
Paul Slovic probably knows more about the peculiarities of human judgment of risk than any other individual. His work offers a picture of Mr. and Ms. Citizen that is far from flattering: guided by emotion rather than by reason, easily swayed by trivial details, and inadequately sensitive to differences between low and negligibly low probabilities. Slovic has also studied experts, who are clearly superior in dealing with numbers and amounts. Experts show many of the same biases as the rest of us in attenuated form, but often their judgments and preferences about risks diverge from those of other people.
保罗-斯洛维奇可能比其他任何人都更了解人类对风险判断的特殊性。他的工作为公民先生和女士提供了一幅远非 谄媚的画面:由情感而非理性引导,容易被琐碎的细节所左右,对低概率和可忽略不计的低概率之间的差异不够敏感。 斯洛维奇还研究了专家,他们在处理数字和金额方面显然更有优势。专家以减弱的形式显示了许多与我们其他人相同的偏见,但他们对风险的判断和偏好往往与其他人不同 。
Differences between experts and the public are explained in part by biases in lay judgments, but Slovic draws attention to situations in which the differences reflect a genuine conflict of values. He points out that experts often measure risks by the number of lives (or life-years) lost, while the public draws finer distinctions, for example between “good deaths” and “bad deaths,” or between random accidental fatalities and deaths that occur in the course of voluntary activities such as skiing. These legitimate distinctions are often ignored in statistics that merely count cases. Slovic argues from such observations that the public has a richer conception of risks than the experts do. Consequently, he strongly resists the view that the experts should rule, and that their opinions should be accepted without question when they conflict with the opinions and wishes of other citizens. When experts and the public disagree on their priorities, he says, “Each side must respect the insights and intelligence of the other.”
专家和公众之间的差异部分地被解释为非专业判断中的偏见,但斯洛维奇提请注意这些差异反映了真正的价值观冲突的情况。他指出,专家们通常用失去的生命(或生命年)的数量来衡量风险,而公众则有更精细的区分,例如 "好的死亡 " 和 "坏的死亡",或随机的意外死亡和在自愿活动(如滑雪)过程中发生的死亡。这些合理的区别往往在仅仅计算案例的统计中被忽视。 斯洛维奇根据这些观察认为,公众比专家有更丰富的风险概念。 因此,他强烈反对专家应该 ,当他们的意见与其他公民的意见和愿望相冲突时,应该毫无疑义地接受。 他说,当专家和公众在他们的优先事项上有分歧时,"双方都必须尊重对方的见解和智慧。"
In his desire to wrest sole control of risk policy from experts, Slovic has challenged the foundation of their expertise: the idea that risk is objective.
在 ,他希望从专家手中夺取对风险政策的唯一控制权,Slovic挑战了他们的专业知识的基础: ,即风险是客观的想法。
“Risk” does not exist “out there,” independent of our minds and culture, waiting to be measured. Human beings have invented the concept of “risk” to help them understand and cope with the dangers and uncertainties of life. Although these dangers are real, there is no such thing as “real risk” or “objective risk.”
"风险 "并不存在 于 "外面",独立于我们的思想和文化之外,等待我们去测量。 人类发明了 "风险 "的概念,以帮助他们理解和应对生活中的危险和不确定性。 尽管这些危险是真实的,但并不存在 "真正的风险 "或 "客观风险"。
To illustrate his claim, Slovic lists nine ways of defining the mortality risk associated with the release of a toxic material into the air, ranging from “death per million people” to “death per million dollars of product produced.” His point is that the evaluation of the risk depends on the choice of a measure—with the obvious possibility that the choice may have been guided by a preference for one outcome or another. He goes on to conclude that “defining risk is thus an exercise in power.” You might not have guessed that one can get to such thorny policy issues from experimental studies of the psychology of judgment! However, policy is ultimately about people, what they want and what is best for them. Every policy question involves assumptions about human nature, in particular about the choices that people may make and the consequences of their choices for themselves and for society.
为了说明他的主张,Slovic列出了定义与释放到空气中的有毒物质相关的死亡风险的九种方法 ,从 "每百万人死亡 "到 "每百万美元产品生产的死亡"。他的观点是,对风险的评估取决于对衡量标准的选择--显然,这种选择有可能是由对某种结果的偏好所引导。他继续总结说, "因此,对风险的定义是一种权力的行使"。你可能没有猜到,人们可以从判断心理学的实验研究中得到如此棘手的政策问题!这是很重要的。然而,政策最终是关于人的,他们想要什么,什么对他们最好。 每一个政策问题都涉及对人性的假设,特别是关于人们可能做出的选择以及他们的选择对自己和社会的后果 。
Another scholar and friend whom I greatly admire, Cass Sunstein, disagrees sharply with Slovic’s stance on the different views of experts and citizens, and defends the role of experts as a bulwark against “populist” excesses. Sunstein is one of the foremost legal scholars in the United States, and shares with other leaders of his profession the attribute of intellectual fearlessness. He knows he can master any body of knowledge quickly and thoroughly, and he has mastered many, including both the psychology of judgment and choice and issues of regulation and risk policy. His view is that the existing system of regulation in the United States displays a very poor setting of priorities, which reflects reaction to public pressures more than careful objective analysis. He starts from the position that risk regulation and government intervention to reduce risks should be guided by rational weighting of costs and benefits, and that the natural units for this analysis are the number of lives saved (or perhaps the number of life-years saved, which gives more weight to saving the young) and the dollar cost to the economy. Poor regulation is wasteful of lives and money, both of which can be measured objectively. Sunstein has not been persuaded by Slovic’s argument that risk and its measurement is subjective. Many aspects of risk assessment are debatable, but he has faith in the objectivity that may be achieved by science, expertise, and careful deliberation.
另一位我非常敬佩的学者和朋友卡斯-桑斯坦(Cass Sunstein)与斯洛维奇关于专家和公民的不同观点的立场截然不同,并为专家作为防止 "民粹主义 "过度行为的堡垒的作用辩护。 桑斯坦是美国最重要的法律学者之一,他与他的行业的其他领导人一样,具有智力无畏的 特性。他知道自己可以迅速而彻底地掌握任何知识体系,他已经掌握了许多知识,包括判断和选择的心理学以及监管和风险政策问题。 他的观点是,美国现有的监管体系显示出非常糟糕的优先事项设置,这反映了对公众压力的反应,而不是 仔细的客观分析。他的出发点是,为减少风险而进行的风险监管和政府干预应该以成本和收益的合理加权为指导,这种分析的自然单位是被拯救的生命数量(或者也许是被拯救的生命年数,这对拯救年轻人给予更多的重视)和经济的美元成本。监管不力 ,是对生命和金钱的浪费,而这两点都是可以客观衡量的。Sunstein没有被Slovic的 ,即风险及其衡量是主观的说法所说服。风险评估的许多方面是值得商榷的,但他对科学、专业知识和仔细审议可能实现的客观性有信心。
Sunstein came to believe that biased reactions to risks are an important source of erratic and misplaced priorities in public policy. Lawmakers and regulators may be overly responsive to the irrational concerns of citizens, both because of political sensitivity and because they are prone to the same cognitive biases as other citizens.
Sunstein后来认为,对风险的有偏见的反应是公共政策中反复无常和错位的重要 。立法者和监管者可能对公民的非理性关切做出过度反应,这既是因为政治敏感性,也是因为他们容易产生与其他公民一样的认知偏见。
Sunstein and a collaborator, the jurist Timur Kuran, invented a name for the mechanism through which biases flow into policy: the availability cascade . They comment that in the social context, “all heuristics are equal, but availability is more equal than the others.” They have in mind an expanded notion of the heuristic, in which availability provides a heuristic for judgments other than frequency. In particular, the importance of an idea is often judged by the fluency (and emotional charge) with which that idea comes to mind.
孙斯坦和一位合作者,法学家Timur Kuran,为偏见流入政策的机制发明了一个名字: 可用性级联 。他们评论说,在社会背景下,"所有启发式方法都是平等的,但可用性比其他方法更平等"。他们想到的是启发式的扩展概念,即可用性为频率以外的判断提供了一个启发式。 特别是,一个想法的重要性往往是通过该想法出现在脑海中的流畅性(和情感上的负担)来判断的。
An availability cascade is a self-sustaining chain of events, which may start from media reports of a relatively minor event and lead up to public panic and large-scale government action. On some occasions, a media story about a risk catches the attention of a segment of the public, which becomes aroused and worried. This emotional reaction becomes a story in itself, prompting additional coverage in the media, which in turn produces greater concern and involvement. The cycle is sometimes sped along deliberately by “availability entrepreneurs,” individuals or organizations who work to ensure a continuous flow of worrying news. The danger is increasingly exaggerated as the media compete for attention-grabbing headlines. Scientists and others who try to dampen the increasing fear and revulsion attract little attention, most of it hostile: anyone who claims that the danger is overstated is suspected of association with a “heinous cover-up.” The issue becomes politically important because it is on everyone’s mind, and the response of the political system is guided by the intensity of public sentiment. The availability cascade has now reset priorities. Other risks, and other ways that resources could be applied for the public good, all have faded into the background.
可用性 级联是一个自我维持的事件链,它可能从媒体对一个相对较小的事件的报道开始,并导致公众恐慌和大规模的政府行动。 在某些情况下,媒体对某一风险的报道吸引了一部分公众的注意,他们变得兴奋和担忧。这种情绪反应本身就成为一个故事,促使媒体进行更多的报道, ,这反过来又产生了更大的关注和参与。这种循环有时会被 "可得性企业家 "故意加速,这些人或组织努力确保令人担忧的新闻不断涌现。由于媒体争相抢夺引人注意的头条新闻,这种危险被日益夸大。试图抑制日益增长的恐惧和反感的科学家和其他人吸引了少量 ,其中大部分是敌意:任何声称危险被夸大的人都被怀疑与 "令人发指的掩盖 "有关。这个问题在政治上变得很重要,因为它在每个人的心目中,而政治系统的反应是由公众情绪的强度所引导的。 现在,可用性的级联已经重置了优先级。 其他风险,以及其他可以应用于公共利益的资源 ,都已经淡出了背景。
Kuran and Sunstein focused on two examples that are still controversial: the Love Canal affair and the so-called Alar scare. In Love Canal, buried toxic waste was exposed during a rainy season in 1979, causing contamination of the water well beyond standard limits, as well as a foul smell. The residents of the community were angry and frightened, and one of them, Lois Gibbs, was particularly active in an attempt to sustain interest in the problem. The availability cascade unfolded according to the standard script. At its peak there were daily stories about Love Canal, scientists attempting to claim that the dangers were overstated were ignored or shouted down, ABC News aired a program titled The Killing Ground , and empty baby-size coffins were paraded in front of the legislature. A large number of residents were relocated at government expense, and the control of toxic waste became the major environmental issue of the 1980s. The legislation that mandated the cleanup of toxic sites, called CERCLA, established a Superfund and is considered a significant achievement of environmental legislation. It was also expensive, and some have claimed that the same amount of money could have saved many more lives if it had been directed to other priorities. Opinions about what actually happened at Love Canal are still sharply divided, and claims of actual damage to health appear not to have been substantiated. Kuran and Sunstein wrote up the Love Canal story almost as a pseudo-event, while on the other side of the debate, environmentalists still speak of the “Love Canal disaster.”
Kuran和Sunstein集中讨论了两个仍有争议的例子:爱运河事件和所谓的阿拉尔恐慌。 在爱运河事件中,埋藏的有毒废物在1979年的一个雨季被暴露出来,导致水的污染远远超过标准限度,并产生恶臭。社区的居民们 ,感到愤怒和恐惧,其中一个人, Lois Gibbs,特别积极地试图维持对这个问题的兴趣。可用性级联按照标准脚本展开。在它的高峰期,每天都有关于爱情运河的故事,试图声称危险被夸大的科学家被忽视或喊停,ABC新闻播出了一个名为 《杀戮之地》 的节目, ,空的婴儿大小的棺材在立法机构前被游行。大量的居民由政府出资重新安置,有毒废物的控制成为80年代的主要环境问题。授权清理有毒场地的立法,称为CERCLA,建立了超级基金,被认为是环境立法的重大成就。它也很昂贵, ,一些人声称,如果把同样的钱用于其他优先事项,可以拯救更多的生命。关于爱运河实际发生的事情,人们的意见仍有很大分歧,关于对健康造成实际损害的说法似乎没有得到证实。 库兰和桑斯坦把爱运河的故事几乎写成了一个伪事件,而在 ,环保主义者仍在谈论 "爱运河灾难"。
Opinions are also divided on the second example Kuran and Sunstein used to illustrate their concept of an availability cascade, the Alar incident, known to detractors of environmental concerns as the “Alar scare” of 1989. Alar is a chemical that was sprayed on apples to regulate their growth and improve their appearance. The scare began with press stories that the chemical, when consumed in gigantic doses, caused cancerous tumors in rats and mice. The stories understandably frightened the public, and those fears encouraged more media coverage, the basic mechanism of an availability cascade. The topic dominated the news and produced dramatic media events such as the testimony of the actress Meryl Streep before Congress. The apple industry sustained large losses as apples and apple products became objects of fear. Kuran and Sunstein quote a citizen who called in to ask “whether it was safer to pour apple juice down the drain or to take it to a toxic waste dump.” The manufacturer withdrew the product and the FDA banned it. Subsequent research confirmed that the substance might pose a very small risk as a possible carcinogen, but the Alar incident was certainly an enormous overreaction to a minor problem. The net effect of the incident on public health was probably detrimental because fewer good apples were consumed.
对于Kuran和Sunstein用来说明他们的可用性级联概念的第二个例子,即Alar事件,被环境问题的诋毁者称为1989年的 "Alar恐慌",也存在意见分歧。Alar是一种化学品,被喷洒在苹果上以调节其生长并改善其外观。 恐慌始于新闻报道,即这种化学品在以巨大的剂量食用时,会在大鼠和小鼠身上引起癌症肿瘤。可以理解的是,这些故事让公众感到恐惧,而这些恐惧鼓励了更多的媒体报道,这就是可用性级联的基本机制。这个话题主导了新闻,并产生了戏剧性的媒体事件,如女演员梅丽尔-斯特里普在国会作证。 由于苹果和苹果产品成为恐惧的对象,苹果行业蒙受了巨大的损失。 库兰和桑斯坦引用了一位市民的电话,询问 "是把苹果汁倒进下水道还是把它送到有毒废物堆里更安全"。制造商撤回了该产品,美国食品和药物管理局禁止了该产品。随后的研究证实,该物质作为一种可能的 致癌物,可能构成非常小的风险,但Alar事件无疑是对一个小问题的巨大过度反应。该事件对公众健康的净影响可能是有害的,因为被消费的好苹果更少。
The Alar tale illustrates a basic limitation in the ability of our mind to deal with small risks: we either ignore them altogether or give them far too much weight— nothing in between . Every parent who has stayed up waiting for a teenage daughter who is late from a party will recognize the feeling. You may know that there is really (almost) nothing to worry about, but you cannot help images of disaster from coming to mind. As Slovic has argued, the amount of concern is not adequately sensitive to the probability of harm; you are imagining the numerator—the tragic story you saw on the news—and not thinking about the denominator. Sunstein has coined the phrase “probability neglect” to describe the pattern. The combination of probability neglect with the social mechanisms of availability cascades inevitably leads to gross exaggeration of minor threats, sometimes with important consequences.
阿拉尔的故事说明了我们处理小风险的能力的一个基本限制:我们要么完全忽视它们,要么给予它们太多重视-- 两者之间没有任何区别 。每个 ,熬夜等待 ,等待从派对上迟到的十几岁的女儿的父母都会认识到这种感觉。你可能知道真的(几乎)没有什么可担心的,但你不能不想到灾难的形象。正如斯洛维奇所认为的,关注的数量对伤害的概率并不充分敏感;你想象的是分子--你在 新闻上看到的悲剧故事,没有想到分母。孙斯坦创造了 "概率忽视 "这一短语来描述这种模式。忽视概率与可得性级联的社会机制相结合,不可避免地导致对小威胁的严重夸大,有时会造成重要的后果。
In today’s world, terrorists are the most significant practitioners of the art of inducing availability cascades. With a few horrible exceptions such as 9/11, the number of casualties from terror attacks is very small relative to other causes of death. Even in countries that have been targets of intensive terror campaigns, such as Israel, the weekly number of casualties almost never came close to the number of traffic deaths. The difference is in the availability of the two risks, the ease and the frequency with which they come to mind. Gruesome images, endlessly repeated in the media, cause everyone to be on edge. As I know from experience, it is difficult to reason oneself into a state of complete calm. Terrorism speaks directly to System 1.
在当今世界,恐怖分子是 诱发可用性级联的最重要的实践者。除了一些可怕的例外,如9/11事件,恐怖袭击造成的伤亡人数相对于其他死亡原因来说是非常小的。即使是在那些成为密集恐怖活动目标的国家,如以色列,每周的伤亡人数几乎从未接近交通事故死亡人数。区别在于这两种风险的可得性, ,它们出现在人们脑海中的难易程度和频率。 可怕的图像,在媒体上无休止地重复,使每个人都感到紧张。我从经验中知道,要把自己推到完全平静的状态是很难的。恐怖主义直接与系统1对话。
Where do I come down in the debate between my friends? Availability cascades are real and they undoubtedly distort priorities in the allocation of public resources. Cass Sunstein would seek mechanisms that insulate decision makers from public pressures, letting the allocation of resources be determined by impartial experts who have a broad view of all risks and of the resources available to reduce them. Paul Slovic trusts the experts much less and the public somewhat more than Sunstein does, and he points out that insulating the experts from the emotions of the public produces policies that the public will reject—an impossible situation in a democracy. Both are eminently sensible, and I agree with both.
在我的朋友之间的辩论中,我的立场是什么?可用性级联是真实存在的,它们无疑扭曲了 公共资源分配的优先次序。 Cass Sunstein将寻求使决策者免受公众压力的机制,让资源分配由公正的专家决定,他们对所有风险和可用于减少风险的资源有广泛的看法。保罗-斯洛维奇比孙斯坦更不信任专家,而更信任公众,他指出,将专家与公众的情绪隔绝 ,会产生公众会拒绝的政策--这在民主制度中是不可能的情况。两者都是非常明智的,我也同意两者。
I share Sunstein’s discomfort with the influence of irrational fears and availability cascades on public policy in the domain of risk. However, I also share Slovic’s belief that widespread fears, even if they are unreasonable, should not be ignored by policy makers. Rational or not, fear is painful and debilitating, and policy makers must endeavor to protect the public from fear, not only from real dangers.
我和Sunstein一样,对非理性的恐惧和可得性连带对风险领域的公共政策的影响感到不安。 然而,我也和Slovic一样认为,广泛的恐惧,即使它们是 不合理的,也不应该被政策制定者忽视。无论理性与否,恐惧都是痛苦和衰弱的,政策制定者必须努力保护公众免受恐惧,而不仅仅是免受真正的危险。
Slovic rightly stresses the resistance of the public to the idea of decisions being made by unelected and unaccountable experts. Furthermore, availability cascades may have a long-term benefit by calling attention to classes of risks and by increasing the overall size of the risk-reduction budget. The Love Canal incident may have caused excessive resources to be allocated to the management of toxic waste, but it also had a more general effect in raising the priority level of environmental concerns. Democracy is inevitably messy, in part because the availability and affect heuristics that guide citizens’ beliefs and attitudes are inevitably biased, even if they generally point in the right direction. Psychology should inform the design of risk policies that combine the experts’ knowledge with the public’s emotions and intuitions.
斯洛维奇正确地强调了公众对由未经选举和不负责任的专家作出决定的想法的抵制。此外,可用性级联可能有一个长期的好处,即呼吁 注意风险的类 ,并增加减少风险预算的总体规模。爱情运河事件可能导致了过多的资源被分配到有毒废物的管理上,但它也在提高环境问题的优先级方面产生了更普遍的影响。 民主不可避免地是混乱的,部分原因是指导 公民的信念和态度的可用性和影响启发式方法不可避免地有偏见,即使它们通常指向正确的方向。心理学应该为设计风险政策提供信息,将专家的知识与公众的情感和直觉相结合。
SPEAKING OF AVAILABILITY CASCADES
说到可得性级联
“She’s raving about an innovation that has large benefits and no costs. I suspect the affect heuristic.”
"她在大肆宣扬一项有巨大利益而没有成本的创新。 我怀疑是影响启发式的。"
“This is an availability cascade: a nonevent that is inflated by the media and the public until it fills our TV screens and becomes all anyone is talking about.”
"这是一个可用性 级联:一个非事件被媒体和公众夸大,直到它填满我们的电视屏幕,成为所有人都在谈论的事情。"
14
14
Tom W’s Specialty
汤姆-W的专长
Have a look at a simple puzzle:
请看一个简单的拼图。
Tom W is a graduate student at the main university in your state. Please rank the following nine fields of graduate specialization in order of the likelihood that Tom W is now a student in each of these fields. Use 1 for the most likely, 9 for the least likely.
汤姆-W是你所在州的主要大学的一名研究生。请按照汤姆-W现在在以下九个研究生专业领域的可能性排序: ,每个领域的学生。用1表示最可能,用9表示最不可能。
business administration
商业管理
computer science
计算机科学
engineering
工程
humanities and education
人文学科和教育
law
法律
medicine
医学
library science
图书馆学
physical and life sciences
物理科学和生命科学
social science and social work
社会科学和社会工作
This question is easy, and you knew immediately that the relative size of enrollment in the different fields is the key to a solution. So far as you know, Tom W was picked at random from the graduate students at the university, like a single marble drawn from an urn. To decide whether a marble is more likely to be red or green, you need to know how many marbles of each color there are in the urn. The proportion of marbles of a particular kind is called a base rate . Similarly, the base rate of humanities and education in this problem is the proportion of students of that field among all the graduate students. In the absence of specific information about Tom W, you will go by the base rates and guess that he is more likely to be enrolled in humanities and education than in computer science or library science, because there are more students overall in the humanities and education than in the other two fields. Using base-rate information is the obvious move when no other information is provided.
这个问题很简单,你马上就知道,不同领域招生的相对规模是解决问题的关键。 就你所知,汤姆-W是在大学的研究生中随机挑选的,就像从骨灰盒中抽出的一颗弹珠。要决定一个弹珠是红色还是绿色的可能性更大,你需要知道瓮里有多少种颜色的弹珠 。某一种类的大理石的比例被称为 基准率 。同样,在这个 问题中,人文和教育的基本比率是该领域的学生在所有研究生中的比例。在没有汤姆-W的具体信息的情况下,你会根据基本比率,猜测他更有可能被录取在人文和教育领域,而不是计算机科学或图书馆科学,因为人文和教育领域的学生总体上比其他两个领域多。在没有提供其他信息的情况下,使用基数 信息是明显的举措。
Next comes a task that has nothing to do with base rates.
接下来是一项与基准利率无关的任务。
The following is a personality sketch of Tom W written during Tom’s senior year in high school by a psychologist, on the basis of psychological tests of uncertain validity:
以下是一位心理学家在汤姆高三时,根据有效性不确定的心理测试,为汤姆-W写的人格素描。
Tom W is of high intelligence, although lacking in true creativity. He has a need for order and clarity, and for neat and tidy systems in which every detail finds its appropriate place. His writing is rather dull and mechanical, occasionally enlivened by somewhat corny puns and flashes of imagination of the sci-fi type. He has a strong drive for competence. He seems to have little feel and little sympathy for other people, and does not enjoy interacting with others. Self-centered, he nonetheless has a deep moral sense.
汤姆-W具有很高的智力,尽管缺乏真正的创造力。他需要秩序 ,需要清晰,需要整齐划一的系统,每一个细节都能找到其适当的位置。他的写作相当沉闷和机械,偶尔会有一些老套的双关语和科幻类型的想象力的闪现,使他的写作活跃起来。他对能力有很强的驱动力。他似乎对其他人没有什么感觉,也没有什么同情心,不喜欢与人交往。他以自我为中心,然而 ,具有深刻的道德感。
Now please take a sheet of paper and rank the nine fields of specialization listed below by how similar the description of Tom W is to the typical graduate student in each of the following fields. Use 1 for the most likely and 9 for the least likely.
现在请你拿一张纸,按照对汤姆-W的描述与以下各领域的典型研究生的相似程度,对下面列出的九个专业领域进行排名。用1表示最可能,用9表示最不可能。
You will get more out of the chapter if you give the task a quick try; reading the report on Tom W is necessary to make your judgments about the various graduate specialties.
如果你快速尝试一下这个任务,你会从这一章中得到更多的收获;阅读关于汤姆-W的报告是必要的,可以使你对各种研究生专业的判断 。
This question too is straightforward. It requires you to retrieve, or perhaps to construct, a stereotype of graduate students in the different fields. When the experiment was first conducted, in the early 1970s, the average ordering was as follows. Yours is probably not very different:
这个问题也很直截了当。它要求你检索,或者说构建不同领域的研究生的刻板印象。 在1970年代初首次进行实验时,平均排序如下。 你的可能没有什么不同。
computer science
计算机 科学
engineering
工程
business administration
商业管理
physical and life sciences
物理科学和生命科学
library science
图书馆学
law
法律
medicine
医学
humanities and education
人文学科和教育
social science and social work
社会科学和社会工作
You probably ranked computer science among the best fitting because of hints of nerdiness (“corny puns”). In fact, the description of Tom W was written to fit that stereotype. Another specialty that most people ranked high is engineering (“neat and tidy systems”). You probably thought that Tom W is not a good fit with your idea of social science and social work (“little feel and little sympathy for other people”). Professional stereotypes appear to have changed little in the nearly forty years since I designed the description of Tom W.
你可能把计算机科学列为最适合的学科之一,因为有书呆子的暗示("老套的双关语")。 事实上,对Tom W的描述就是为了符合这种刻板印象。另一个被大多数人排在高位的专业是工程("整齐划一的系统")。你可能认为 ,汤姆-W与你对社会科学和社会工作的想法不相符合("对他人没有感觉,没有同情心")。在我设计汤姆-W的描述以来的近四十年里,专业定型观念似乎没有什么变化。
The task of ranking the nine careers is complex and certainly requires the discipline and sequential organization of which only System 2 is capable. However, the hints planted in the description (corny puns and others) were intended to activate an association with a stereotype, an automatic activity of System 1.
对九种职业进行排名的任务很复杂,当然需要有纪律和顺序的组织,而这只有系统2能够做到, 。然而,描述中埋下的伏笔(老套的双关语和其他)是为了激活与刻板印象的关联,这是系统1的自动活动。
The instructions for this similarity task required a comparison of the description of Tom W to the stereotypes of the various fields of specialization. For the purposes of that task, the accuracy of the description—whether or not it is a true portrait of Tom W—is irrelevant. So is your knowledge of the base rates of the various fields. The similarity of an individual to the stereotype of a group is unaffected by the size of the group. Indeed, you could compare the description of Tom to an image of graduate students in library science even if there is no such department at the university.
这项相似性任务的说明要求将对汤姆-W的描述与各专业领域的定型观念进行比较。就该任务而言,描述的准确性--无论 ,是否是对汤姆-W的真实写照--都无关紧要。你对各领域的基本比率的了解也是如此。一个人与一个群体的刻板印象的相似性不受这个群体规模的影响。 事实上,你可以把对汤姆的描述与图书馆学的研究生的形象进行比较,即使大学里没有这样的系。
If you examine Tom W again, you will see that he is a good fit to stereotypes of some small groups of students (computer scientists, librarians, engineers) and a much poorer fit to the largest groups (humanities and education, social science and social work). Indeed, the participants almost always ranked the two largest fields very low. Tom W was intentionally designed as an “anti-base-rate” character, a good fit to small fields and a poor fit to the most populated specialties.
如果你再研究一下汤姆-W, ,你会发现他很适合一些小的学生群体(计算机科学家、图书管理员、工程师)的刻板印象,而对最大的群体(人文和教育、社会科学和社会工作)的适应性则差得多。事实上,参与者几乎总是把这两个最大的领域排在很低的位置。Tom W被有意设计成一个 "反基数 "的角色,很适合小 ,而不适合人口最多的专业。
PREDICTING BY REPRESENTATIVENESS
通过代表性进行预测
The third task in the sequence was administered to graduate students in psychology, and it is the critical one: rank the fields of specialization in order of the likelihood that Tom W is now a graduate student in each of these fields. The members of this prediction group knew the relevant statistical facts: they were familiar with the base rates of the different fields, and they knew that the source of Tom W’s description was not highly trustworthy. However, we expected them to focus exclusively on the similarity of the description to the stereotypes—we called it representativeness —ignoring both the base rates and the doubts about the veracity of the description. They would then rank the small specialty—computer science—as highly probable, because that outcome gets the highest representativeness score.
这一连串的第三个任务是对心理学的研究生进行的,这也是最关键的任务:按照汤姆-W现在是这些领域的研究生的可能性来排列专业领域的顺序。这个预测小组的成员知道相关的统计事实: ,他们熟悉不同领域的基数,他们知道汤姆-W的描述来源的可信度不高。然而,我们希望他们只关注描述与刻板印象的相似性--我们称之为 代表性--而忽略 基本比率和对描述真实性的怀疑。 然后他们会把小专业--计算机 ,认为可能性很大,因为这个结果得到了最高的代表性得分。
Amos and I worked hard during the year we spent in Eugene, and I sometimes stayed in the office through the night. One of my tasks for such a night was to make up a description that would pit representativeness and base rates against each other. Tom W was the result of my efforts, and I completed the description in the early morning hours. The first person who showed up to work that morning was our colleague and friend Robyn Dawes, who was both a sophisticated statistician and a skeptic about the validity of intuitive judgment. If anyone would see the relevance of the base rate, it would have to be Robyn. I called Robyn over, gave him the question I had just typed, and asked him to guess Tom W’s profession. I still remember his sly smile as he said tentatively, “computer scientist?” That was a happy moment—even the mighty had fallen. Of course, Robyn immediately recognized his mistake as soon as I mentioned “base rate,” but he had not spontaneously thought of it. Although he knew as much as anyone about the role of base rates in prediction, he neglected them when presented with the description of an individual’s personality. As expected, he substituted a judgment of representativeness for the probability he was asked to assess.
阿莫斯和我在尤金度过的一年中努力工作,我有时在办公室呆到晚上。我在这样一个夜晚的任务之一是编造一个描述,将代表性和基础率对立起来。Tom W是我努力的结果,我在凌晨时分完成了描述 。那天早上第一个来上班的是我们的同事和朋友Robyn Dawes,他既是一个成熟的统计学家,也是一个对直觉判断的有效性持怀疑态度的人。 如果有人会看到基准利率的相关性,那一定是Robyn。我把罗宾叫过来,把我刚打出的问题交给他,让他猜猜汤姆-W的职业 。我还记得他试探性地说 "计算机科学家 "时的狡猾笑容。那是一个快乐的时刻--即使是强大的人也已经倒下了。当然,我一提到 "基准率",罗宾就立即意识到了自己的错误,但他并没有自发地想到这一点。尽管他和其他人一样了解基准率在预测中的作用,但当遇到个人性格的描述 ,他却忽略了这些。正如所料,他用对代表性的判断取代了要求他评估的概率。
Amos and I then collected answers to the same question from 114 graduate students in psychology at three major universities, all of whom had taken several courses in statistics. They did not disappoint us. Their rankings of the nine fields by probability did not differ from ratings by similarity to the stereotype. Substitution was perfect in this case: there was no indication that the participants did anything else but judge representativeness. The question about probability (likelihood) was difficult, but the question about similarity was easier, and it was answered instead. This is a serious mistake, because judgments of similarity and probability are not constrained by the same logical rules. It is entirely acceptable for judgments of similarity to be unaffected by base rates and also by the possibility that the description was inaccurate, but anyone who ignores base rates and the quality of evidence in probability assessments will certainly make mistakes.
随后,阿莫斯和我从三所主要大学的114名心理学研究生那里收集了对同一问题的答案,这些人都修过几门统计学课程。他们按概率对九个领域进行的排名与按与定型的相似度进行的评分没有 。在这种情况下,替代是完美的:没有迹象表明参与者除了判断代表性之外还做了其他事情。 关于概率(可能性)的问题很难,但关于相似性的问题更容易,它反而被回答了。这是一个严重的错误, ,因为对相似性和概率的判断 ,不受相同的逻辑规则的制约。对相似性的判断不受基本比率的影响,也不受描述不准确的可能性影响,这是完全可以接受的,但任何在概率评估中忽视基本比率和证据质量的人肯定会犯错误。
The concept “the probability that Tom W studies computer science” is not a simple one. Logicians and statisticians disagree about its meaning, and some would say it has no meaning at all. For many experts it is a measure of subjective degree of belief. There are some events you are sure of, for example, that the sun rose this morning, and others you consider impossible, such as the Pacific Ocean freezing all at once. Then there are many events, such as your next-door neighbor being a computer scientist, to which you assign an intermediate degree of belief—which is your probability of that event.
"Tom W学习计算机科学的概率 "这个概念不是一个简单的 。逻辑学家和统计学家对它的意义有不同意见,有些人会说它根本没有意义。 对许多专家来说,它是对主观信仰程度的一种衡量。有一些事件你是肯定的,比如说今天早上太阳升起了,还有一些你认为是不可能的,比如说太平洋一下子就结冰了。然后有许多事件,例如你的隔壁邻居 是一名计算机科学家,你对这些事件分配了一个中间的相信程度--这是你对该事件的概率。
Logicians and statisticians have developed competing definitions of probability, all very precise. For laypeople, however, probability (a synonym of likelihood in everyday language) is a vague notion, related to uncertainty, propensity, plausibility, and surprise. The vagueness is not particular to this concept, nor is it especially troublesome. We know, more or less, what we mean when we use a word such as democracy or beauty and the people we are talking to understand, more or less, what we intended to say. In all the years I spent asking questions about the probability of events, no one ever raised a hand to ask me, “Sir, what do you mean by probability?” as they would have done if I had asked them to assess a strange concept such as globability. Everyone acted as if they knew how to answer my questions, although we all understood that it would be unfair to ask them for an explanation of what the word means.
逻辑学家和统计学家对概率的定义各不相同,都非常精确。 然而,对于普通人来说,概率(在日常语言中是 可能性的 同义词)是一个模糊的概念,与不确定性、倾向性、合理性和惊喜有关。模糊性 ,并不是这个概念所特有的,也不是特别麻烦。当我们使用 民主 或美 这样的词时,我们或多或少知道自己的意思,而与我们交谈的人也或多或少明白我们想说什么。在我花了这么多年时间询问有关事件概率的问题,从来没有人举手问我:"先生,你说的概率是什么意思?" 每个人都表现得好像他们知道如何回答我的问题,尽管我们都明白,要求他们解释这个词的含义是不公平的。
People who are asked to assess probability are not stumped, because they do not try to judge probability as statisticians and philosophers use the word. A question about probability or likelihood activates a mental shotgun, evoking answers to easier questions. One of the easy answers is an automatic assessment of representativeness—routine in understanding language. The (false) statement that “Elvis Presley’s parents wanted him to be a dentist” is mildly funny because the discrepancy between the images of Presley and a dentist is detected automatically. System 1 generates an impression of similarity without intending to do so. The representativeness heuristic is involved when someone says “She will win the election; you can see she is a winner” or “He won’t go far as an academic; too many tattoos.” We rely on representativeness when we judge the potential leadership of a candidate for office by the shape of his chin or the forcefulness of his speeches.
被要求评估概率的人不会被难倒,因为他们不会像统计学家和哲学家使用 这个词那样试图判断概率。一个关于概率或可能性的问题会激活一个心理猎枪,唤起对更简单问题的回答。 其中一个简单的答案是对代表性的自动评估--这是理解语言的常规。关于 "猫王的父母希望他成为一名牙医 "的(错误)声明是温和的,因为猫王和牙医的图像之间的差异被自动检测到 。系统1产生了相似的印象,但并不打算这样做。当有人说 "她会赢得选举;你可以看到她是个赢家 "或 "他作为一个学者不会走得太远;有太多的纹身 "时,就会涉及到代表性启发式。当我们通过下巴的形状或演讲的力度 ,来判断一个办公室候选人的潜在领导力时,我们依靠的是代表性。
Although it is common, prediction by representativeness is not statistically optimal. Michael Lewis’s bestselling Moneyball is a story about the inefficiency of this mode of prediction. Professional baseball scouts traditionally forecast the success of possible players in part by their build and look. The hero of Lewis’s book is Billy Beane, the manager of the Oakland A’s, who made the unpopular decision to overrule his scouts and to select players by the statistics of past performance. The players the A’s picked were inexpensive, because other teams had rejected them for not looking the part. The team soon achieved excellent results at low cost.
虽然这很常见,但通过代表性进行预测并不是统计学上的 。迈克尔-刘易斯(Michael Lewis)的畅销书 《钱球》(Moneyball )讲述了这种预测模式的低效率。 职业棒球球探传统上会根据球员的身材和长相来预测他们的成功。刘易斯书中的主人公是奥克兰运动家队的经理比利-比恩,他 ,做出了一个不受欢迎的决定,推翻了他的球探,并根据过去的表现的统计数据来选择球员。A队挑选的球员价格低廉,因为其他球队曾因他们长得不好看而拒绝他们。 该队很快就以低成本取得了优异的成绩。
THE SINS OF REPRESENTATIVENESS
代表性的罪过
Judging probability by representativeness has important virtues: the intuitive impressions that it produces are often—indeed, usually—more accurate than chance guesses would be.
通过代表性来判断概率具有重要的优点:它所产生的直观印象 ,往往--事实上,通常--比机会猜测更准确。
On most occasions, people who act friendly are in fact friendly.
在大多数情况下,表现友好的人实际上是友好的。
A professional athlete who is very tall and thin is much more likely to play basketball than football.
一个很高很瘦的职业运动员,打篮球的可能性要比打足球大得多。
People with a PhD are more likely to subscribe to The New York Times than people who ended their education after high school.
拥有博士学位的人比那些在高中毕业后就结束教育的人更有可能订阅 《纽约时报》 。
Young men are more likely than elderly women to drive aggressively.
年轻男性比老年女性更有可能在驾驶过程中表现出攻击性 。
In all these cases and in many others, there is some truth to the stereotypes that govern judgments of representativeness, and predictions that follow this heuristic may be accurate. In other situations, the stereotypes are false and the representativeness heuristic will mislead, especially if it causes people to neglect base-rate information that points in another direction. Even when the heuristic has some validity, exclusive reliance on it is associated with grave sins against statistical logic.
在所有这些情况下,以及在其他许多情况下,支配代表性判断的定型观念是有一定道理的,而遵循这种启发式的预测可能是准确的。在其他情况下,刻板印象是错误的,代表性启发式会误导人,特别是如果它导致人们忽视了指向另一个方向的基本信息 。即使启发式有一定的有效性,完全依赖它也是与违反统计逻辑的严重罪行有关。
One sin of representativeness is an excessive willingness to predict the occurrence of unlikely (low base-rate) events. Here is an example: you see a person reading The New York Times on the New York subway. Which of the following is a better bet about the reading stranger?
代表性的一个罪过是过分愿意预测不可能发生的(低基数)事件的发生。 这里有一个例子:你看到一个人在纽约地铁上阅读 《纽约时报》 。以下哪一个是关于这个阅读的陌生人的 更好的赌注?
She has a PhD.
她拥有博士学位。
She does not have a college degree.
她没有大学学位。
Representativeness would tell you to bet on the PhD, but this is not necessarily wise. You should seriously consider the second alternative, because many more nongraduates than PhDs ride in New York subways. And if you must guess whether a woman who is described as “a shy poetry lover” studies Chinese literature or business administration, you should opt for the latter option. Even if every female student of Chinese literature is shy and loves poetry, it is almost certain that there are more bashful poetry lovers in the much larger population of business students.
代表性会告诉你把赌注押在博士身上,但这不一定是明智之举。你应该认真考虑第二种选择,因为 ,乘坐纽约地铁的非毕业生比博士多得多。如果你必须猜测一个被描述为 "害羞的诗歌爱好者 " 的女性是学习中国 文学还是工商管理,你应该选择后者。即使每个学习中国文学的女性都是害羞的,都喜欢诗歌,几乎可以肯定的是,在更大的商业学生群体中,有更多腼腆的诗歌爱好者。
People without training in statistics are quite capable of using base rates in predictions under some conditions. In the first version of the Tom W problem, which provides no details about him, it is obvious to everyone that the probability of Tom W’s being in a particular field is simply the base-rate frequency of enrollment in that field. However, concern for base rates evidently disappears as soon as Tom W’s personality is described.
没有受过统计学训练的人在某些条件下很有能力在预测中使用基数率。在汤姆-W问题的第一个 版本中,没有提供关于他的细节,对每个人来说都很明显,汤姆-W在某个领域的概率只是在该领域入学的基数率频率。然而,对基准利率的关注显然在描述汤姆-W的个性时就消失了。
Amos and I originally believed, on the basis of our early evidence, that base-rate information will always be neglected when information about the specific instance is available, but that conclusion was too strong. Psychologists have conducted many experiments in which base-rate information is explicitly provided as part of the problem, and many of the participants are influenced by those base rates, although the information about the individual case is almost always weighted more than mere statistics. Norbert Schwarz and his colleagues showed that instructing people to “think like a statistician” enhanced the use of base-rate information, while the instruction to “think like a clinician” had the opposite effect .
阿莫斯和我最初认为,根据我们早期的证据,当有关具体实例的信息可用时,基数 ,信息将 总是 被忽视,但这个结论太强烈了。心理学家进行了许多实验,在这些实验中,基数信息被明确地作为问题的一部分提供,许多参与者受到这些基数的影响,尽管关于个案的信息几乎 总是 比单纯的统计数据 更有分量 。诺伯特-施瓦茨和他的同事表明,指导人们 "像统计学家一样思考 "可以提高对基数信息的使用,而指导人们 "像临床医生一样思考 "则有 相反的效果 。
An experiment that was conducted a few years ago with Harvard undergraduates yielded a finding that surprised me: enhanced activation of System 2 caused a significant improvement of predictive accuracy in the Tom W problem. The experiment combined the old problem with a modern variation of cognitive fluency. Half the students were told to puff out their cheeks during the task, while the others were told to frown . Frowning, as we have seen, generally increases the vigilance of System 2 and reduces both overconfidence and the reliance on intuition. The students who puffed out their cheeks (an emotionally neutral expression) replicated the original results: they relied exclusively on representativeness and ignored the base rates. As the authors had predicted, however, the frowners did show some sensitivity to the base rates. This is an instructive finding.
几年前在哈佛大学本科生中进行的一项实验产生了一个令我惊讶的发现:增强系统2的激活导致了在汤姆-W问题上预测准确性的显著提高, 。该实验将这个老问题与现代认知流畅性的变化结合起来。 一半的学生被告知在任务中鼓起脸颊,而其他学生则被 告知皱起眉头 。正如我们所看到的,皱眉通常会提高系统2的警惕性,并减少过度自信和对直觉的依赖。学生 ,他们鼓起脸颊(一种情绪中立的表情)复制了原来的结果:他们完全依赖代表性,忽略了基准率。 然而,正如作者所预测的,皱眉者确实显示出对基准率的一些敏感性。 这是一个有启发性的发现。
When an incorrect intuitive judgment is made, System 1 and System 2 should both be indicted. System 1 suggested the incorrect intuition, and System 2 endorsed it and expressed it in a judgment. However, there are two possible reasons for the failure of System 2—ignorance or laziness. Some people ignore base rates because they believe them to be irrelevant in the presence of individual information. Others make the same mistake because they are not focused on the task. If frowning makes a difference, laziness seems to be the proper explanation of base-rate neglect, at least among Harvard undergrads. Their System 2 “knows” that base rates are relevant even when they are not explicitly mentioned, but applies that knowledge only when it invests special effort in the task.
当做出不正确的直觉判断时,系统1和系统2都应该被起诉。系统 1提出了不正确的直觉,而 系统2认可了它并在判断中表达了它。然而,系统2的失败有两个可能的原因:一是疏忽,二是懒惰。 有些人忽视了基数,因为他们认为在个别信息面前它们是不相关的。 其他人犯同样的错误是因为他们没有专注于任务。如果皱眉头会有影响, 懒惰似乎是对基数忽视的适当解释,至少在哈佛大学本科生中是这样。他们的系统2 "知道 "即使没有明确提到基数,也是相关的,但只有当它在任务中投入特别的努力时才会应用这一知识。
The second sin of representativeness is insensitivity to the quality of evidence. Recall the rule of System 1: WYSIATI. In the Tom W example, what activates your associative machinery is a description of Tom, which may or may not be an accurate portrayal. The statement that Tom W “has little feel and little sympathy for people” was probably enough to convince you (and most other readers) that he is very unlikely to be a student of social science or social work. But you were explicitly told that the description should not be trusted!
代表性的第二个罪过是对证据的质量不敏感。回顾系统1的规则:所见即所得。在 Tom W的例子中,激活你的联想机制的是对汤姆的描述,这可能是也可能不是一个准确的描绘。汤姆-W "对人没什么感觉,也没什么同情心 "的说法可能足以让你(和其他大多数读者)相信,他很不可能是社会科学或社会工作的学生。但有人明确告诉你,这段描述 ,不要相信!
You surely understand in principle that worthless information should not be treated differently from a complete lack of information, but WY SIATI makes it very difficult to apply that principle. Unless you decide immediately to reject evidence (for example, by determining that you received it from a liar), your System 1 will automatically process the information available as if it were true. There is one thing you can do when you have doubts about the quality of the evidence: let your judgments of probability stay close to the base rate. Don’t expect this exercise of discipline to be easy—it requires a significant effort of self-monitoring and self-control.
你肯定在原则上明白,没有价值的信息不应该和完全没有的信息区别对待,但是WY SIATI让你很难应用这个原则。 除非你立即决定拒绝证据(例如,确定你是从一个骗子那里得到的),否则你的系统1会自动处理现有的信息,好像 ,它是真的。当你对证据的质量有怀疑时,有一件事你可以做:让你对概率的判断保持在接近基本比率的水平。不要指望这种纪律性的锻炼是容易的,它需要大量的自我监督和自我控制的努力。
The correct answer to the Tom W puzzle is that you should stay very close to your prior beliefs, slightly reducing the initially high probabilities of well-populated fields (humanities and education; social science and social work) and slightly raising the low probabilities of rare specialties (library science, computer science). You are not exactly where you would be if you had known nothing at all about Tom W, but the little evidence you have is not trustworthy, so the base rates should dominate your estimates.
对汤姆-W之谜的正确答案是,你应该非常接近你的先验信念,略微降低 ,最初人口众多的领域(人文学科和教育;社会科学和社会工作)的高概率,略微提高稀有专业(图书馆学、计算机科学)的低概率。你并不完全是在你对汤姆-W一无所知的情况下,但你拥有的少量证据是不可信的,所以基础率应该主导你的估计。
HOW TO DISCIPLINE INTUITION
如何约束直觉
Your probability that it will rain tomorrow is your subjective degree of belief, but you should not let yourself believe whatever comes to your mind. To be useful, your beliefs should be constrained by the logic of probability. So if you believe that there is a 40% chance that it will rain sometime tomorrow, you must also believe that there is a 60% chance it will not rain tomorrow, and you must not believe that there is a 50% chance that it will rain tomorrow morning. And if you believe that there is a 30% chance that candidate X will be elected president, and an 80% chance that he will be reelected if he wins the first time, then you must believe that the chances that he will be elected twice in a row are 24%.
你认为明天会下雨的概率是你主观的相信程度,但你不应该让自己相信你脑海中出现的任何东西。 为了有用,你的信念应该受到概率逻辑的约束。因此,如果你相信有40%的机会明天某个时候会下雨,你也必须相信有60%的机会明天不会下雨 ,你也不能相信有50%的机会明天早上会下雨。而如果你相信候选人X有30%的机会当选总统,如果他第一次获胜,有80%的机会连任,那么你一定相信他连续两次当选的机会是24%。
The relevant “rules” for cases such as the Tom W problem are provided by Bayesian statistics. This influential modern approach to statistics is named after an English minister of the eighteenth century, the Reverend Thomas Bayes, who is credited with the first major contribution to a large problem: the logic of how people should change their mind in the light of evidence. Bayes’s rule specifies how prior beliefs (in the examples of this chapter, base rates) should be combined with the diagnosticity of the evidence, the degree to which it favors the hypothesis over the alternative. For example, if you believe that 3% of graduate students are enrolled in computer science (the base rate), and you also believe that the description of Tom W is 4 times more likely for a graduate student in that field than in other fields, then Bayes’s rule says you must believe that the probability that Tom W is a computer scientist is now 11%. If the base rate had been 80%, the new degree of belief would be 94.1%. And so on.
对于像汤姆 W问题这样的情况,相关的 "规则 "是由贝叶斯统计学提供的。这种有影响力的现代统计方法是以十八世纪的一位英国牧师托马斯-贝叶斯牧师命名的,他被认为是对一个大问题的第一个重大贡献:人们应该如何根据证据改变他们的想法的逻辑。 贝叶斯规则 规定了先验信念(在本章的例子中, 基准率)应如何与证据的诊断性相结合,即证据对假设的偏爱程度。例如,如果你相信有3%的研究生就读于计算机科学专业(基本比率),而且你还相信汤姆-W的描述对于该领域的研究生来说是其他领域的4倍,那么贝叶斯法则 ,你必须相信汤姆-W是计算机科学家的概率现在是11%。如果基准率是80%,那么新的相信程度将是94.1%。以此类推。
The mathematical details are not relevant in this book. There are two ideas to keep in mind about Bayesian reasoning and how we tend to mess it up. The first is that base rates matter, even in the presence of evidence about the case at hand. This is often not intuitively obvious. The second is that intuitive impressions of the diagnosticity of evidence are often exaggerated. The combination of WY SIATI and associative coherence tends to make us believe in the stories we spin for ourselves. The essential keys to disciplined Bayesian reasoning can be simply summarized:
数学上的细节在本书中并不相关。关于贝叶斯推理和我们如何倾向于把它搞乱,有两个想法要牢记。首先是基准费率很重要,即使在有关于 手头案件的证据的情况下。这往往不是直觉上的显而易见。第二,对证据诊断性的直观印象往往被夸大了。 WY SIATI和联想一致性的结合往往使我们相信我们为自己编织的故事。 严谨的贝叶斯推理的基本关键可以简单概括。
Anchor your judgment of the probability of an outcome on a plausible base rate.
将你对一个 结果的概率的判断建立在一个可信的基准利率上。
Question the diagnosticity of your evidence.
质疑你的证据的诊断性。
Both ideas are straightforward. It came as a shock to me when I realized that I was never taught how to implement them, and that even now I find it unnatural to do so.
这两种想法都很直截了当。当我意识到从来没有人教过我如何实施它们时,我感到很震惊,甚至现在我发现这样做是不自然的。
SPEAKING OF REPRESENTATIVENESS
谈到 代表性
“The lawn is well trimmed, the receptionist looks competent, and the furniture is attractive, but this doesn’t mean it is a well-managed company. I hope the board does not go by representativeness.”
"草坪修剪得很好,接待员看起来很能干,家具也很有吸引力,但这并不意味着 ,这是一家管理良好的公司。我希望董事会不要以代表性为标准"。
“This start-up looks as if it could not fail, but the base rate of success in the industry is extremely low. How do we know this case is different?”
"这个初创企业看起来好像不会失败,但这个行业的基本成功率极低。我们怎么知道这个案子有什么不同呢?"
“They keep making the same mistake: predicting rare events from weak evidence. When the evidence is weak, one should stick with the base rates.”
"他们一直在犯同样的错误:从薄弱的证据中预测罕见的事件。 当证据薄弱时,人们应该坚持使用基本比率。"
“I know this report is absolutely damning, and it may be based on solid evidence, but how sure are we? We must allow for that uncertainty in our thinking.”
"我知道这份报告 ,它绝对是有害的,它可能是基于坚实的证据,但我们有多大把握? 我们必须在思考中允许这种不确定性。"
15
15
Linda: Less is More
琳达:少即是多
The best-known and most controversial of our experiments involved a fictitious lady called Linda. Amos and I made up the Linda problem to provide conclusive evidence of the role of heuristics in judgment and of their incompatibility with logic. This is how we described Linda:
在我们的实验中,最著名和最有争议的是一位名叫琳达的虚构女士。阿莫斯和我编造了琳达问题,为 启发式方法 在判断中的 作用 提供了确凿的证据,并 ,说明其与逻辑不相容。这就是我们对琳达的描述。
Linda is thirty-one years old, single, outspoken, and very bright. She majored in philosophy. As a student, she was deeply concerned with issues of discrimination and social justice, and also participated in antinuclear demonstrations.
琳达今年三十一岁,单身,直言不讳,非常聪明。她主修的是哲学。作为一名学生,她深切关注歧视和社会正义问题,还参加了反核示威活动。
The audiences who heard this description in the 1980s always laughed because they immediately knew that Linda had attended the University of California at Berkeley, which was famous at the time for its radical, politically engaged students. In one of our experiments we presented participants with a list of eight possible scenarios for Linda. As in the Tom W problem, some ranked the scenarios by representativeness, others by probability. The Linda problem is similar, but with a twist.
20世纪80年代听到这种描述的观众总是大笑,因为他们立即 ,知道琳达曾在加州大学伯克利分校就读,该校当时以其激进的、参与政治的学生而闻名。 在我们的一个实验中,我们向参与者提供了一份琳达的八种可能情景的清单。与汤姆-W的问题一样,一些人按代表性对方案进行排名,另一些人则按概率进行排名。 琳达的问题类似,但有一个转折。
Linda is a teacher in elementary school.
琳达是一名小学教师。
Linda works in a bookstore and takes yoga classes.
琳达在一家书店工作并参加瑜伽课程。
Linda is active in the feminist movement.
琳达积极参加女权运动。
Linda is a psychiatric social worker.
琳达是一名精神病学社会工作者。
Linda is a member of the League of Women Voters.
琳达是妇女选民联盟的成员。
Linda is a bank teller.
琳达 ,是一名银行出纳员。
Linda is an insurance salesperson.
琳达是一名保险销售员。
Linda is a bank teller and is active in the feminist movement.
琳达是一名银行出纳员,积极参加女权运动。
The problem shows its age in several ways. The League of Women Voters is no longer as prominent as it was, and the idea of a feminist “movement” sounds quaint, a testimonial to the change in the status of women over the last thirty years. Even in the Facebook era, however, it is still easy to guess the almost perfect consensus of judgments: Linda is a very good fit for an active feminist, a fairly good fit for someone who works in a bookstore and takes yoga classes—and a very poor fit for a bank teller or an insurance salesperson.
这个问题在几个方面显示了它的年龄。 妇女选民联盟不再像以前那样突出,女权主义 "运动 "的想法听起来很陈旧,是过去三十年来妇女地位变化的证明。然而,即使在Facebook时代,仍然很容易猜到几乎完美的判断共识:琳达非常适合活跃的女权主义者,相当适合在书店工作的人 ,并参加瑜伽课程--而非常不适合银行柜员或保险销售人员。
Now focus on the critical items in the list: Does Linda look more like a bank teller, or more like a bank teller who is active in the feminist movement? Everyone agrees that Linda fits the idea of a “feminist bank teller” better than she fits the stereotype of bank tellers. The stereotypical bank teller is not a feminist activist, and adding that detail to the description makes for a more coherent story.
现在关注清单中的关键项目:琳达看起来更像一个银行出纳员,还是更像一个积极参加女权运动的银行出纳员?大家都同意,琳达符合 "女权主义银行出纳员 "的概念,而不是符合银行出纳员的刻板印象。刻板的银行出纳员 ,不是女权主义活动家,在描述中加入这一细节,使故事更加连贯。
The twist comes in the judgments of likelihood, because there is a logical relation between the two scenarios. Think in terms of Venn diagrams. The set of feminist bank tellers is wholly included in the set of bank tellers, as every feminist bank teller is a bank teller. Therefore the probability that Linda is a feminist bank teller must be lower than the probability of her being a bank teller. When you specify a possible event in greater detail you can only lower its probability. The problem therefore sets up a conflict between the intuition of representativeness and the logic of probability.
转折出现在对可能性的判断上,因为两种情况之间存在着逻辑关系。 用维恩图的方式思考。女权主义银行出纳员的集合完全包括在银行出纳员的集合中,因为每个女权主义银行出纳员都是一个银行出纳员。因此,琳达是一个女权主义银行出纳员的概率 , 一定 低于她是一个银行出纳员的概率。当你更详细地说明一个可能的事件时,你只能降低它的概率。 因此,这个问题在代表性的直觉和概率的逻辑之间建立了冲突。
Our initial experiment was between-subjects. Each participant saw a set of seven outcomes that included only one of the critical items (“bank teller” or “feminist bank teller”). Some ranked the outcomes by resemblance, others by likelihood. As in the case of Tom W, the average rankings by resemblance and by likelihood were identical; “feminist bank teller” ranked higher than “bank teller” in both.
我们最初的实验是在被试之间进行的。每个被试都看到了一组七种结果, ,其中只包括一个关键项目("银行出纳员 "或 "女权银行出纳员")。有些人按相似度对这些结果进行排序,有些人则按可能性排序。和汤姆-W的情况一样,按相似度和按可能性的平均排名是相同的;"女权主义银行出纳员 "在两者中的排名都高于 "银行出纳员"。
Then we took the experiment further, using a within-subject design. We made up the questionnaire as you saw it, with “bank teller” in the sixth position in the list and “feminist bank teller” as the last item. We were convinced that subjects would notice the relation between the two outcomes, and that their rankings would be consistent with logic. Indeed, we were so certain of this that we did not think it worthwhile to conduct a special experiment. My assistant was running another experiment in the lab, and she asked the subjects to complete the new Linda questionnaire while signing out, just before they got paid.
然后我们进一步进行了实验,使用了主体内设计。我们按照你所看到的那样,把问卷 ,把 "银行出纳员 "放在列表的第六位,把 "女权主义银行出纳员 "作为最后一项。我们相信,受试者会注意到两种结果之间的关系,而且他们的排名会符合逻辑。事实上,我们非常肯定这一点,以至于我们认为不值得进行特别的实验。我的助手正在实验室里进行另一个实验 ,她要求受试者在签出时完成新的琳达问卷,就在他们拿到钱之前。
About ten questionnaires had accumulated in a tray on my assistant’s desk before I casually glanced at them and found that all the subjects had ranked “feminist bank teller” as more probable than “bank teller.” I was so surprised that I still retain a “flashbulb memory” of the gray color of the metal desk and of where everyone was when I made that discovery. I quickly called Amos in great excitement to tell him what we had found: we had pitted logic against representativeness, and representativeness had won!
大约 ,十份问卷已经积压在我助手桌上的一个托盘里,然后我随手扫了一眼,发现所有受试者都把 "女权主义银行出纳员 "列为比 "银行出纳员 "更有可能。我是如此惊讶,以至于我仍然保留着 "闪光记忆 " ,即金属桌子的灰色,以及我有此发现时每个人都在哪里。我非常兴奋地迅速打电话给阿莫斯,告诉他我们的发现:我们把逻辑与代表性对立起来,而代表性赢了!"。
In the language of this book, we had observed a failure of System 2: our participants had a fair opportunity to detect the relevance of the logical rule, since both outcomes were included in the same ranking. They did not take advantage of that opportunity. When we extended the experiment, we found that 89% of the undergraduates in our sample violated the logic of probability. We were convinced that statistically sophisticated respondents would do better, so we administered the same questionnaire to doctoral students in the decision-science program of the Stanford Graduate School of Business, all of whom had taken several advanced courses in probability, statistics, and decision theory. We were surprised again: 85% of these respondents also ranked “feminist bank teller” as more likely than “bank teller.”
在这本书的语言中,我们曾观察到系统2的失败:我们的参与者有一个公平的机会来检测逻辑规则的相关性, ,因为这两种结果都包含在同一个排名中。当我们扩展实验时,我们发现样本中89%的本科生违反了概率的逻辑。我们相信统计学上的成熟受访者会做得更好,所以我们对斯坦福大学商学院决策科学项目的博士生进行了同样的问卷调查 ,他们都修过几门关于概率、统计学和决策理论的高级课程。我们再次感到惊讶:这些受访者中85%的人也将 "女权主义银行出纳员 "列为比 "银行出纳员 "更有可能。
In what we later described as “increasingly desperate” attempts to eliminate the error, we introduced large groups of people to Linda and asked them this simple question:
在我们后来描述为 "越来越绝望 "的消除错误的尝试中,我们向大群人介绍了琳达 ,并问他们这个简单的问题。
Which alternative is more probable?
哪种选择更有可能?
Linda is a bank teller.
琳达是一名银行出纳员。
Linda is a bank teller and is active in the feminist movement.
琳达是一名银行出纳员,积极参加女权运动。
This stark version of the problem made Linda famous in some circles, and it earned us years of controversy. About 85% to 90% of undergraduates at several major universities chose the second option, contrary to logic. Remarkably, the sinners seemed to have no shame. When I asked my large undergraduate class in some indignation, “Do you realize that you have violated an elementary logical rule?” someone in the back row shouted, “So what?” and a graduate student who made the same error explained herself by saying, “I thought you just asked for my opinion.”
这个问题的严酷版本使琳达在某些圈子里出名,它为我们赢得了多年的争议。在几所主要大学中,大约85%至90%的本科生选择了第二个选项,这与逻辑相悖。值得注意的是, 的罪人似乎没有羞耻心。当我有些愤愤不平地问我的本科大班:"你们意识到你们违反了一条基本的逻辑规则吗?"后排有人喊道:"那又怎样?"一个犯了同样错误的研究生解释说:"我以为你只是问我的意见。"
The word fallacy is used, in general, when people fail to apply a logical rule that is obviously relevant. Amos and I introduced the idea of a conjunction fallacy , which people commit when they judge a conjunction of two events (here, bank teller and feminist) to be more probable than one of the events (bank teller) in a direct comparison.
一般来说,当人们没有应用明显相关的逻辑 规则时,就会使用 谬误 这个词。阿莫斯和我介绍了 连带谬误 的概念,当人们判断两个事件(这里是银行出纳员和女权主义者)的连带关系比直接比较中的一个事件(银行出纳员)的可能性更大时,就会犯这个错误。
As in the Muller-Lyer illusion, the fallacy remains attractive even when you recognize it for what it is. The naturalist Stephen Jay Gould described his own struggle with the Linda problem. He knew the correct answer, of course, and yet, he wrote, “a little homunculus in my head continues to jump up and down, shouting at me—‘but she can’t just be a bank teller; read the description.’” The little homunculus is of course Gould’s System 1 speaking to him in insistent tones. (The two-system terminology had not yet been introduced when he wrote.)
正如 ,在Muller-Lyer的幻觉中,即使你认识到它的本质,这种谬论仍然具有吸引力。 自然学家Stephen Jay Gould描述了他自己与Linda问题的斗争。他当然知道正确的答案,然而,他写道, " 我脑子里的 一个小同胞 继续上蹿下跳,对我喊道--'但她不可能只是个银行出纳员;请看描述'。"小同胞当然是古尔德的系统1以坚持不懈的语调对他说话。(他写这篇文章时,双系统的术语还没有被引入 )。
The correct answer to the short version of the Linda problem was the majority response in only one of our studies: 64% of a group of graduate students in the social sciences at Stanford and at Berkeley correctly judged “feminist bank teller” to be less probable than “bank teller.” In the original version with eight outcomes (shown above), only 15% of a similar group of graduate students had made that choice. The difference is instructive. The longer version separated the two critical outcomes by an intervening item (insurance salesperson), and the readers judged each outcome independently, without comparing them. The shorter version, in contrast, required an explicit comparison that mobilized System 2 and allowed most of the statistically sophisticated students to avoid the fallacy. Unfortunately, we did not explore the reasoning of the substantial minority (36%) of this knowledgeable group who chose incorrectly.
在我们的研究中,只有一项研究对琳达问题的简短版本的正确答案是大多数人的反应:在斯坦福大学和伯克利大学的一组社会科学研究生中,64%的人正确判断 "女权主义银行出纳员 "的可能性低于 "银行出纳员"。在有八个结果的原始版本中(如上图所示),在一组类似的研究生 ,只有15%的学生做出了这样的选择。这种差异很有启发性。较长的版本通过一个中间项目(保险销售人员)将两个关键结果分开,读者独立判断每个结果,不进行比较。相反,较短的版本需要进行明确的比较,调动了系统2,使大多数统计学上的学生能够避免 谬误。遗憾的是,我们没有探讨这个有知识的群体中相当少数(36%)选择错误的理由。
The judgments of probability that our respondents offered, in both the Tom W and Linda problems, corresponded precisely to judgments of representativeness (similarity to stereotypes). Representativeness belongs to a cluster of closely related basic assessments that are likely to be generated together. The most representative outcomes combine with the personality description to produce the most coherent stories. The most coherent stories are not necessarily the most probable, but they are plausible , and the notions of coherence, plausibility, and probability are easily confused by the unwary.
在汤姆-W和琳达的问题中,我们的受访者提供的概率判断恰恰与代表性(与刻板印象的相似性)的判断相对应。代表性属于一组密切相关的基本 ,这些评估可能会一起产生。最具代表性的结果与个性描述相结合,产生了最连贯的故事。 最连贯的故事不一定是最有可能的,但它们是 合理的 ,连贯性、合理性和概率的概念很容易被不了解的人混淆。
The uncritical substitution of plausibility for probability has pernicious effects on judgments when scenarios are used as tools of forecasting. Consider these two scenarios, which were presented to different groups, with a request to evaluate their probability:
当情景被用作预测工具时,不加批判地用可信度代替 概率,对判断产生了有害的影响。 考虑这两种情景,它们被提交给不同的小组,并要求评估它们的概率。
A massive flood somewhere in North America next year, in which more than 1,000 people drown
明年北美某地发生大规模洪水,1000多人被淹死
An earthquake in California sometime next year, causing a flood in which more than 1,000 people drown
明年某个时候加州发生地震,造成1000多人淹死在洪水中
The California earthquake scenario is more plausible than the North America scenario, although its probability is certainly smaller. As expected, probability judgments were higher for the richer and more detailed scenario, contrary to logic. This is a trap for forecasters and their clients: adding detail to scenarios makes them more persuasive, but less likely to come true.
加利福尼亚州的地震情况比北美的情况更可信,尽管其概率肯定较小。正如预期的那样,对于更丰富和更详细的场景,概率判断更高,这与逻辑相反。这对预测者和他们的客户来说是个陷阱:在方案中增加细节使其更有说服力,但实现的可能性较小。
To appreciate the role of plausibility, consider the following questions:
为了 ,了解可信度的作用,考虑以下问题。
Which alternative is more probable?
哪种选择更有可能?
Mark has hair.
马克有头发。
Mark has blond hair.
马克有一头金发。
and
和
Which alternative is more probable?
哪种选择更有可能?
Jane is a teacher.
简是一名教师。
Jane is a teacher and walks to work.
简是一名教师,走着去上班。
The two questions have the same logical structure as the Linda problem, but they cause no fallacy, because the more detailed outcome is only more detailed—it is not more plausible, or more coherent, or a better story. The evaluation of plausibility and coherence does not suggest an answer to the probability question. In the absence of a competing intuition, logic prevails.
这两个问题与琳达问题具有相同的逻辑结构,但它们不会造成谬误,因为更详细的结果只是更 ,它不是更可信,或更连贯,或更好的故事。对可信性和连贯性的评价并不表明对概率问题的答案。 在没有竞争的直觉的情况下,逻辑优先。
LESS IS MORE, SOMETIMES EVEN IN JOINT EVALUATION
少即是多,有时甚至在联合评估中
Christopher Hsee, of the University of Chicago, asked people to price sets of dinnerware offered in a clearance sale in a local store, where dinnerware regularly runs between $30 and $60. There were three groups in his experiment. The display below was shown to one group; Hsee labels that joint evaluation , because it allows a comparison of the two sets. The other two groups were shown only one of the two sets; this is single evaluation . Joint evaluation is a within-subject experiment, and single evaluation is between-subjects.
芝加哥大学的克里斯托弗-赫斯(Christopher Hsee)要求人们对当地商店清仓大甩卖中提供的成套餐具进行定价, ,那里的餐具价格通常在30至60美元之间。在他的实验中,有三个小组。 下面的显示是给一个小组看的;Hsee将其称为 联合评价 ,因为它可以对两组进行比较。 另外两个小组只看了两组中的一组;这是 单一评价 。 联合评价是主体内实验,而单一评价是 主体间实验。
Set A: 40 pieces
套装A:40件
Set B: 24 pieces
B组:24件
Dinner plates
餐盘
8, all in good condition
8个,全部状况良好
8, all in good condition
8个,全部状况良好
Soup/salad bowls
汤/沙拉碗
8, all in good condition
8个,全部状况良好
8, all in good condition
8个,都在良好的 状态
Dessert plates
甜品盘
8, all in good condition
8个,全部状况良好
8, all in good condition
8个,全部状况良好
Cups
杯子
8, 2 of them broken
8个,其中2个坏了
Saucers
碟子
8, 7 of them broken
8个,其中7个坏了
Assuming that the dishes in the two sets are of equal quality, which is worth more? This question is easy. You can see that Set A contains all the dishes of Set B, and seven additional intact dishes, and it must be valued more. Indeed, the participants in Hsee’s joint evaluation experiment were willing to pay a little more for Set A than for Set B: $32 versus $30.
假设两套餐具的质量相同,哪个更值得 ?这个问题很简单,你可以看到A组包含B组的所有菜品,另外还有7个完整的菜品,它的价值 肯定 更高。事实上,在Hsee的联合评估实验中,参与者愿意为A组比B组多支付一点:32美元对30美元。
The results reversed in single evaluation, where Set B was priced much higher than Set A: $33 versus $23. We know why this happened. Sets (including dinnerware sets!) are represented by norms and prototypes. You can sense immediately that the average value of the dishes is much lower for Set A than for Set B, because no one wants to pay for broken dishes. If the average dominates the evaluation, it is not surprising that Set B is valued more. Hsee called the resulting pattern less is more . By removing 16 items from Set A (7 of them intact), its value is improved.
我们知道为什么会发生这种情况。套装(包括餐具套装!)是由规范和原型代表的。你可以立即感觉到,A组的餐具的平均价值比B组低得多,因为没有人愿意为破损的餐具付钱。如果平均数在评估中占主导地位,那么B组被更多地重视就不奇怪了。Hsee称由此产生的模式 为少即是多 。通过从A组中删除 16个项目(其中7个是完整的),其价值得到了提高。
Hsee’s finding was replicated by the experimental economist John List in a real market for baseball cards. He auctioned sets of ten high-value cards, and identical sets to which three cards of modest value were added. As in the dinnerware experiment, the larger sets were valued more than the smaller ones in joint evaluation, but less in single evaluation. From the perspective of economic theory, this result is troubling: the economic value of a dinnerware set or of a collection of baseball cards is a sum-like variable. Adding a positively valued item to the set can only increase its value.
Hsee的发现被实验经济学家John List在一个真实的棒球卡市场上复制了。他拍卖的是一套10张高价值的牌,以及加入了三张价值不高的牌的相同的牌。如同在餐具实验中一样,在联合评价中,较大的餐具比较小的餐具更有价值,但在单一评价中, 。从经济理论的角度来看,这个结果令人不安:一套餐具或一套棒球卡的经济价值是一个类似总和的变量。 在这套餐具中增加一个正面价值的物品,只能增加其价值。
The Linda problem and the dinnerware problem have exactly the same structure. Probability, like economic value, is a sum-like variable, as illustrated by this example:
琳达问题和餐具问题具有完全相同的结构。概率,像经济价值一样,是一个类似于总和的变量, ,正如这个例子所说明的。
probability (Linda is a teller)
概率(琳达是个出纳员)
=
=
probability (Linda is feminist teller) + probability (Linda is non-feminist teller)
概率(琳达是女权主义的出纳员)+ 概率(琳达为非女权主义讲述者)
This is also why, as in Hsee’s dinnerware study, single evaluations of the Linda problem produce a less-is-more pattern. System 1 averages instead of adding, so when the non-feminist bank tellers are removed from the set, subjective probability increases. However, the sum-like nature of the variable is less obvious for probability than for money. As a result, joint evaluation eliminates the error only in Hsee’s experiment, not in the Linda experiment.
这也是为什么像Hsee的餐具研究一样,对Linda问题的单一评价会产生一个少即是多的模式。系统1是平均数而不是加法,所以当非女权主义的银行出纳员从集合中删除时,主观概率 。然而,对于概率来说,变量的类和性质不如金钱那么明显。因此,联合评价只在Hsee的实验中消除了误差,而在Linda的实验中没有消除。
Linda was not the only conjunction error that survived joint evaluation. We found similar violations of logic in many other judgments. Participants in one of these studies were asked to rank four possible outcomes of the next Wimbledon tournament from most to least probable. Björn Borg was the dominant tennis player of the day when the study was conducted. These were the outcomes:
琳达并不是唯一在联合评估中幸存下来的连带错误。 我们在其他许多判断中发现了类似的违反逻辑的情况。其中一项研究的参与者被要求对下一届温布尔登网球赛的四种可能结果从最可能到最不可能进行排名。在进行这项研究时,比约恩-博格是当时的主导网球运动员。这些是结果。
A. Borg will win the match.
A.博格将赢得比赛。
B. Borg will lose the first set.
B. 博格将输掉第一盘。
C. Borg will lose the first set but win the match.
C. 博格将输掉第一盘但赢得比赛。
D. Borg will win the first set but lose the match.
D. 博格将赢得第一盘但输掉比赛。
The critical items are B and C. B is the more inclusive event and its probability must be higher than that of an event it includes. Contrary to logic, but not to representativeness or plausibility, 72% assigned B a lower probability than C—another instance of less is more in a direct comparison. Here again, the scenario that was judged more probable was unquestionably more plausible, a more coherent fit with all that was known about the best tennis player in the world.
关键项目是B和 C。B是更具包容性的事件,其概率 必须 高于它所包括的事件的概率。与逻辑相反,但与代表性或合理性无关,72%的人认为B的概率低于C--这是直接比较中少即是多的另一个例子。在这里,被判断为更有可能的情况无疑是更有道理的,与所有关于世界上最好的网球运动员的 ,是更一致的。
To head off the possible objection that the conjunction fallacy is due to a misinterpretation of probability, we constructed a problem that required probability judgments, but in which the events were not described in words, and the term probability did not appear at all. We told participants about a regular six-sided die with four green faces and two red faces, which would be rolled 20 times. They were shown three sequences of greens (G) and reds (R), and were asked to choose one. They would (hypothetically) win $25 if their chosen sequence showed up. The sequences were:
为了避免可能出现的反对意见,即连带谬误是由于对概率的误解,我们构建了一个需要进行概率判断的问题,但其中的事件没有用文字描述,而且 概率 一词根本没有出现。 我们告诉参与者一个有四个绿面的普通六面骰子 ,两个红面,将被掷出20次。他们看到三个绿色(G)和红色(R)的序列,并被要求选择一个。如果他们选择的序列出现,他们将(假设)赢得25美元。这些序列是:。
1. RGRRR
1.RGRRR
2. GRGRRR
2.GRGRRR
3. GRRRRR
3.GRRRRRR
Because the die has twice as many green as red faces, the first sequence is quite unrepresentative—like Linda being a bank teller. The second sequence, which contains six tosses, is a better fit to what we would expect from this die, because it includes two G’s. However, this sequence was constructed by adding a G to the beginning of the first sequence, so it can only be less likely than the first. This is the nonverbal equivalent to Linda being a feminist bank teller. As in the Linda study, representativeness dominated. Almost two-thirds of respondents preferred to bet on sequence 2 rather than on sequence 1. When presented with arguments for the two choices, however, a large majority found the correct argument (favoring sequence 1) more convincing.
因为骰子的绿色面数是红色面数的两倍,所以第一个序列很不具代表性,就像琳达是一个银行出纳员。 第二个序列包含六次抛掷,更符合我们对这个骰子的预期,因为它包括两个G。然而,这个序列是通过在第一个序列的开头加上一个G来构建的,所以它只能比第一个序列的可能性小。这相当于琳达是一个女权主义银行出纳员的非语言性。与琳达的研究一样,代表性占主导地位。几乎 ,三分之二的受访者倾向于在序列2上下注,而不是在 序列1上下注。然而,当向他们提供两种选择的论据时,绝大多数人认为正确的论据(倾向于序列1)更有说服力。
The next problem was a breakthrough, because we finally found a condition in which the incidence of the conjunction fallacy was much reduced. Two groups of subjects saw slightly different variants of the same problem:
下一个问题是一个突破,因为我们终于找到了一个条件,在这个条件下,连词谬误的发生率大大降低。两组被试看到 ,同一问题的变体略有不同。
A health survey was conducted in a sample of adult males in British Columbia, of all ages and occupations. Please give your best estimate of the following values:
在不列颠哥伦比亚省所有年龄和职业的成年男性中进行了一项健康调查。 请对以下数值给出你的最佳估计。
A health survey was conducted in a sample of 100 adult males in British Columbia, of all ages and occupations. Please give your best estimate of the following values:
在不列颠哥伦比亚省的100名成年男性中进行了一项健康调查,这些人的年龄和职业各不相同。 请给出你对以下数值的最佳估计。
What percentage of the men surveyed have had one or more heart attacks?
在接受调查的男性中,有多少百分比 ,有过一次或多次心脏病发作?
How many of the 100 participants have had one or more heart attacks?
在这100名参与者中,有多少人曾经有过一次或多次心脏病发作?
What percentage of the men surveyed are both over 55 years old and have had one or more heart attacks?
在接受调查的男性中,55岁以上并有过一次或多次心脏病发作的比例是多少?
How many of the 100 participants both are over 55 years old and have had one or more heart attacks?
在100名参与者中,有多少人既超过55岁,又有过一次或多次心脏病发作?
The incidence of errors was 65% in the group that saw the problem on the left, and only 25% in the group that saw the problem on the right.
在左边看到问题 ,错误的发生率为65%,而在右边看到问题的小组只有25%。
Why is the question “How many of the 100 participants …” so much easier than “What percentage …”? A likely explanation is that the reference to 100 individuals brings a spatial representation to mind. Imagine that a large number of people are instructed to sort themselves into groups in a room: “Those whose names begin with the letters A to L are told to gather in the front left corner.” They are then instructed to sort themselves further. The relation of inclusion is now obvious, and you can see that individuals whose name begins with C will be a subset of the crowd in the front left corner. In the medical survey question, heart attack victims end up in a corner of the room, and some of them are less than 55 years old. Not everyone will share this particular vivid imagery, but many subsequent experiments have shown that the frequency representation, as it is known, makes it easy to appreciate that one group is wholly included in the other. The solution to the puzzle appears to be that a question phrased as “how many?” makes you think of individuals, but the same question phrased as “what percentage?” does not.
为什么 "100个参与者中有多少人...... "这个问题比 "多少百分比...... "容易得多?一个可能的解释是,提到100个人会让人想到一个空间表征。 想象一下,大量的人被指示在一个房间里把自己分成几组:"那些名字以字母 A 到 L 开头的 ,被告知聚集在左前角。"然后,他们被指示对自己进行进一步分类。包容关系现在很明显,你可以看到名字以 C 开头的个人将是左前角人群的一个子集。在医学调查问题中,心脏病患者最终被安排在房间的一个角落,其中一些人的年龄不到55岁 。不是每个人都会有这种特别生动的想象,但后来的许多实验表明,众所周知的频率表征,使人很容易体会到一个群体完全包含在另一个群体中。谜底似乎是,一个措辞为 "多少人?"的问题让你想到个人,但同样的问题措辞为 "多少百分比?" ,则不会。
What have we learned from these studies about the workings of System 2? One conclusion, which is not new, is that System 2 is not impressively alert. The undergraduates and graduate students who participated in our studies of the conjunction fallacy certainly “knew” the logic of Venn diagrams, but they did not apply it reliably even when all the relevant information was laid out in front of them. The absurdity of the less-is-more pattern was obvious in Hsee’s dinnerware study and was easily recognized in the “how many?” representation, but it was not apparent to the thousands of people who have committed the conjunction fallacy in the original Linda problem and in others like it. In all these cases, the conjunction appeared plausible, and that sufficed for an endorsement of System 2.
我们从这些研究中了解到关于系统 2 的工作原理是什么?一个并不新鲜的结论是,系统2没有令人印象深刻的警惕性。参加我们对连带谬误研究的本科生和研究生当然 "知道 "维恩图的逻辑,但即使所有相关信息都摆在他们面前 ,他们也不能可靠地应用它。少即是多的模式的荒谬性在Hsee的餐具研究中很明显,在 "多少?"的表述中也很容易被识别,但对于成千上万在最初的Linda问题和其他类似问题中犯下连带谬误的人来说,这并不明显。在所有这些情况下,连词似乎都是合理的,这足以让人认可 System 2。
The laziness of System 2 is part of the story. If their next vacation had depended on it, and if they had been given indefinite time and told to follow logic and not to answer until they were sure of their answer, I believe that most of our subjects would have avoided the conjunction fallacy. However, their vacation did not depend on a correct answer; they spent very little time on it, and were content to answer as if they had only been “asked for their opinion.” The laziness of System 2 is an important fact of life, and the observation that representativeness can block the application of an obvious logical rule is also of some interest.
系统2的懒惰是故事的一部分。 如果他们的下一个假期取决于此,如果给他们不确定的时间,并告诉他们要遵循逻辑,在确定答案之前不要回答,我相信我们的大多数受试者会避免连词谬误。然而,他们的假期并不取决于一个正确的答案;他们很少花时间在 ,并且满足于回答,好像他们只是被 "询问意见"。系统2的懒惰是生活中的一个重要事实,而且观察到代表性可以阻止一个明显的逻辑规则的应用,这也有一定的意义。
The remarkable aspect of the Linda story is the contrast to the broken-dishes study. The two problems have the same structure, but yield different results. People who see the dinnerware set that includes broken dishes put a very low price on it; their behavior reflects a rule of intuition. Others who see both sets at once apply the logical rule that more dishes can only add value. Intuition governs judgments in the between-subjects condition; logic rules in joint evaluation. In the Linda problem, in contrast, intuition often overcame logic even in joint evaluation, although we identified some conditions in which logic prevails.
琳达故事的非凡之处在于与破碗研究的对比。这两个问题有相同的结构,但产生了 不同的结果。看到包括破碎碗碟的餐具套装的人把它的价格定得很低;他们的行为反映了直觉的规则。其他同时看到这两套餐具的人应用了逻辑规则,即更多的碗碟只能增加价值。 在主体间条件下,直觉支配着判断;在联合评价中,逻辑支配着判断。相反,在琳达问题中,直觉甚至在联合评价中经常战胜 逻辑,尽管我们发现在某些条件下逻辑占优势。
Amos and I believed that the blatant violations of the logic of probability that we had observed in transparent problems were interesting and worth reporting to our colleagues. We also believed that the results strengthened our argument about the power of judgment heuristics, and that they would persuade doubters. And in this we were quite wrong. Instead, the Linda problem became a case study in the norms of controversy.
阿莫斯和我认为,我们在透明问题中观察到的对概率逻辑的公然违反很有意思,值得向我们的同事报告。 我们还认为,这些结果加强了我们对判断启发式方法的力量的论证,而且它们会说服 怀疑者。在这一点上,我们错得很离谱。相反,琳达问题成为争议规范的案例研究。
The Linda problem attracted a great deal of attention, but it also became a magnet for critics of our approach to judgment. As we had already done, researchers found combinations of instructions and hints that reduced the incidence of the fallacy; some argued that, in the context of the Linda problem, it is reasonable for subjects to understand the word “probability” as if it means “plausibility.” These arguments were sometimes extended to suggest that our entire enterprise was misguided: if one salient cognitive illusion could be weakened or explained away, others could be as well. This reasoning neglects the unique feature of the conjunction fallacy as a case of conflict between intuition and logic. The evidence that we had built up for heuristics from between-subjects experiment (including studies of Linda) was not challenged—it was simply not addressed, and its salience was diminished by the exclusive focus on the conjunction fallacy. The net effect of the Linda problem was an increase in the visibility of our work to the general public, and a small dent in the credibility of our approach among scholars in the field. This was not at all what we had expected.
琳达问题吸引了大量的注意力,但它也成为批评我们判断方法的磁铁。 正如我们已经做的那样,研究人员找到了减少谬误发生率的指示和提示的组合;一些人认为,在琳达问题的上下文 ,受试者将 "概率 "一词理解为 ,好像它意味着 "可信性",这是合理的。这些论点有时被延伸到暗示我们的整个事业被误导了:如果一个突出的认知错觉可以被 削弱或解释 掉,那么其他的也可以。这种推理忽略了连带谬误作为直觉和逻辑之间冲突案例的独特特征 。我们从主体间实验(包括对琳达的研究)中建立起来的启发式方法的证据没有受到挑战--它只是没有得到解决,而且它的突出性也因为只关注连词谬误而被削弱了。琳达问题的净效果是增加了我们的工作在公众中的能见度,并在 我们的方法在该领域的学者中的可信度上有了小的突破。这与我们的预期完全不同。
If you visit a courtroom you will observe that lawyers apply two styles of criticism: to demolish a case they raise doubts about the strongest arguments that favor it; to discredit a witness, they focus on the weakest part of the testimony. The focus on weaknesses is also normal in political debates. I do not believe it is appropriate in scientific controversies, but I have come to accept as a fact of life that the norms of debate in the social sciences do not prohibit the political style of argument, especially when large issues are at stake—and the prevalence of bias in human judgment is a large issue.
如果你参观法庭,你会发现律师们采用两种批评方式:为了拆穿一个案件,他们对赞成该案件的最有力的论据提出怀疑;为了诋毁一个证人,他们专注于证词的最薄弱部分。 我认为这在科学争论中是不合适的,但我已经接受了这样一个事实,即社会科学的辩论规范并不禁止政治风格的争论,特别是当涉及到大问题时--人类判断中普遍存在的偏见就是一个大问题。
Some years ago I had a friendly conversation with Ralph Hertwig, a persistent critic of the Linda problem, with whom I had collaborated in a vain attempt to settle our differences . I asked him why he and others had chosen to focus exclusively on the conjunction fallacy, rather than on other findings that provided stronger support for our position. He smiled as he answered, “It was more interesting,” adding that the Linda problem had attracted so much attention that we had no reason to complain.
几年前,我与拉尔夫-赫特维格(Ralph Hertwig)进行了一次友好的谈话,他是琳达问题的一个持续的 批评者,我曾与他合作,试图 解决我们的分歧 ,但徒劳无功。我问他为什么他和其他人选择只关注连词谬误,而不关注其他为我们的立场提供更有力支持的发现。他笑着回答:"这更有趣。"他还说琳达问题已经吸引了这么多注意力,我们没有理由抱怨 。
SPEAKING OF LESS IS MORE
说到少即是多
“They constructed a very complicated scenario and insisted on calling it highly probable. It is not—it is only a plausible story.”
"他们构建了一个非常复杂的场景,并坚持称其为高度可能。 它不是--它只是一个合理的故事。"
“They added a cheap gift to the expensive product, and made the whole deal less attractive. Less is more in this case.”
"他们在昂贵的产品上增加了一个廉价的礼物,使整个交易的吸引力降低。 在这种情况下,少即是多。"
“In most situations, a direct comparison makes people more careful and more logical. But not always. Sometimes intuition beats logic even when the correct answer stares you in the face.”
"在大多数情况下,直接比较会使人们更加谨慎,更加符合逻辑。但不是 。有时,即使正确的答案摆在你面前,直觉也会战胜逻辑。"
16
16
Causes Trump Statistics
特朗普的原因统计
Consider the following scenario and note your intuitive answer to the question.
考虑以下情景,并注意你对该问题的直觉回答。
A cab was involved in a hit-and-run accident at night.
一辆出租车在夜间发生了一起肇事逃逸事故。
Two cab companies, the Green and the Blue, operate in the city. You are given the following data:
两家出租车公司,绿色和蓝色,在城市中运营。你 ,得到以下数据。
• 85% of the cabs in the city are Green and 15% are Blue.
- 城市中85%的出租车是绿色的,15%是蓝色的。
• A witness identified the cab as Blue. The court tested the reliability of the witness under the circumstances that existed on the night of the accident and concluded that the witness correctly identified each one of the two colors 80% of the time and failed 20% of the time.
- 一名目击者确认该出租车为蓝色。法院在事故发生当晚的情况下测试了证人的可靠性,并得出结论:证人在80%的时间里正确识别了两种颜色中的每一种,20%的时间里没有识别。
What is the probability that the cab involved in the accident was Blue rather than Green?
,发生事故的驾驶室是蓝色而不是绿色的概率是多少?
This is a standard problem of Bayesian inference. There are two items of information: a base rate and the imperfectly reliable testimony of a witness. In the absence of a witness, the probability of the guilty cab being Blue is 15%, which is the base rate of that outcome. If the two cab companies had been equally large, the base rate would be uninformative and you would consider only the reliability of the witness, concluding that the probability is 80%. The two sources of information can be combined by Bayes’s rule. The correct answer is 41% . However, you can probably guess what people do when faced with this problem: they ignore the base rate and go with the witness. The most common answer is 80%.
这是贝叶斯推理的一个标准问题。有两项信息:基准率和证人的不完全可靠的证词。 在没有证人的情况下,有罪的出租车是蓝色的概率是15%,这是该结果的基准率。如果两家出租车公司的规模相同,基准率 ,你会只考虑证人的可靠性,得出的结论是概率是80%。这两个信息来源可以通过贝叶斯法则结合起来。 正确答案是41% 。然而,你可能可以猜到人们在面对这个问题时是怎么做的:他们忽略了基准利率,而选择了证人。 最常见的答案是80%。
CAUSAL STEREOTYPES
因果定型观念
Now consider a variation of the same story, in which only the presentation of the base rate has been altered.
现在 ,考虑同一个故事的变体,其中只有基准利率的表述被改变了。
You are given the following data:
你会得到以下数据。
• The two companies operate the same number of cabs, but Green cabs are involved in 85% of accidents.
- 两家公司运营的出租车数量相同,但绿色出租车涉及85%的事故。
• The information about the witness is as in the previous version.
- 有关证人的信息与前一版本相同。
The two versions of the problem are mathematically indistinguishable, but they are psychologically quite different. People who read the first version do not know how to use the base rate and often ignore it. In contrast, people who see the second version give considerable weight to the base rate, and their average judgment is not too far from the Bayesian solution. Why?
这两个版本的问题在数学上是没有区别的,但它们在心理上 ,是完全不同的。读过第一版的人不知道如何使用基准费率,常常忽略它。相比之下,看到第二个版本的人对基本比率给予了相当大的重视,他们的平均判断 与贝叶斯 方案 相差不大 。 为什么?
In the first version, the base rate of Blue cabs is a statistical fact about the cabs in the city. A mind that is hungry for causal stories finds nothing to chew on: How does the number of Green and Blue cabs in the city cause this cab driver to hit and run?
在第一个版本中,蓝色出租车的基本费率是一个关于城市中出租车的统计事实。一个渴望因果故事的头脑 ,却发现没有什么可以咀嚼的:城市中绿色和蓝色出租车的数量是如何导致这个出租车司机肇事逃逸的?
In the second version, in contrast, the drivers of Green cabs cause more than 5 times as many accidents as the Blue cabs do. The conclusion is immediate: the Green drivers must be a collection of reckless madmen! You have now formed a stereotype of Green recklessness, which you apply to unknown individual drivers in the company. The stereotype is easily fitted into a causal story, because recklessness is a causally relevant fact about individual cabdrivers. In this version, there are two causal stories that need to be combined or reconciled. The first is the hit and run, which naturally evokes the idea that a reckless Green driver was responsible. The second is the witness’s testimony, which strongly suggests the cab was Blue. The inferences from the two stories about the color of the car are contradictory and approximately cancel each other. The chances for the two colors are about equal (the Bayesian estimate is 41%, reflecting the fact that the base rate of Green cabs is a little more extreme than the reliability of the witness who reported a Blue cab).
在第二个版本中,相比之下,绿色出租车的司机造成的事故是蓝色出租车的5倍以上。 结论是直接的:绿色司机一定是一群鲁莽的疯子!你现在已经形成了一个刻板印象 ,即绿色的鲁莽行为,你把它应用于公司中不知名的个别司机。这种刻板印象很容易被纳入一个因果关系的故事中,因为鲁莽是与个别出租车司机的因果关系有关的事实。在这个版本中,有两个因果关系的故事需要合并或调和。首先是肇事逃逸,这自然会让人联想到一个鲁莽的绿色司机 。 其次是证人的证词,这强烈表明出租车是蓝色的。从两个故事中关于汽车颜色的推论是矛盾的,大约相互抵消。两种颜色的几率基本相等(贝叶斯估计为41%,反映出绿色出租车的基本比率比报告蓝色出租车的证人 的可靠性更极端一些)。
The cab example illustrates two types of base rates. Statistical base rates are facts about a population to which a case belongs, but they are not relevant to the individual case. Causal base rates change your view of how the individual case came to be. The two types of base-rate information are treated differently:
cab的例子说明了两种类型的基准率。 统计基准率 是关于一个案例所属人群的事实,但它们与单个案例无关。 因果基准率 改变了你对单个案例如何形成的看法。 这两种类型的基准率信息的处理方式不同。
Statistical base rates are generally underweighted, and sometimes neglected altogether, when specific information about the case at hand is available.
在有关于手头案件的具体信息时,统计基数一般会被低估, ,有时甚至被完全忽略。
Causal base rates are treated as information about the individual case and are easily combined with other case-specific information.
因果基础率被视为关于个别案例的信息,并且很容易与其他具体案例的信息相结合。
The causal version of the cab problem had the form of a stereotype: Green drivers are dangerous. Stereotypes are statements about the group that are (at least tentatively) accepted as facts about every member. Here are two examples:
出租车问题的因果版本有一个刻板印象的形式:绿色司机是危险的。陈规定型是关于群体的陈述,(至少是暂定的) ,作为每个成员的事实接受。这里有两个例子。
Most of the graduates of this inner-city school go to college.
这所市内学校的大多数毕业生都上了大学。
Interest in cycling is widespread in France.
在法国,人们对自行车运动的兴趣非常广泛。
These statements are readily interpreted as setting up a propensity in individual members of the group, and they fit in a causal story. Many graduates of this particular inner-city school are eager and able to go to college, presumably because of some beneficial features of life in that school. There are forces in French culture and social life that cause many Frenchmen to take an interest in cycling. You will be reminded of these facts when you think about the likelihood that a particular graduate of the school will attend college, or when you wonder whether to bring up the Tour de France in a conversation with a Frenchman you just met.
这些陈述很容易被解释为在群体的个别成员中设置了一种倾向,而且它们符合一个因果故事。这个特定的城市内部学校的许多毕业生渴望并能够上大学, ,大概是因为在该学校生活的一些有益特征。法国文化和社会生活中的一些力量导致许多法国人对自行车运动产生兴趣。 当你想到某个学校的毕业生上大学的可能性时,或者当你不知道是否要在与刚认识的 法国人的谈话中提起环法自行车赛时,你就会想起这些事实。
Stereotyping is a bad word in our culture, but in my usage it is neutral. One of the basic characteristics of System 1 is that it represents categories as norms and prototypical exemplars. This is how we think of horses, refrigerators, and New York police officers; we hold in memory a representation of one or more “normal” members of each of these categories. When the categories are social, these representations are called stereotypes. Some stereotypes are perniciously wrong, and hostile stereotyping can have dreadful consequences, but the psychological facts cannot be avoided: stereotypes, both correct and false, are how we think of categories.
在我们的文化中, 陈规定型 是一个坏词,但在我的用法中,它是中性的。系统1的基本特征之一是,它将类别表现为规范和原型典范。这就是我们对马、冰箱和纽约警察的看法;我们在记忆中持有这些类别中的一个或多个 "正常 "成员的表象。当 类别是社会性的,这些表象被称为定型观念。有些定型观念是邪恶的错误,敌对的定型观念会产生可怕的 ,但心理学事实是无法回避的:定型观念,无论是正确的还是错误的,都是我们对类别的思考方式。
You may note the irony. In the context of the cab problem, the neglect of base-rate information is a cognitive flaw, a failure of Bayesian reasoning, and the reliance on causal base rates is desirable. Stereotyping the Green drivers improves the accuracy of judgment. In other contexts, however, such as hiring or profiling, there is a strong social norm against stereotyping, which is also embedded in the law. This is as it should be. In sensitive social contexts, we do not want to draw possibly erroneous conclusions about the individual from the statistics of the group. We consider it morally desirable for base rates to be treated as statistical facts about the group rather than as presumptive facts about individuals. In other words, we reject causal base rates.
你可能会注意到这种讽刺。在出租车问题的背景下,忽视基率信息是一种认知缺陷, ,是贝叶斯推理的失败,而对因果基率的依赖是可取的。对绿色司机的成见提高了判断的准确性。但在其他背景下,如招聘或定性,有一个强烈的社会规范反对成见,这也是法律所规定的。在敏感的社会背景下,我们不希望从群体的统计数字中得出关于个人的可能错误的结论 。我们认为,从道德上讲,应该把基准率作为关于群体的统计事实,而不是作为关于个人的推定事实。 换句话说,我们拒绝因果基准率。
The social norm against stereotyping, including the opposition to profiling, has been highly beneficial in creating a more civilized and more equal society. It is useful to remember, however, that neglecting valid stereotypes inevitably results in suboptimal judgments. Resistance to stereotyping is a laudable moral position, but the simplistic idea that the resistance is costless is wrong. The costs are worth paying to achieve a better society, but denying that the costs exist, while satisfying to the soul and politically correct, is not scientifically defensible. Reliance on the affect heuristic is common in politically charged arguments. The positions we favor have no cost and those we oppose have no benefits. We should be able to do better.
反对刻板印象的社会规范,包括反对以貌取人,对创造一个更文明、更平等的 社会大有裨益。然而,我们应该记住,忽视有效的刻板印象必然会导致次优的判断。 抵制刻板印象是一种值得称道的道德立场,但认为抵制是没有代价的简单想法是错误的。为了实现一个更好的社会,这些成本是值得付出的,但否认成本的存在,虽然满足了灵魂和政治上的正确性,但在科学上并不 。 依靠影响启发式在政治上的争论中很常见。 我们赞成的立场没有成本,我们反对的立场没有好处。 我们应该能够做得更好。
CAUSAL SITUATIONS
因果情况
Amos and I constructed the variants of the cab problem, but we did not invent the powerful notion of causal base rates; we borrowed it from the psychologist Icek Ajzen. In his experiment, Ajzen showed his participants brief vignettes describing some students who had taken an exam at Yale and asked the participants to judge the probability that each student had passed the test. The manipulation of causal base rates was straightforward: Ajzen told one group that the students they saw had been drawn from a class in which 75% passed the exam, and told another group that the same students had been in a class in which only 25% passed. This is a powerful manipulation, because the base rate of passing suggests the immediate inference that the test that only 25% passed must have been brutally difficult. The difficulty of a test is, of course, one of the causal factors that determine every student’s outcome. As expected, Ajzen’s subjects were highly sensitive to the causal base rates, and every student was judged more likely to pass in the high-success condition than in the high-failure rate.
阿莫斯和我构建了出租车问题的变体,但我们并没有发明因果基率这个强大的概念;我们从心理学家Icek Ajzen那里借用了它。 在他的实验中,阿詹向他的参与者展示了描述一些在耶鲁大学参加考试的学生的简短小故事,并要求参与者判断每个学生通过考试的概率。对因果基础率的操纵很简单:阿吉森告诉一个小组,他们看到的学生是从一个有75%通过考试的班级里抽出来的,然后告诉另一个小组 ,同样的学生在一个只有25%通过的班级里。这是一个强有力的操纵,因为通过的基础率表明,只有25%通过的考试一定是非常困难的,这是一个直接的推断。当然,考试的难度是决定每个 学生结果的因果因素之一。正如预期的那样,Ajzen的受试者对因果基础率高度敏感 ,每个学生在高成功率条件下都被判断为比高失败率条件下更容易通过。
Ajzen used an ingenious method to suggest a noncausal base rate. He told his subjects that the students they saw had been drawn from a sample, which itself was constructed by selecting students who had passed or failed the exam. For example, the information for the high-failure group read as follows:
阿詹用一种巧妙的方法提出了一个非因果的基率。他告诉他的受试者,他们看到的学生是从一个样本中抽取的,而这个样本本身就是通过挑选考试合格或不合格的学生来构建的。例如, 高失误组的信息如下。
The investigator was mainly interested in the causes of failure and constructed a sample in which 75% had failed the examination.
调查员主要对失败的原因感兴趣,并构建了一个75%没有通过考试的样本。
Note the difference. This base rate is a purely statistical fact about the ensemble from which cases have been drawn. It has no bearing on the question asked, which is whether the individual student passed or failed the test. As expected, the explicitly stated base rates had some effects on judgment, but they had much less impact than the statistically equivalent causal base rates. System 1 can deal with stories in which the elements are causally linked, but it is weak in statistical reasoning. For a Bayesian thinker, of course, the versions are equivalent. It is tempting to conclude that we have reached a satisfactory conclusion: causal base rates are used; merely statistical facts are (more or less) neglected. The next study, one of my all-time favorites, shows that the situation is rather more complex.
注意区别。这个基准率是一个纯粹的统计事实,即从案件中抽取的合集。这与所问的问题没有关系,即学生个人是通过了考试还是没有通过考试。 正如预期的那样,明确说明的基数对判断有一些影响,但它们的影响远远小于统计学上等同的因果基数。 系统1可以处理元素有因果关系的故事,但它在统计推理方面很弱。 当然,对于贝叶斯思想家来说,这两个版本是等同的。我们很容易得出这样的结论:我们已经得出了令人满意的 :使用了因果基率;仅仅是统计事实(或多或少)被忽略了。 下一项研究,是我一直以来最喜欢的一项研究,表明情况相当复杂。
CAN PSYCHOLOGY BE TAUGHT?
心理学可以被教授吗?
The reckless cabdrivers and the impossibly difficult exam illustrate two inferences that people can draw from causal base rates: a stereotypical trait that is attributed to an individual, and a significant feature of the situation that affects an individual’s outcome. The participants in the experiments made the correct inferences and their judgments improved. Unfortunately, things do not always work out so well. The classic experiment I describe next shows that people will not draw from base-rate information an inference that conflicts with other beliefs. It also supports the uncomfortable conclusion that teaching psychology is mostly a waste of time.
鲁莽的出租车司机和不可能完成的考试说明了人们可以从因果基率中得出两个推论:一个是归于个人的刻板特征, ,一个是影响个人结果的情况的重要特征。参加实验的人做出了正确的推论,他们的判断也得到了改善。不幸的是,事情并不总是那么顺利。我接下来描述的经典实验表明,人们不会从基础信息中得出与其他信念相冲突的推论。它还支持 ,即教授心理学大多是浪费时间这一令人不舒服的结论。
The experiment was conducted a long time ago by the social psychologist Richard Nisbett and his student Eugene Borgida, at the University of Michigan . They told students about the renowned “helping experiment” that had been conducted a few years earlier at New York University. Participants in that experiment were led to individual booths and invited to speak over the intercom about their personal lives and problems. They were to talk in turn for about two minutes. Only one microphone was active at any one time. There were six participants in each group, one of whom was a stooge. The stooge spoke first, following a script prepared by the experimenters. He described his problems adjusting to New York and admitted with obvious embarrassment that he was prone to seizures, especially when stressed. All the participants then had a turn. When the microphone was again turned over to the stooge, he became agitated and incoherent, said he felt a seizure coming on, and asked for someone to help him. The last words heard from him were, “C-could somebody-er-er-help-er-uh-uh-uh [choking sounds]. I … I’m gonna die-er-er-er I’m … gonna die-er-er-I seizure I-er [chokes, then quiet].” At this point the microphone of the next participant automatically became active, and nothing more was heard from the possibly dying individual.
很久以前,社会心理学家理查德-尼斯贝特和他的学生尤金-博吉达在 密歇根大学 进行了这个实验。 他们向学生们讲述了几年前在纽约大学进行的著名的 "帮助实验"。在该实验中,参与者 ,被领到单独的展位,并被邀请通过对讲机讲述他们的个人生活和问题。他们要轮流讲述大约两分钟。每组有六名参与者,其中一名是傀儡。傀儡首先发言,按照实验者准备的脚本。他描述了他在适应纽约方面的问题,并明显尴尬地承认 ,他很容易发作,尤其是在压力大的时候。 然后所有参与者都轮到了。当话筒再次被转给这个傀儡时,他变得焦躁不安,语无伦次,说他感觉要发作了,并要求有人帮助他。 从他那里听到的最后一句话是:"谁能帮帮我,我......我要死了,我......我......要死了,我癫痫发作了,我......"[哽咽,然后安静]。这时,下一个参与者的麦克风自动开始工作,而这个可能濒临死亡的人再也听不到任何声音了。
What do you think the participants in the experiment did? So far as the participants knew, one of them was having a seizure and had asked for help. However, there were several other people who could possibly respond, so perhaps one could stay safely in one’s booth. These were the results: only four of the fifteen participants responded immediately to the appeal for help. Six never got out of their booth, and five others came out only well after the “seizure victim” apparently choked. The experiment shows that individuals feel relieved of responsibility when they know that others have heard the same request for help.
你认为实验中的参与者做了什么?据与会者所知,他们中的一个人正在发作并请求帮助。然而,还有其他几个人 ,可能会有反应,所以也许可以安全地呆在自己的展位上。结果是:15名参与者中只有4人立即对求助呼吁作出反应。 6人从未走出他们的摊位,另外5人在 "癫痫患者 "明显窒息后才出来。 该实验表明,当个人知道其他人听到同样的求助时,他们会感到 责任的解除 。 。
Did the results surprise you? Very probably. Most of us think of ourselves as decent people who would rush to help in such a situation, and we expect other decent people to do the same. The point of the experiment, of course, was to show that this expectation is wrong. Even normal, decent people do not rush to help when they expect others to take on the unpleasantness of dealing with a seizure. And that means you, too.
结果是否令你吃惊?很有可能。我们大多数人都认为自己是正直的人,在这种情况下会冲上去帮忙,我们也希望其他正直的人也能这样做。当然,这个实验的重点是表明这种期望是错误的。 即使是正常的、正派的人,当他们期望别人承担处理癫痫发作的不愉快 ,也不会急于帮助。这意味着你也是如此。
Are you willing to endorse the following statement? “When I read the procedure of the helping experiment I thought I would come to the stranger’s help immediately, as I probably would if I found myself alone with a seizure victim. I was probably wrong. If I find myself in a situation in which other people have an opportunity to help, I might not step forward. The presence of others would reduce my sense of personal responsibility more than I initially thought.” This is what a teacher of psychology would hope you would learn. Would you have made the same inferences by yourself?
你是否愿意赞同以下声明?"当我读到帮助实验的程序时,我想我会立即去帮助这个陌生人,就像我发现自己独自面对一个癫痫患者时可能会做的那样。我可能错了。如果我发现自己处在一个其他人有机会提供帮助的情况下,我可能 ,而不会站出来。其他人的存在会比我最初想象的更能减少我的个人责任感。"这是心理学老师希望你能学到的。 你自己也会做出同样的推论吗?
The psychology professor who describes the helping experiment wants the students to view the low base rate as causal, just as in the case of the fictitious Yale exam. He wants them to infer, in both cases, that a surprisingly high rate of failure implies a very difficult test. The lesson students are meant to take away is that some potent feature of the situation, such as the diffusion of responsibility, induces normal and decent people such as them to behave in a surprisingly unhelpful way.
描述帮助实验的心理学教授希望 ,让学生将低基数率视为因果关系,就像在虚构的 耶鲁大学考试的情况下一样。他想让他们推断,在这两种情况下,出乎意料的高失败率意味着非常困难的测试。学生们要得到的教训是,情况的一些有力的特点,如责任的分散,诱使像他们这样的正常和体面的人以令人惊讶的无益的方式行事。
Changing one’s mind about human nature is hard work, and changing one’s mind for the worse about oneself is even harder. Nisbett and Borgida suspected that students would resist the work and the unpleasantness. Of course, the students would be able and willing to recite the details of the helping experiment on a test, and would even repeat the “official” interpretation in terms of diffusion of responsibility. But did their beliefs about human nature really change? To find out, Nisbett and Borgida showed them videos of brief interviews allegedly conducted with two people who had participated in the New York study. The interviews were short and bland. The interviewees appeared to be nice, normal, decent people. They described their hobbies, their spare-time activities, and their plans for the future, which were entirely conventional. After watching the video of an interview, the students guessed how quickly that particular person had come to the aid of the stricken stranger.
改变一个人对人性的看法是艰难的工作,而 ,改变一个人对自己的看法是更难的。当然,学生们能够并愿意在考试中背诵帮助实验的细节,甚至会重复 "官方 "在责任分散方面的解释。但他们对人性的信念 ,真的改变了吗?为了找出答案,尼斯贝特和博吉达给他们看了据称是对两个参加过纽约研究的人进行的简短采访的视频。采访很短,很平淡。被采访者看起来都是很好的、正常的、正派的人。 他们描述了自己的爱好、业余活动以及对未来的计划,这些都是完全常规的。在观看了 采访视频后,学生们猜测那个特定的人是如何迅速地救助了那个遇难的陌生人。
To apply Bayesian reasoning to the task the students were assigned, you should first ask yourself what you would have guessed about the two individuals if you had not seen their interviews. This question is answered by consulting the base rate. We have been told that only 4 of the 15 participants in the experiment rushed to help after the first request. The probability that an unidentified participant had been immediately helpful is therefore 27%. Thus your prior belief about any unspecified participant should be that he did not rush to help. Next, Bayesian logic requires you to adjust your judgment in light of any relevant information about the individual. However, the videos were carefully designed to be uninformative; they provided no reason to suspect that the individuals would be either more or less helpful than a randomly chosen student. In the absence of useful new information, the Bayesian solution is to stay with the base rates.
为了将贝叶斯推理应用于学生们被分配的任务,你应该首先问自己,如果你没有看到这两个人的访谈,你会对他们有什么猜测。这个问题可以通过咨询基本费率来回答。我们被告知, ,在实验的15名参与者中,只有4人在第一次请求后赶来帮忙。因此,你对任何未指明的参与者的先验信念应该是他没有急于帮忙。 接下来,贝叶斯逻辑要求你根据个人的任何相关信息调整你的判断。 然而,这些视频被精心设计成无信息的;它们没有提供任何理由让人怀疑这些人比随机选择的学生更有帮助或没有帮助。在没有有用的新信息的情况下,贝叶斯的解决方案是保持基准利率。
Nisbett and Borgida asked two groups of students to watch the videos and predict the behavior of the two individuals. The students in the first group were told only about the procedure of the helping experiment, not about its results. Their predictions reflected their views of human nature and their understanding of the situation. As you might expect, they predicted that both individuals would immediately rush to the victim’s aid. The second group of students knew both the procedure of the experiment and its results. The comparison of the predictions of the two groups provides an answer to a significant question: Did students learn from the results of the helping experiment anything that significantly changed their way of thinking? The answer is straightforward: they learned nothing at all. Their predictions about the two individuals were indistinguishable from the predictions made by students who had not been exposed to the statistical results of the experiment. They knew the base rate in the group from which the individuals had been drawn, but they remained convinced that the people they saw on the video had been quick to help the stricken stranger.
Nisbett和Borgida要求两组学生观看录像并预测两个人的行为。 ,第一组的学生只被告知帮助实验的程序,而不知道其结果。他们的预测反映了他们对人性的看法 ,以及他们对情况的理解。如你所料,他们预测这两个人都会立即冲向受害者。 第二组学生既知道实验的过程,也知道其 。两组预测的比较为一个重要问题提供了答案:学生是否从帮助实验的结果中学到了什么,从而大大改变了他们的思考方式? 答案很简单:他们根本没有学到任何东西。他们对这两个人的预测与那些没有接触过实验的统计结果的学生的预测没有区别。他们知道这些人所在的小组的基本比率,但他们仍然相信他们在视频中看到的人迅速帮助了那个陌生人。
For teachers of psychology, the implications of this study are disheartening. When we teach our students about the behavior of people in the helping experiment, we expect them to learn something they had not known before; we wish to change how they think about people’s behavior in a particular situation. This goal was not accomplished in the Nisbett-Borgida study, and there is no reason to believe that the results would have been different if they had chosen another surprising psychological experiment. Indeed, Nisbett and Borgida reported similar findings in teaching another study, in which mild social pressure caused people to accept much more painful electric shocks than most of us (and them) would have expected. Students who do not develop a new appreciation for the power of social setting have learned nothing of value from the experiment. The predictions they make about random strangers, or about their own behavior, indicate that they have not changed their view of how they would have behaved. In the words of Nisbett and Borgida, students “quietly exempt themselves” (and their friends and acquaintances) from the conclusions of experiments that surprise them. Teachers of psychology should not despair, however, because Nisbett and Borgida report a way to make their students appreciate the point of the helping experiment. They took a new group of students and taught them the procedure of the experiment but did not tell them the group results. They showed the two videos and simply told their students that the two individuals they had just seen had not helped the stranger, then asked them to guess the global results. The outcome was dramatic: the students’ guesses were extremely accurate.
对于心理学教师来说,这项研究的意义是令人沮丧的。当我们在 帮助实验中向学生讲授人们的行为时,我们希望他们能学到一些他们以前不知道的东西;我们希望改变他们对人们在特定情况下的行为的看法。这一目标在尼斯贝特-博吉达的研究中没有实现,而且没有理由相信,如果他们选择另一个令人惊讶的心理学实验,结果会有所不同。事实上,尼斯贝特和博吉达在教授另一项研究时报告了 ,在这项研究中,温和的社会压力使人们接受了比我们大多数人(和他们)预期的要痛苦得多的电击。 那些没有对社会环境的力量产生新的认识的学生,没有从实验中学到任何有价值的东西。他们对随机陌生人或对自己行为的预测表明 ,他们没有改变他们对自己会如何表现的看法。用尼斯贝特和博吉达的话说,学生们 "悄悄地把自己"(以及他们的朋友和熟人)从令他们吃惊的实验结论中豁免出来。 然而,心理学教师不应绝望,因为尼斯贝特和博吉达报告了一种使学生欣赏帮助实验的方法。 他们带了一组新的学生,教他们实验的程序,但没有告诉他们小组的结果。 他们播放了两段视频,并简单地告诉学生,他们刚刚看到的两个人没有帮助陌生人,然后让他们猜测全球的结果。 结果是戏剧性的:学生的猜测非常准确。
To teach students any psychology they did not know before, you must surprise them. But which surprise will do? Nisbett and Borgida found that when they presented their students with a surprising statistical fact, the students managed to learn nothing at all. But when the students were surprised by individual cases—two nice people who had not helped—they immediately made the generalization and inferred that helping is more difficult than they had thought. Nisbett and Borgida summarize the results in a memorable sentence:
要教给学生任何他们以前不知道的心理学 ,你必须给他们一个惊喜。 但哪一个惊喜能做到?Nisbett和Borgida发现,当他们向学生们展示一个令人惊讶的统计事实时, ,学生们根本学不到任何东西。但当学生们被个别案例--两个没有帮助过的好人--所惊讶时,他们立即进行了归纳,并推断出帮助比他们想象的更难 。Nisbett和Borgida用一个令人难忘的句子概括了这些结果。
Subjects’ unwillingness to deduce the particular from the general was matched only by their willingness to infer the general from the particular.
受试者不愿意从一般事物中推断出特殊事物,只有他们愿意从特殊事物中推断出一般事物。
This is a profoundly important conclusion. People who are taught surprising statistical facts about human behavior may be impressed to the point of telling their friends about what they have heard, but this does not mean that their understanding of the world has really changed. The test of learning psychology is whether your understanding of situations you encounter has changed, not whether you have learned a new fact. There is a deep gap between our thinking about statistics and our thinking about individual cases. Statistical results with a causal interpretation have a stronger effect on our thinking than noncausal information. But even compelling causal statistics will not change long-held beliefs or beliefs rooted in personal experience. On the other hand, surprising individual cases have a powerful impact and are a more effective tool for teaching psychology because the incongruity must be resolved and embedded in a causal story. That is why this book contains questions that are addressed personally to the reader. You are more likely to learn something by finding surprises in your own behavior than by hearing surprising facts about people in general.
这是一个非常重要的结论。 被告知有关人类行为的令人惊讶的统计事实的人可能会被打动,以至于把他们所听到的告诉 ,但这并不意味着他们对世界的理解真的改变了。学习心理学的测试是你对你所遇到的情况的理解是否改变了,而不是你是否学到了一个新的事实。在我们对统计数据的思考和对个案的思考之间存在着很深的差距。具有因果 解释的统计结果比非因果信息对我们的思考有更大的影响。但即使是令人信服的因果统计数据也不会改变长期持有的信念或根植于个人经验的信念。另一方面,令人惊讶的个案具有强大的影响力,是一种更有效的心理学教学工具,因为必须解决不协调的问题,并将其嵌入一个因果关系的故事中。这就是 ,为什么本书包含针对读者个人的问题。与听到关于一般人的惊人事实相比,你更有可能通过在自己的行为中发现惊喜而学到东西。
SPEAKING OF CAUSES AND STATISTICS
说到原因和统计数据
“We can’t assume that they will really learn anything from mere statistics. Let’s show them one or two representative individual cases to influence their System 1.”
"我们不能假设他们真的会从单纯的统计数字中学到什么。 让我们给他们看一两个有代表性的个别案例,以影响 他们的系统1。"
“No need to worry about this statistical information being ignored. On the contrary, it will immediately be used to feed a stereotype.”
"不必担心这些统计信息会被忽视。相反,它将立即被用来喂养一种定型观念"。
17
17
Regression to the Mean
向平均值回归
I had one of the most satisfying eureka experiences of my career while teaching flight instructors in the Israeli Air Force about the psychology of effective training. I was telling them about an important principle of skill training: rewards for improved performance work better than punishment of mistakes. This proposition is supported by much evidence from research on pigeons, rats, humans, and other animals.
我在给以色列空军的飞行教官讲授有效训练的心理学时,有了我职业生涯中最满意的一次 "乌龙 "经验。我当时告诉他们技能培训的一个重要的 原则:对改善表现的奖励比对错误的惩罚效果更好。对鸽子、大鼠、人类和其他动物的研究中的许多证据都支持这一主张。
When I finished my enthusiastic speech, one of the most seasoned instructors in the group raised his hand and made a short speech of his own. He began by conceding that rewarding improved performance might be good for the birds, but he denied that it was optimal for flight cadets. This is what he said: “On many occasions I have praised flight cadets for clean execution of some aerobatic maneuver. The next time they try the same maneuver they usually do worse. On the other hand, I have often screamed into a cadet’s earphone for bad execution, and in general he does better on his next try. So please don’t tell us that reward works and punishment does not, because the opposite is the case.”
当我完成了热情洋溢的演讲后,小组中最有经验的一位教员举起了手,做了一个他自己的简短演讲。他首先承认,奖励提高 ,可能对鸟类有好处,但他否认这对飞行学员来说是最好的。他是这样说的:"在许多场合,我曾赞扬过飞行学员干净利落地完成了一些特技动作。下次他们再尝试同样的动作时,通常会做得更糟糕。另一方面,我经常对着学员的耳机大喊执行不力,一般来说,他在下一次 。因此,请不要告诉我们奖励有效而惩罚无效,因为情况恰恰相反。"
This was a joyous moment of insight, when I saw in a new light a principle of statistics that I had been teaching for years. The instructor was right—but he was also completely wrong! His observation was astute and correct: occasions on which he praised a performance were likely to be followed by a disappointing performance, and punishments were typically followed by an improvement. But the inference he had drawn about the efficacy of reward and punishment was completely off the mark. What he had observed is known as regression to the mean , which in that case was due to random fluctuations in the quality of performance. Naturally, he praised only a cadet whose performance was far better than average. But the cadet was probably just lucky on that particular attempt and therefore likely to deteriorate regardless of whether or not he was praised. Similarly, the instructor would shout into a cadet’s earphones only when the cadet’s performance was unusually bad and therefore likely to improve regardless of what the instructor did. The instructor had attached a causal interpretation to the inevitable fluctuations of a random process.
这是一个令人高兴的洞察力时刻,当时我从一个新的角度看到了我多年来一直在教授的统计学原理。 教员是对的,但他也是完全错误的。他的观察很敏锐,也很正确:在他表扬了一个人的表现之后,很可能会有一个令人失望的表现 ,而惩罚之后通常会有一个进步。但他对奖励和惩罚的效果所做的推断却完全不符合要求。他所观察到的 ,被称为 向平均值回归 ,在这种情况下,这是由于表现质量的随机波动造成的。自然,他只表扬了一个表现远好于平均水平的学员 。同样,只有当学员的表现异常糟糕时,教官才会对着学员的耳机大喊大叫,因此无论教官怎么做都有可能改善。教员将因果关系的解释 到随机过程的不可避免的波动。
The challenge called for a response, but a lesson in the algebra of prediction would not be enthusiastically received. Instead, I used chalk to mark a target on the floor. I asked every officer in the room to turn his back to the target and throw two coins at it in immediate succession, without looking. We measured the distances from the target and wrote the two results of each contestant on the blackboard. Then we rewrote the results in order, from the best to the worst performance on the first try. It was apparent that most (but not all) of those who had done best the first time deteriorated on their second try, and those who had done poorly on the first attempt generally improved. I pointed out to the instructors that what they saw on the board coincided with what we had heard about the performance of aerobatic maneuvers on successive attempts: poor performance was typically followed by improvement and good performance by deterioration, without any help from either praise or punishment.
这个挑战需要回应,但预测的代数课不会被热烈欢迎。相反,我用粉笔在地板上标记了一个目标。我要求房间里的每一位官员背对着目标,立即连续向其投掷两枚硬币,不要看。我们测量了与目标的距离 ,并在黑板上写下了每位参赛者的两次成绩。然后我们按顺序重写了成绩,从第一次表现最好的到最差的。很明显,大多数(但不是全部)第一次表现最好的人在第二次尝试时情况恶化,而那些第一次表现不好的人普遍有所改进。我向教官们指出,他们在黑板上看到的 ,与我们听到的关于连续尝试特技动作的表现不谋而合:表现差的人通常会有进步,表现好的人则会变差,没有任何表扬或惩罚的帮助。
The discovery I made on that day was that the flight instructors were trapped in an unfortunate contingency: because they punished cadets when performance was poor, they were mostly rewarded by a subsequent improvement, even if punishment was actually ineffective. Furthermore, the instructors were not alone in that predicament. I had stumbled onto a significant fact of the human condition: the feedback to which life exposes us is perverse. Because we tend to be nice to other people when they please us and nasty when they do not, we are statistically punished for being nice and rewarded for being nasty.
那天我的发现是,飞行教官陷入了一个不幸的突发事件中:因为他们在学员表现不佳时对他们进行了惩罚, ,他们大多得到了随后改进的奖励,即使惩罚实际上是无效的。此外,教官并不是唯一处于这种困境的人。我偶然发现了人类状况的一个重要事实:生活让我们接触到的反馈是反常的。因为当别人取悦我们时,我们倾向于对他们好,而当他们不取悦我们时,我们就会变得很讨厌, ,从统计学上讲,我们会因为好而受到惩罚,因为讨厌而得到奖励。
TALENT AND LUCK
人才和运气
A few years ago, John Brockman, who edits the online magazine Edge , asked a number of scientists to report their “favorite equation.” These were my offerings:
几年前,编辑在线杂志 《边缘》 的约翰-布罗克曼要求一些科学家报告他们 "最喜欢的方程式"。这些是我的供品。
success = talent + luck
成功 = 天赋+运气
great success = a little more talent + a lot of luck
大成功=更多的天赋+更多的运气
The unsurprising idea that luck often contributes to success has surprising consequences when we apply it to the first two days of a high-level golf tournament. To keep things simple, assume that on both days the average score of the competitors was at par 72. We focus on a player who did very well on the first day, closing with a score of 66. What can we learn from that excellent score? An immediate inference is that the golfer is more talented than the average participant in the tournament. The formula for success suggests that another inference is equally justified: the golfer who did so well on day 1 probably enjoyed better-than-average luck on that day. If you accept that talent and luck both contribute to success, the conclusion that the successful golfer was lucky is as warranted as the conclusion that he is talented.
运气往往有助于成功,这个不足为奇的想法,当我们把它应用于一场高水平高尔夫比赛的前两天时,会产生令人惊讶的后果。 为了简单起见,假设在这两天里,参赛者的平均成绩为72杆。 我们把注意力放在一个在第一天表现非常好的选手身上,以66杆收官。 我们可以从这个优秀的成绩中了解到什么?一个直接的推论是,这个高尔夫球员比参加比赛的平均 。成功的公式表明,另一个推论同样是合理的:在第一天表现如此出色的高尔夫球员可能在那一天享受了比平均水平更好的运气。如果你接受天赋和运气都有助于成功,那么这个成功的高尔夫球员是幸运的结论和他是有天赋的结论是一样的。
By the same token, if you focus on a player who scored 5 over par on that day, you have reason to infer both that he is rather weak and had a bad day. Of course, you know that neither of these inferences is certain. It is entirely possible that the player who scored 77 is actually very talented but had an exceptionally dreadful day. Uncertain though they are, the following inferences from the score on day 1 are plausible and will be correct more often than they are wrong.
同样的道理,如果你 ,关注一个当天得分超过标准杆5分的球员,你有理由推断他相当弱 ,而且 今天过得不好。当然,你知道这些推断都不确定。完全有可能,得77分的选手实际上非常有天赋,但今天的表现特别糟糕。 虽然不确定,但从第一天的分数中得出的以下推论是可信的, ,而且正确的次数会多于错误的次数。
above-average score on day 1
第一天的得分高于平均水平
=
=
above-average talent + lucky on day 1
高于平均水平的人才+ 第1天就很幸运
and
和
below-average score on day 1
第1天的得分低于平均水平
=
=
below-average talent + unlucky on day 1
低于平均水平的人才+ 第一天就倒霉了
Now, suppose you know a golfer’s score on day 1 and are asked to predict his score on day 2. You expect the golfer to retain the same level of talent on the second day, so your best guesses will be “above average” for the first player and “below average” for the second player. Luck, of course, is a different matter. Since you have no way of predicting the golfers’ luck on the second (or any) day, your best guess must be that it will be average, neither good nor bad. This means that in the absence of any other information, your best guess about the players’ score on day 2 should not be a repeat of their performance on day 1. This is the most you can say:
现在,假设你知道一个高尔夫球员在第一天的得分,并被要求预测他在第二天的得分。你期望这个高尔夫球员在第二天保持相同的天赋水平,所以你的最佳猜测 ,第一个球员的得分将是 "高于平均水平",第二个球员的得分则是 "低于平均水平"。当然,运气是一个不同的问题。由于你无法预测球手在第二天(或任何一天)的运气,你最好的猜测一定是运气一般,不好也不坏。这意味着,在没有任何其他信息的情况下, ,你对球员在第2天的得分的最佳猜测不应该是 ,重复他们在第1天的表现。这是你能说的最多的。
The golfer who did well on day 1 is likely to be successful on day 2 as well, but less than on the first, because the unusual luck he probably enjoyed on day 1 is unlikely to hold.
第1天表现良好的高尔夫球手在第2天也有可能取得成功,但比第1天要少,因为他在第1天可能享有的不寻常的运气不太可能保持。
The golfer who did poorly on day 1 will probably be below average on day 2, but will improve, because his probable streak of bad luck is not likely to continue.
第1天表现不佳的球手在第2天可能会低于平均水平,但会有所改善,因为他可能的霉运连连, ,不可能继续下去。
We also expect the difference between the two golfers to shrink on the second day, although our best guess is that the first player will still do better than the second.
我们还预计,两位球手之间的差距在第二天会缩小,尽管我们最好的猜测是第一位球手仍然会比第二位做得更好。
My students were always surprised to hear that the best predicted performance on day 2 is more moderate, closer to the average than the evidence on which it is based (the score on day 1). This is why the pattern is called regression to the mean. The more extreme the original score, the more regression we expect, because an extremely good score suggests a very lucky day. The regressive prediction is reasonable, but its accuracy is not guaranteed. A few of the golfers who scored 66 on day 1 will do even better on the second day, if their luck improves. Most will do worse, because their luck will no longer be above average.
我的学生总是惊讶地听到,第2天的最佳预测成绩比它所依据的证据(第1天的分数)更适中,更接近平均水平。这就是为什么 模式被称为向平均值回归。原始分数越极端,我们期望的回归越多,因为一个极好的分数暗示着一个非常幸运的日子。 回归预测是合理的,但它的准确性不能保证。有几个在第一天打出66分的球手在第二天会有更好的表现,如果他们的运气好的话。大多数人的情况会更糟,因为他们的运气不再 ,而是高于平均水平。
Now let us go against the time arrow. Arrange the players by their performance on day 2 and look at their performance on day 1. You will find precisely the same pattern of regression to the mean. The golfers who did best on day 2 were probably lucky on that day, and the best guess is that they had been less lucky and had done less well on day 1. The fact that you observe regression when you predict an early event from a later event should help convince you that regression does not have a causal explanation.
现在让我们逆着时间的箭头前进。按照球员在第2天的表现排列,看看他们在第1天的表现,你会发现完全相同的回归平均值的模式。第2天表现最好的高尔夫球员可能是那天运气好,最好的猜测是他们运气较差,在第1天表现较差。当你从后来的事件中预测早期事件时,你观察到 回归的事实应该有助于说服你,回归并没有因果关系的解释。
Regression effects are ubiquitous, and so are misguided causal stories to explain them. A well-known example is the “ Sports Illustrated jinx,” the claim that an athlete whose picture appears on the cover of the magazine is doomed to perform poorly the following season. Overconfidence and the pressure of meeting high expectations are often offered as explanations. But there is a simpler account of the jinx: an athlete who gets to be on the cover of Sports Illustrated must have performed exceptionally well in the preceding season, probably with the assistance of a nudge from luck—and luck is fickle.
回归效应无处不在,用错误的因果故事来解释它们也是如此。 一个著名的例子是 "体育画报的 厄运",即声称照片出现在杂志封面上的运动员注定会在接下来的赛季表现不佳 。过度自信和满足高期望值的压力常常被作为解释。 但对厄运还有一个更简单的解释:能登上 《体育画报》 封面的运动员一定是在前一个赛季表现特别好,可能是在运气的帮助下,而运气是无常的。
I happened to watch the men’s ski jump event in the Winter Olympics while Amos and I were writing an article about intuitive prediction. Each athlete has two jumps in the event, and the results are combined for the final score. I was startled to hear the sportscaster’s comments while athletes were preparing for their second jump: “Norway had a great first jump; he will be tense, hoping to protect his lead and will probably do worse” or “Sweden had a bad first jump and now he knows he has nothing to lose and will be relaxed, which should help him do better.” The commentator had obviously detected regression to the mean and had invented a causal story for which there was no evidence. The story itself could even be true. Perhaps if we measured the athletes’ pulse before each jump we might find that they are indeed more relaxed after a bad first jump. And perhaps not. The point to remember is that the change from the first to the second jump does not need a causal explanation. It is a mathematically inevitable consequence of the fact that luck played a role in the outcome of the first jump. Not a very satisfactory story—we would all prefer a causal account—but that is all there is.
在阿莫斯和我写一篇关于直觉预测的文章时,我碰巧观看了 冬奥会的男子跳台滑雪项目 。每位运动员在该项目上有两跳,结果合并为最终得分。在运动员准备第二跳时,我听到体育主持人的评论:"挪威的第一跳很好;他将会很紧张,希望保护自己的领先优势,可能会做得更差 "或 "瑞典 ,第一跳很差,现在他知道自己没有什么损失,会很放松,这应该有助于他做得更好"。该评论员显然发现了向平均值的倒退,并编造了一个没有证据的因果故事。这个故事本身甚至可能是真的。也许,如果我们在每次跳伞前测量运动员的脉搏,我们可能会发现,他们在第一跳糟糕的 ,确实比较放松。也许也不是。要记住的一点是,从第一跳到第二跳的变化不需要因果解释。 这是一个数学上不可避免的结果,因为运气在第一跳的结果中起了作用。这不是一个非常令人满意的故事--我们都希望有一个因果关系的说明--但这就是全部。
UNDERSTANDING REGRESSION
了解回归
Whether undetected or wrongly explained, the phenomenon of regression is strange to the human mind. So strange, indeed, that it was first identified and understood two hundred years after the theory of gravitation and differential calculus. Furthermore, it took one of the best minds of nineteenth-century Britain to make sense of it, and that with great difficulty.
无论是未被发现还是 错误的解释,回归现象对人类的思维来说是很奇怪的。确实如此奇怪,以至于它在万有引力和微分理论之后两百年才被首次发现和理解。此外,十九世纪英国最优秀的思想家之一才把它弄明白,而且是非常困难地弄明白。
Regression to the mean was discovered and named late in the nineteenth century by Sir Francis Galton, a half cousin of Charles Darwin and a renowned polymath. You can sense the thrill of discovery in an article he published in 1886 under the title “Regression towards Mediocrity in Hereditary Stature,” which reports measurements of size in successive generations of seeds and in comparisons of the height of children to the height of their parents. He writes about his studies of seeds:
向平均数回归是由弗朗西斯-高尔顿爵士在十九世纪发现并命名的,他是查尔斯-达尔文的半个表弟,也是一位著名的多面手。你可以从他在1886年发表的一篇文章中感受到发现的快感,文章的标题是《遗传性身材向中庸的倒退》,其中报告了对连续几代种子的尺寸的测量,以及对儿童身高与父母身高的比较。 他写到了他对种子的研究。
They yielded results that seemed very noteworthy, and I used them as the basis of a lecture before the Royal Institution on February 9th, 1877. It appeared from these experiments that the offspring did not tend to resemble their parent seeds in size, but to be always more mediocre than they—to be smaller than the parents, if the parents were large; to be larger than the parents, if the parents were very small … The experiments showed further that the mean filial regression towards mediocrity was directly proportional to the parental deviation from it.
它们产生的结果似乎非常值得注意,我把它们作为1877年2月9日在皇家学会演讲的基础。从这些实验中可以看出,后代的大小 并不 倾向于与父母的种子相似,而总是比他们更平庸--如果父母很大,就比父母小;如果父母很小,就比父母大 ......实验进一步表明,孝子向平庸的平均退步与父母的偏离程度成正比。
Galton obviously expected his learned audience at the Royal Institution—the oldest independent research society in the world—to be as surprised by his “noteworthy observation” as he had been. What is truly noteworthy is that he was surprised by a statistical regularity that is as common as the air we breathe. Regression effects can be found wherever we look, but we do not recognize them for what they are. They hide in plain sight. It took Galton several years to work his way from his discovery of filial regression in size to the broader notion that regression inevitably occurs when the correlation between two measures is less than perfect, and he needed the help of the most brilliant statisticians of his time to reach that conclusion.
高尔顿显然期待他在皇家学会--世界上最古老的独立研究学会--的博学听众像他一样对他的 "值得注意的观察 "感到惊讶。 ,真正值得注意的是,他对一个像我们呼吸的空气一样普遍的统计规律性感到惊讶。我们在任何地方都能发现回归效应,但我们没有认识到它们是什么。 它们隐藏在众目睽睽之下。高尔顿花了几年时间,才从他发现的尺寸上的孝道回归,到更广泛的概念,即当两个测量指标之间的 相关性不那么完美时,回归不可避免地发生,他需要当时 最杰出的统计学家的帮助 才能得出这个结论。
One of the hurdles Galton had to overcome was the problem of measuring regression between variables that are measured on different scales, such as weight and piano playing. This is done by using the population as a standard of reference. Imagine that weight and piano playing have been measured for 100 children in all grades of an elementary school, and that they have been ranked from high to low on each measure. If Jane ranks third in piano playing and twenty-seventh in weight, it is appropriate to say that she is a better pianist than she is tall. Let us make some assumptions that will simplify things:
高尔顿必须克服的一个障碍是测量在不同尺度上测量的变量之间的回归问题,如体重和钢琴演奏。这是通过使用人口作为参考标准来实现的。想象一下 ,对一所小学各年级的100名儿童的体重和钢琴演奏进行了测量,并在每项测量中从高到低进行了排名。如果简的钢琴演奏排名第三,体重排名第二十七,就可以说她的钢琴演奏比她的身高好。让我们做一些假设,以简化事情。
At any age,
在任何年龄段。
Piano-playing success depends only on weekly hours of practice.
钢琴演奏的成功 ,只取决于每周的练习时间。
Weight depends only on consumption of ice cream.
体重只取决于冰淇淋的消费。
Ice cream consumption and weekly hours of practice are unrelated.
冰淇淋消费和每周的练习时间是不相关的。
Now, using ranks (or the standard scores that statisticians prefer), we can write some equations:
现在,使用等级(或统计学家喜欢的 标准分数 ),我们可以写一些方程式。
weight
重量
=
=
age + ice cream consumption
年龄+冰激凌消费
piano playing
钢琴演奏
=
=
age + weekly hours of practice
年龄+每周的练习时间
You can see that there will be regression to the mean when we predict piano playing from weight, or vice versa. If all you know about Tom is that he ranks twelfth in weight (well above average), you can infer (statistically) that he is probably older than average and also that he probably consumes more ice cream than other children. If all you know about Barbara is that she is eighty-fifth in piano (far below the average of the group), you can infer that she is likely to be young and that she is likely to practice less than most other children.
你可以看到,当我们从体重预测钢琴演奏时,会有回归到 ,反之亦然。如果你对汤姆的了解只是他的体重排在第十二位(远高于平均水平),那么你可以推断(从统计学角度),他可能比平均年龄大,而且他可能比其他孩子吃更多的冰淇淋。如果你对芭芭拉的了解只是她的钢琴成绩是第八十五名(远远低于小组的平均水平),你可以推断 ,她很可能很年轻,而且她很可能比其他大多数孩子练得少。
The correlation coefficient between two measures, which varies between 0 and 1, is a measure of the relative weight of the factors they share. For example, we all share half our genes with each of our parents, and for traits in which environmental factors have relatively little influence, such as height, the correlation between parent and child is not far from .50. To appreciate the meaning of the correlation measure, the following are some examples of coefficients:
两个衡量标准之间的 相关系数 ,在0和1之间变化,是对它们共享的因素的相对权重的衡量。例如,我们都与父母各共享一半的基因,对于环境因素影响相对较小的性状,如 身高, 父母和孩子之间的相关系数 离0.50不远。为了理解相关度量的含义,下面是一些系数的例子。
The correlation between the size of objects measured with precision in English or in metric units is 1. Any factor that influences one measure also influences the other; 100% of determinants are shared.
用英语或公制单位精确测量的物体大小之间的相关性为1。任何影响一种测量方法的因素也影响另一种测量方法;100%的决定因素是共享的。
The correlation between self-reported height and weight among adult American males is .41. If you included women and children, the correlation would be much higher, because individuals’ gender and age influence both their height and their weight, boosting the relative weight of shared factors.
,美国成年男性自我报告的 身高和体重 之间的相关性为0.41。如果你把妇女和儿童包括在内,相关性会高得多,因为个人的性别和年龄同时影响他们的身高和体重,提高了共同因素的相对重要性。
The correlation between SAT scores and college GPA is approximately .60. However, the correlation between aptitude tests and success in graduate school is much lower, largely because measured aptitude varies little in this selected group. If everyone has similar aptitude, differences in this measure are unlikely to play a large role in measures of success.
SAT分数和大学GPA之间的相关性约为0.60。然而,能力测试和 研究生院的成功之间的相关性要低得多,这主要是因为在这个选定的群体中,测量的能力差异很小。如果每个人都有类似的能力,那么这种测量方法的差异不太可能在衡量成功方面发挥很大作用。
The correlation between income and education level in the United States is approximately .40.
在美国, 收入和教育 水平之间的相关性约为0.40。
The correlation between family income and the last four digits of their phone number is 0.
家庭收入与他们的 电话的最后四位数之间的相关性为0。
It took Francis Galton several years to figure out that correlation and regression are not two concepts—they are different perspectives on the same concept. The general rule is straightforward but has surprising consequences: whenever the correlation between two scores is imperfect, there will be regression to the mean. To illustrate Galton’s insight, take a proposition that most people find quite interesting:
弗朗西斯-高尔顿花了几年时间才弄明白, 相关和回归 不是两个概念--它们是对同一概念的不同看法。一般的规则是直截了当的,但有令人惊讶的后果:只要两个分数之间的相关性不完善,就会出现向平均值回归的情况。为了说明高尔顿的洞察力,以一个大多数人认为相当有趣的命题为例, 。
Highly intelligent women tend to marry men who are less intelligent than they are.
高智商的女性倾向于嫁给比她们低智商的男性。
You can get a good conversation started at a party by asking for an explanation, and your friends will readily oblige. Even people who have had some exposure to statistics will spontaneously interpret the statement in causal terms. Some may think of highly intelligent women wanting to avoid the competition of equally intelligent men, or being forced to compromise in their choice of spouse because intelligent men do not want to compete with intelligent women. More far-fetched explanations will come up at a good party. Now consider this statement:
在聚会上,你可以通过要求解释来开始一个好的谈话,你的朋友们会很乐意地答应。即使是接触过一些 ,也会自发地用因果关系来解释这句话。有些人可能会想到高智商的女性 ,希望避免与同样聪明的男性竞争,或者因为聪明的男性不想与聪明的女性竞争而被迫在选择配偶时做出妥协。更多牵强附会的解释会在一个好的聚会上出现。现在考虑一下这句话。
The correlation between the intelligence scores of spouses is less than perfect.
夫妻双方的智力分数之间的关联性并不完美。
This statement is obviously true and not interesting at all. Who would expect the correlation to be perfect? There is nothing to explain. But the statement you found interesting and the statement you found trivial are algebraically equivalent. If the correlation between the intelligence of spouses is less than perfect (and if men and women on average do not differ in intelligence), then it is a mathematical inevitability that highly intelligent women will be married to husbands who are on average less intelligent than they are (and vice versa, of course). The observed regression to the mean cannot be more interesting or more explainable than the imperfect correlation.
这句话显然是真实的, ,一点也不有趣。谁会期望相关关系是完美的?没有什么可解释的。但你觉得有趣的陈述和你觉得琐碎的陈述在代数上是等价的。如果配偶的智力之间的相关性不那么完美(如果男性和女性在智力上平均没有差异),那么在数学上是不可避免的,高度 的女性将嫁给平均智力低于她们的丈夫(当然反之亦然)。观察到的向平均值的回归不能比不完美的相关性更有趣或更能解释。
You probably sympathize with Galton’s struggle with the concept of regression. Indeed, the statistician David Freedman used to say that if the topic of regression comes up in a criminal or civil trial, the side that must explain regression to the jury will lose the case. Why is it so hard? The main reason for the difficulty is a recurrent theme of this book: our mind is strongly biased toward causal explanations and does not deal well with “mere statistics.” When our attention is called to an event, associative memory will look for its cause—more precisely, activation will automatically spread to any cause that is already stored in memory. Causal explanations will be evoked when regression is detected, but they will be wrong because the truth is that regression to the mean has an explanation but does not have a cause. The event that attracts our attention in the golfing tournament is the frequent deterioration of the performance of the golfers who were successful on day 1. The best explanation of it is that those golfers were unusually lucky that day, but this explanation lacks the causal force that our minds prefer. Indeed, we pay people quite well to provide interesting explanations of regression effects. A business commentator who correctly announces that “the business did better this year because it had done poorly last year” is likely to have a short tenure on the air.
你可能同情高尔顿对回归概念的挣扎。 事实上,统计学家大卫-弗里德曼曾经说过,如果在刑事或民事审判中出现回归的话题 ,必须向陪审团解释回归的一方将输掉官司。 为什么它这么难?困难的主要原因是本书的一个反复出现的主题:我们的思想强烈偏向于因果解释,不善于处理 "单纯的统计数据"。当我们的注意力被唤起到一个事件时,联想记忆将寻找它的原因--更准确地说, ,激活将自动扩散到任何已经储存在记忆中的原因。当检测到回归时,因果解释将被唤起,但它们将是错误的,因为事实是,向平均值回归有一个解释,但没有一个原因。在高尔夫比赛中,吸引我们注意的事件是那些在第1天取得成功的球手 ,他们的成绩经常恶化。对它最好的解释是,那些球手那天的运气异常好,但这种解释缺乏我们头脑中喜欢的因果力。事实上,我们付给人们相当高的报酬,让他们对回归效应作出有趣的解释。一个正确宣布 "今年业务做得更好,因为它去年做得很差 "的商业评论员, ,可能会在广播上有一个短暂的任期。
Our difficulties with the concept of regression originate with both System 1 and System 2. Without special instruction, and in quite a few cases even after some statistical instruction, the relationship between correlation and regression remains obscure. System 2 finds it difficult to understand and learn. This is due in part to the insistent demand for causal interpretations, which is a feature of System 1.
我们对回归概念的困难源于系统1和系统2。 如果没有特别的指导,甚至在相当多的情况下,在一些统计指导之后,相关和回归之间的关系仍然不明确。系统2发现它难以理解和学习。这部分是由于对 因果解释的坚持要求,这是系统1的一个特点。
Depressed children treated with an energy drink improve significantly over a three-month period.
用能量饮料治疗的抑郁症儿童在三个月内有明显的改善。
I made up this newspaper headline, but the fact it reports is true: if you treated a group of depressed children for some time with an energy drink, they would show a clinically significant improvement. It is also the case that depressed children who spend some time standing on their head or hug a cat for twenty minutes a day will also show improvement. Most readers of such headlines will automatically infer that the energy drink or the cat hugging caused an improvement, but this conclusion is completely unjustified. Depressed children are an extreme group, they are more depressed than most other children—and extreme groups regress to the mean over time. The correlation between depression scores on successive occasions of testing is less than perfect, so there will be regression to the mean: depressed children will get somewhat better over time even if they hug no cats and drink no Red Bull. In order to conclude that an energy drink—or any other treatment—is effective, you must compare a group of patients who receive this treatment to a “control group” that receives no treatment (or, better, receives a placebo). The control group is expected to improve by regression alone, and the aim of the experiment is to determine whether the treated patients improve more than regression can explain.
这个报纸的标题是我编的,但它所报道的事实是真实的:如果你用能量饮料治疗一群抑郁症儿童一段时间,他们会出现临床上的明显改善。同样,抑郁症儿童 ,每天花一些时间站在他们的头上或拥抱一只猫20分钟,也会出现改善。大多数读者看到这样的标题会自动推断,能量饮料或拥抱猫造成了改善,但这个结论是完全没有道理的。抑郁症儿童是一个极端群体,他们比其他大多数儿童更抑郁--随着时间的推移,极端群体会向平均值回归 。连续几次测试的抑郁症分数之间的相关性不是很完美,所以会有向平均值回归的现象:即使抑郁症儿童不抱猫,不喝红牛,也会随着时间的推移变得更好。为了得出结论,一种能量饮料或任何其他治疗方法是有效的,你必须将一组接受这种治疗的病人 ,与没有接受任何治疗的 "对照组"(或者更好的是接受安慰剂)进行比较。对照组预计仅通过回归来改善,而实验的目的是确定接受治疗的病人的改善是否超过回归所能解释的程度。
Incorrect causal interpretations of regression effects are not restricted to readers of the popular press. The statistician Howard Wainer has drawn up a long list of eminent researchers who have made the same mistake— confusing mere correlation with causation . Regression effects are a common source of trouble in research, and experienced scientists develop a healthy fear of the trap of unwarranted causal inference.
对回归效应的不正确因果解释并不限于大众媒体的读者。统计学家Howard Wainer 列出了一长串犯过同样错误的知名研究人员名单-- 将单纯的相关关系与因果关系混为一谈 。回归效应是研究中常见的麻烦来源,有经验的科学家对毫无根据的因果推断的陷阱产生了健康的恐惧。
One of my favorite examples of the errors of intuitive prediction is adapted from Max Bazerman’s excellent text Judgment in Managerial Decision Making :
我最喜欢的一个关于直觉预测错误的例子是改编自Max Bazerman的优秀文章 《管理决策中的判断力》 。
You are the sales forecaster for a department store chain. All stores are similar in size and merchandise selection, but their sales differ because of location, competition, and random factors. You are given the results for 2011 and asked to forecast sales for 2012. You have been instructed to accept the overall forecast of economists that sales will increase overall by 10%. How would you complete the following table?
你 ,是一家连锁百货公司的销售预测员。所有的商店在规模和商品选择上都很相似,但由于位置、竞争和随机因素,它们的销售情况不同。你得到了2011年的结果,并被要求预测2012年的销售情况。你被指示接受经济学家的总体预测,即销售将整体增加 ,增加10%。你将如何完成以下表格?
Store
商店
2011
2011
2012
2012
1
1
$11,000,000
$11,000,000
________
________
2
2
$23,000,000
$23,000,000
________
________
3
3
$18,000,000
$18,000,000
________
________
4
4
$29,000,000
$29,000,000
________
________
Total
共计
$81,000,000
$81,000,000
$89,100,000
$89,100,000
Having read this chapter, you know that the obvious solution of adding 10% to the sales of each store is wrong. You want your forecasts to be regressive, which requires adding more than 10% to the low-performing branches and adding less (or even subtracting) to others. But if you ask other people, you are likely to encounter puzzlement: Why do you bother them with an obvious question? As Galton painfully discovered, the concept of regression is far from obvious.
读过这一章后,你知道在每个商店的销售额上增加10%的明显解决方案是错误的。你希望你的预测是递减的,这就要求对表现不佳的分支机构增加10%以上的 ,而对其他分支机构增加较少(甚至减少)。但是,如果你问其他人,你很可能会遇到不解:为什么你要用一个明显的问题来打扰他们? 正如高尔顿痛苦地发现,回归的概念远非明显。
SPEAKING OF REGRESSION TO MEDIOCRITY
说到向平庸的倒退
“She says experience has taught her that criticism is more effective than praise. What she doesn’t understand is that it’s all due to regression to the mean.”
"她说,经验告诉她,批评比表扬更有效。 ,她不明白的是,这都是由于向平均值倒退。"
“Perhaps his second interview was less impressive than the first because he was afraid of disappointing us, but more likely it was his first that was unusually good.”
"也许他的第二次面试不如第一次,因为他害怕让我们失望,但更有可能是他的第一次面试异常出色。"
“Our screening procedure is good but not perfect, so we should anticipate regression. We shouldn’t be surprised that the very best candidates often fail to meet our expectations.”
"我们的筛选程序是好的,但不是完美的,所以我们应该预见到退步。 我们不应该惊讶,非常好的候选人往往不能达到我们的期望, 。"
18
18
Taming Intuitive Predictions
驯服直觉性的预测
Life presents us with many occasions to forecast. Economists forecast inflation and unemployment, financial analysts forecast earnings, military experts predict casualties, venture capitalists assess profitability, publishers and producers predict audiences, contractors estimate the time required to complete projects, chefs anticipate the demand for the dishes on their menu, engineers estimate the amount of concrete needed for a building, fireground commanders assess the number of trucks that will be needed to put out a fire. In our private lives, we forecast our spouse’s reaction to a proposed move or our own future adjustment to a new job.
生活为我们提供了许多需要预测的场合。经济学家预测通货膨胀和失业率,金融分析师预测收益,军事专家预测伤亡,风险资本家评估 ,出版商和制片人预测观众,承包商估计完成项目所需的时间,厨师预测对其菜单上的菜肴的需求,工程师估计建筑物所需的混凝土数量,火场指挥官评估扑灭火灾所需的卡车数量。在我们的私人生活中,我们预测我们的配偶对拟议的 ,或我们自己未来对新工作的调整的反应。
Some predictive judgments, such as those made by engineers, rely largely on look-up tables, precise calculations, and explicit analyses of outcomes observed on similar occasions. Others involve intuition and System 1, in two main varieties. Some intuitions draw primarily on skill and expertise acquired by repeated experience. The rapid and automatic judgments and choices of chess masters, fireground commanders, and physicians that Gary Klein has described in Sources of Power and elsewhere illustrate these skilled intuitions, in which a solution to the current problem comes to mind quickly because familiar cues are recognized.
一些预测性判断,如工程师所做的预测,主要依靠查询表、精确的计算和对类似场合观察到的结果的明确分析。 其他的预测涉及到直觉和系统1,主要有两种。 一些直觉主要依靠通过反复经验获得的技能和专业知识。加里-克莱因在《 力量之源 》和其他书中描述的国际象棋大师、火场指挥官和医生的快速和自动的 判断和选择说明了这些熟练的直觉,在这些直觉中,由于熟悉的线索被识别,当前问题的解决方案很快就想到了。
Other intuitions, which are sometimes subjectively indistinguishable from the first, arise from the operation of heuristics that often substitute an easy question for the harder one that was asked. Intuitive judgments can be made with high confidence even when they are based on nonregressive assessments of weak evidence. Of course, many judgments, especially in the professional domain, are influenced by a combination of analysis and intuition.
其他的直觉,有时在主观上与第一种直觉无法区分,它们来自于 启发式的操作,这种操作常常用一个简单的问题来代替所问的更难的问题。 即使直觉判断是基于对薄弱证据的非回归性评估,也可以有很高的信心。当然,许多判断,特别是在 专业领域,受到分析和直觉的综合影响。
NONREGRESSIVE INTUITIONS
非倒退的直觉
Let us return to a person we have already met:
让我们回到一个我们已经 的人。
Julie is currently a senior in a state university. She read fluently when she was four years old. What is her grade point average (GPA)?
朱莉目前是一所州立大学的高年级学生。 她在四岁时就能流利地阅读。 她的平均成绩(GPA)是多少?
People who are familiar with the American educational scene quickly come up with a number, which is often in the vicinity of 3.7 or 3.8. How does this occur? Several operations of System 1 are involved.
熟悉美国教育界的人很快就会想到一个数字,这个数字往往在3.7或3.8附近。这是怎么发生的?涉及系统1的几个操作。
A causal link between the evidence (Julie’s reading) and the target of the prediction (her GPA) is sought. The link can be indirect. In this instance, early reading and a high GPA are both indications of academic talent. Some connection is necessary. You (your System 2) would probably reject as irrelevant a report of Julie winning a fly-fishing competition or excelling at weight lifting in high school. The process is effectively dichotomous. We are capable of rejecting information as irrelevant or false, but adjusting for smaller weaknesses in the evidence is not something that System 1 can do. As a result, intuitive predictions are almost completely insensitive to the actual predictive quality of the evidence. When a link is found, as in the case of Julie’s early reading, WY SIATI applies: your associative memory quickly and automatically constructs the best possible story from the information available.
在证据(Julie的阅读) 和预测的目标(她的GPA)之间寻求因果关系。这种联系可能是间接的。 在这个例子中,早期阅读和高GPA都是学术天赋的标志。你(你的系统2)可能会拒绝朱莉在飞鱼比赛中获胜或在高中举重时表现出色的报告,认为这无关紧要。这个过程实际上是二分法。 ,我们有能力拒绝不相关或虚假的信息,但调整证据中较小的弱点不是系统1能做到的。因此,直觉预测对证据的实际预测质量几乎完全不敏感。当发现一个联系时,如朱莉早期阅读的情况,WY SIATI适用:你的联想记忆迅速并自动 ,从现有信息中构建出最佳故事。
Next, the evidence is evaluated in relation to a relevant norm. How precocious is a child who reads fluently at age four? What relative rank or percentile score corresponds to this achievement? The group to which the child is compared (we call it a reference group) is not fully specified, but this is also the rule in normal speech: if someone graduating from college is described as “quite clever” you rarely need to ask, “When you say ‘quite clever,’ which reference group do you have in mind?”
接下来,要根据相关标准对证据进行评估。 一个四岁就能流利阅读的孩子有多早熟? 这一成就对应的相对等级或百分位数是多少?孩子被比较的群体(我们称之为参考群体)没有被完全指定,但这也是正常说话的规则: ,如果有人从大学毕业被描述为 "相当聪明",你很少需要问:"当你说'相当聪明'时,你想到的是哪个参考群体?"
The next step involves substitution and intensity matching. The evaluation of the flimsy evidence of cognitive ability in childhood is substituted as an answer to the question about her college GPA. Julie will be assigned the same percentile score for her GPA and for her achievements as an early reader.
下一步涉及替换和强度匹配。对儿童时期认知能力的脆弱证据的评估被替换成了对她大学GPA问题的回答。朱莉将 ,她的GPA和她作为早期读者的成就将被分配到相同的百分点 。
The question specified that the answer must be on the GPA scale, which requires another intensity-matching operation, from a general impression of Julie’s academic achievements to the GPA that matches the evidence for her talent. The final step is a translation, from an impression of Julie’s relative academic standing to the GPA that corresponds to it.
该问题规定,答案必须在GPA范围内,这就需要进行另一种强度匹配操作,从对朱莉学术成就的一般印象到与她的天赋证据相匹配的GPA。最后一步是翻译,从对朱莉的相对学术地位的印象到 ,与之对应的GPA。
Intensity matching yields predictions that are as extreme as the evidence on which they are based, leading people to give the same answer to two quite different questions:
强度匹配产生的预测与它们所依据的证据一样极端,导致人们对两个完全不同的问题给出相同的答案。
What is Julie’s percentile score on reading precocity?
朱莉在阅读早熟方面的百分位数是多少?
What is Julie’s percentile score on GPA?
朱莉在GPA上的百分位数是多少?
By now you should easily recognize that all these operations are features of System 1. I listed them here as an orderly sequence of steps, but of course the spread of activation in associative memory does not work this way. You should imagine a process of spreading activation that is initially prompted by the evidence and the question, feeds back upon itself, and eventually settles on the most coherent solution possible.
现在你应该很容易认识到,所有这些操作都是系统1的特征。我在这里把它们列在 ,作为一个有序的步骤序列,但当然,联想记忆中的激活扩散并不是这样工作的。你应该想象一个激活扩散的过程,它最初是由证据和问题促使的,并反馈到自身,最终确定了最连贯的解决方案。
Amos and I once asked participants in an experiment to judge descriptions of eight college freshmen, allegedly written by a counselor on the basis of interviews of the entering class. Each description consisted of five adjectives, as in the following example:
阿莫斯和我曾经在一个实验中要求参与者判断对 八名大学新生的描述,据称这些描述是由辅导员根据对入学班级的访谈写的。每个描述由五个形容词组成,如下面的例子。
intelligent, self-confident, well-read, hardworking, inquisitive
聪明、自信、好学、勤奋、好奇心强
We asked some participants to answer two questions:
我们要求一些参与者回答两个问题。
How much does this description impress you with respect to academic ability?
在学术能力方面,这一描述给你留下的印象有多深?
What percentage of descriptions of freshmen do you believe would impress you more?
你认为对新生的描述有多大比例 ,会给你留下更深刻的印象?
The questions require you to evaluate the evidence by comparing the description to your norm for descriptions of students by counselors. The very existence of such a norm is remarkable. Although you surely do not know how you acquired it, you have a fairly clear sense of how much enthusiasm the description conveys: the counselor believes that this student is good, but not spectacularly good. There is room for stronger adjectives than intelligent ( brilliant , creative ), well-read ( scholarly , erudite , impressively knowledgeable ), and hardworking ( passionate , perfectionist ). The verdict: very likely to be in the top 15% but unlikely to be in the top 3%. There is impressive consensus in such judgments, at least within a culture.
这些问题要求你通过将描述与你对辅导员描述学生的标准进行比较来评估证据。 ,这种规范的存在是非常了不起的。虽然你肯定不知道你是如何获得的,但你对描述所传达的热情有相当清楚的感觉:辅导员 ,认为这个学生很好,但不是惊人的好。还有空间使用比 聪明 (杰出 、 创造性 )、 好学 (学术 、 博学 、 令人印象深刻的知识 )和 勤奋 (热情 、 完美主义 )更有力的形容词。 结论:非常可能进入前15%,但不太可能进入前3%。 在这种判断中,至少在一种文化中,存在令人印象深刻的共识。
The other participants in our experiment were asked different questions:
我们实验中的其他参与者被问到不同的问题。
What is your estimate of the grade point average that the student will obtain?
你对该学生将获得的平均分数的估计是多少?
What is the percentage of freshmen who obtain a higher GPA?
获得较高GPA的新生比例是多少?
You need another look to detect the subtle difference between the two sets of questions. The difference should be obvious, but it is not. Unlike the first questions, which required you only to evaluate the evidence, the second set involves a great deal of uncertainty. The question refers to actual performance at the end of the freshman year. What happened during the year since the interview was performed? How accurately can you predict the student’s actual achievements in the first year at college from five adjectives? Would the counselor herself be perfectly accurate if she predicted GPA from an interview?
你需要再看一下,才能发现这两组问题之间的微妙差别。 这种差别应该是很明显的,但它不是。与第一组问题不同,第一组问题要求 ,你只需评估证据,第二组问题涉及大量的不确定性。该问题指的是大一结束时的实际表现。 在进行采访后的一年里,发生了什么?你能从五个形容词中多准确地预测该学生在大学第一年的实际成绩?如果辅导员自己从面试中 预测GPA,会不会完全准确?
The objective of this study was to compare the percentile judgments that the participants made when evaluating the evidence in one case, and when predicting the ultimate outcome in another. The results are easy to summarize: the judgments were identical. Although the two sets of questions differ (one is about the description, the other about the student’s future academic performance), the participants treated them as if they were the same. As was the case with Julie, the prediction of the future is not distinguished from an evaluation of current evidence—prediction matches evaluation. This is perhaps the best evidence we have for the role of substitution. People are asked for a prediction but they substitute an evaluation of the evidence, without noticing that the question they answer is not the one they were asked. This process is guaranteed to generate predictions that are systematically biased; they completely ignore regression to the mean.
本研究的目的是比较参与者在评价一个案件的证据时和预测另一个案件的最终结果时所作的百分位数判断。结果很容易总结:判断是相同的。尽管这两组问题有所不同(一个是关于描述,另一个是关于学生未来的 学业成绩),但参与者把它们当作是一样的。正如朱莉的情况一样,对未来的预测与对当前证据的评价没有区别--预测与评价相匹配。 这也许是我们对替代作用的最好证据。 人们被要求进行预测,但他们用对证据的评价来替代,而没有注意到 ,他们回答的问题不是他们被要求的问题。这个过程保证产生有系统偏见的预测;他们完全忽略了向平均值的回归。
During my military service in the Israeli Defense Forces, I spent some time attached to a unit that selected candidates for officer training on the basis of a series of interviews and field tests. The designated criterion for successful prediction was a cadet’s final grade in officer school. The validity of the ratings was known to be rather poor (I will tell more about it in a later chapter). The unit still existed years later, when I was a professor and collaborating with Amos in the study of intuitive judgment. I had good contacts with the people at the unit and asked them for a favor. In addition to the usual grading system they used to evaluate the candidates, I asked for their best guess of the grade that each of the future cadets would obtain in officer school. They collected a few hundred such forecasts. The officers who had produced the predictions were all familiar with the letter grading system that the school applied to its cadets and the approximate proportions of A’s, B’s, etc., among them. The results were striking: the relative frequency of A’s and B’s in the predictions was almost identical to the frequencies in the final grades of the school.
我在以色列国防军服役期间,曾有一段时间隶属于一个根据一系列面试和实地测试来挑选军官培训候选人的单位。 成功预测的指定标准是学员在军官学校的最后成绩。多年后,这个单位仍然存在,当时我是一名教授,与阿莫斯合作研究直觉判断。 我与该单位的人有很好的联系,请他们帮个忙。此外, ,在他们用来评估候选人的通常的评分系统中,我要求他们对每个未来的学员在军官学校将获得的成绩作出最佳猜测。作出预测的军官们都熟悉学校对学员采用的字母评分系统,以及其中A、B等的大致比例, 。结果令人震惊:预测中A和B的相对频率与学校最终成绩的频率几乎相同。
These findings provide a compelling example of both substitution and intensity matching. The officers who provided the predictions completely failed to discriminate between two tasks:
这些发现提供了一个令人信服的替代和强度匹配的例子。提供预测的官员完全没有对两项任务进行区分。
their usual mission, which was to evaluate the performance of candidates during their stay at the unit
他们通常的任务是评估 候选人在单位期间的表现。
the task I had asked them to perform, which was an actual prediction of a future grade
我要求他们完成的任务是对未来的成绩进行实际预测。
They had simply translated their own grades onto the scale used in officer school, applying intensity matching. Once again, the failure to address the (considerable) uncertainty of their predictions had led them to predictions that were completely nonregressive.
他们只是将自己的成绩转化为军官学校使用的量表,应用强度匹配。再一次,由于没有解决他们预测的(相当大的)不确定性,导致他们的预测完全是 ,而不是倒退。
A CORRECTION FOR INTUITIVE PREDICTIONS
对直觉预测的修正
Back to Julie, our precocious reader. The correct way to predict her GPA was introduced in the preceding chapter. As I did there for golf on successive days and for weight and piano playing, I write a schematic formula for the factors that determine reading age and college grades:
回到朱莉,我们早熟的读者。上一章介绍了预测她GPA的正确方法。正如我在那里为连续几天的高尔夫以及体重和钢琴演奏所做的那样,我为决定阅读年龄和大学成绩的因素写了一个示意性的公式。
reading age = shared factors + factors specific to reading age = 100%
阅读年龄=共同因素+阅读的特定因素 年龄=100%
GPA = shared factors + factors specific to GPA = 100%
GPA = 共同因素 + GPA的特定因素 = 100%。
The shared factors involve genetically determined aptitude, the degree to which the family supports academic interests, and anything else that would cause the same people to be precocious readers as children and academically successful as young adults. Of course there are many factors that would affect one of these outcomes and not the other. Julie could have been pushed to read early by overly ambitious parents, she may have had an unhappy love affair that depressed her college grades, she could have had a skiing accident during adolescence that left her slightly impaired, and so on.
共同的因素涉及基因决定的能力,家庭支持学术兴趣的程度,以及其他任何会导致同样的人在儿童时早熟的阅读和在年轻时学术成功的因素 。当然,有许多因素会影响这些结果中的一个, ,而不是另一个。朱莉可能是被过于雄心勃勃的父母逼着早早读书,她可能有一段不愉快的爱情,压抑了她的大学成绩,她可能在青春期发生了一次滑雪事故,导致她的身体有轻微损伤,等等。
Recall that the correlation between two measures—in the present case reading age and GPA—is equal to the proportion of shared factors among their determinants. What is your best guess about that proportion? My most optimistic guess is about 30%. Assuming this estimate, we have all we need to produce an unbiased prediction. Here are the directions for how to get there in four simple steps:
回顾一下,两个衡量标准之间的相关性--在本例中,阅读年龄和GPA--等于它们的决定因素中共同因素的比例 。你对这个比例的最佳猜测是什么?我最乐观的猜测是大约30%。假设有这样的估计,我们就拥有了产生无偏预测所需的一切。 以下是如何通过四个简单的步骤达到目的的指示。
Start with an estimate of average GPA.
从对平均GPA的估计开始。
Determine the GPA that matches your impression of the evidence.
确定符合你对证据印象的GPA。
Estimate the correlation between your evidence and GPA.
估计 你的证据和GPA之间的相关性。
If the correlation is .30, move 30% of the distance from the average to the matching GPA.
如果相关性为0.30,则将30%的距离从平均数移至匹配的GPA。
Step 1 gets you the baseline, the GPA you would have predicted if you were told nothing about Julie beyond the fact that she is a graduating senior. In the absence of information, you would have predicted the average. (This is similar to assigning the base-rate probability of business administration graduates when you are told nothing about Tom W.) Step 2 is your intuitive prediction, which matches your evaluation of the evidence. Step 3 moves you from the baseline toward your intuition, but the distance you are allowed to move depends on your estimate of the correlation. You end up, at step 4, with a prediction that is influenced by your intuition but is far more moderate .
第1步让你得到基线,即如果你除了知道朱莉是一个即将毕业的高年级学生外,什么都不知道,你会预测的GPA。在没有信息的情况下,你会预测平均数。(这类似于当你没有被告知任何关于汤姆-W的信息时,分配商业 管理毕业生的基本概率。)第二步是你的直觉预测,它与你对证据的评估相匹配。第3步使你从基线走向你的直觉,但允许你移动的距离取决于你对相关性的估计。 在第4步,你最终得到一个受你的直觉影响的预测,但要 温和得多 。
This approach to prediction is general. You can apply it whenever you need to predict a quantitative variable, such as GPA, profit from an investment, or the growth of a company. The approach builds on your intuition, but it moderates it, regresses it toward the mean. When you have good reasons to trust the accuracy of your intuitive prediction—a strong correlation between the evidence and the prediction—the adjustment will be small.
这种预测的方法是普遍的。只要你需要预测一个量化的变量,如GPA、投资利润或公司的增长,你都可以应用它。该方法建立在你的直觉之上,但它会对其进行调节,使其向平均值回归。当你有充分的理由相信你的直觉预测的准确性--证据和预测之间有很强的相关性-- ,调整会很小。
Intuitive predictions need to be corrected because they are not regressive and therefore are biased. Suppose that I predict for each golfer in a tournament that his score on day 2 will be the same as his score on day 1. This prediction does not allow for regression to the mean: the golfers who fared well on day 1 will on average do less well on day 2, and those who did poorly will mostly improve. When they are eventually compared to actual outcomes, nonregressive predictions will be found to be biased. They are on average overly optimistic for those who did best on the first day and overly pessimistic for those who had a bad start. The predictions are as extreme as the evidence. Similarly, if you use childhood achievements to predict grades in college without regressing your predictions toward the mean, you will more often than not be disappointed by the academic outcomes of early readers and happily surprised by the grades of those who learned to read relatively late. The corrected intuitive predictions eliminate these biases, so that predictions (both high and low) are about equally likely to overestimate and to underestimate the true value. You still make errors when your predictions are unbiased, but the errors are smaller and do not favor either high or low outcomes.
直观的预测需要被纠正,因为它们不是回归性的,因此是有偏见的。假设我对一场比赛中的每个高尔夫球员进行预测,他在第2天的得分将与他在第1天的得分相同。这种预测不允许回归平均值:在第1天表现良好的高尔夫球员 ,在第2天平均表现较差,而那些表现不佳的 ,大多会有所提高。当他们最终与实际结果进行比较时,会发现非回归预测是有偏差的。 他们平均对那些在第一天表现最好的人过于乐观,对那些开局不利的人过于悲观。预测和证据一样极端。 同样,如果你用童年的成就来预测大学的成绩,而不把你的预测向平均值回归,你往往会对早期阅读者的学术成果感到失望,而对那些相对较晚学会阅读的人的成绩感到高兴。修正后的直观预测消除了这些偏见,因此,预测(高和低)高估和低估真实价值的可能性大致相同。当你的预测没有偏见时,你仍然 ,但误差较小,而且不偏向于高或低的结果。
A DEFENSE OF EXTREME PREDICTIONS?
对极端预测的辩护?
I introduced Tom W earlier to illustrate predictions of discrete outcomes such as field of specialization or success in an examination, which are expressed by assigning a probability to a specified event (or in that case by ranking outcomes from the most to the least probable). I also described a procedure that counters the common biases of discrete prediction: neglect of base rates and insensitivity to the quality of information.
我在前面介绍了汤姆-W,以说明对离散结果的预测,如专业领域或考试成功,其表达方式是给特定的事件分配一个概率(或者在这种情况下,将结果从最可能的到最不可能的进行排序 )。我还描述了一个程序,该程序对抗离散预测的常见偏见:忽视基本率和对信息的质量不敏感。
The biases we find in predictions that are expressed on a scale, such as GPA or the revenue of a firm, are similar to the biases observed in judging the probabilities of outcomes.
我们在以规模表示的预测中发现的偏差,如GPA或公司的收入,类似于在判断结果的概率中观察到的偏差。
The corrective procedures are also similar:
纠正程序 ,也是类似的。
Both contain a baseline prediction, which you would make if you knew nothing about the case at hand. In the categorical case, it was the base rate. In the numerical case, it is the average outcome in the relevant category.
两者都包含一个基线预测,如果你对手头的案件一无所知,你会做出这样的预测。在分类的情况下,它是基本比率。在数字的情况下,它是相关类别的平均结果。
Both contain an intuitive prediction, which expresses the number that comes to your mind, whether it is a probability or a GPA.
两者都包含一个直观的预测,表达了你脑海中的数字,无论是概率还是GPA。
In both cases, you aim for a prediction that is intermediate between the baseline and your intuitive response.
在这两种情况下,你的目标是 ,一个介于基线和你的直觉反应之间的预测。
In the default case of no useful evidence, you stay with the baseline.
在没有有用证据的默认情况下,你就保持基准线。
At the other extreme, you also stay with your initial prediction. This will happen, of course, only if you remain completely confident in your initial prediction after a critical review of the evidence that supports it.
在另一个极端,你也会坚持你最初的预测。 当然,这只有在你对支持你最初的预测的证据进行严格审查后,仍然完全有信心 ,才会发生。
In most cases you will find some reason to doubt that the correlation between your intuitive judgment and the truth is perfect, and you will end up somewhere between the two poles.
在大多数情况下,你会 ,找到一些理由来怀疑你的直觉判断和真相之间的相关性是完美的,你最终会在两极之间徘徊。
This procedure is an approximation of the likely results of an appropriate statistical analysis. If successful, it will move you toward unbiased predictions, reasonable assessments of probability, and moderate predictions of numerical outcomes. The two procedures are intended to address the same bias: intuitive predictions tend to be overconfident and overly extreme.
这个程序是对适当的统计分析的可能结果的一种近似。如果成功的话,它将使你走向无偏见的预测,对概率的合理评估,以及对数字结果的适度预测。 这两个程序旨在解决相同的偏见:直觉预测往往过于自信和过于极端。
Correcting your intuitive predictions is a task for System 2. Significant effort is required to find the relevant reference category, estimate the baseline prediction, and evaluate the quality of the evidence. The effort is justified only when the stakes are high and when you are particularly keen not to make mistakes. Furthermore, you should know that correcting your intuitions may complicate your life. A characteristic of unbiased predictions is that they permit the prediction of rare or extreme events only when the information is very good. If you expect your predictions to be of modest validity, you will never guess an outcome that is either rare or far from the mean. If your predictions are unbiased, you will never have the satisfying experience of correctly calling an extreme case. You will never be able to say, “I thought so!” when your best student in law school becomes a Supreme Court justice, or when a start-up that you thought very promising eventually becomes a major commercial success. Given the limitations of the evidence, you will never predict that an outstanding high school student will be a straight-A student at Princeton. For the same reason, a venture capitalist will never be told that the probability of success for a start-up in its early stages is “very high.”
纠正你的直觉预测是系统2的任务。 需要付出巨大的努力来找到相关的参考类别,估计基线预测,并评估证据的质量。只有在赌注很大,并且 ,你特别想不犯错误的时候,这种努力才是合理的。此外,你应该知道,纠正你的直觉可能会使你的生活变得复杂。 无偏见预测的一个特点是,只有在信息非常好的情况下才允许预测罕见或极端事件。如果你期望你的预测具有适度的有效性,你永远不会猜到一个既罕见又远离 平均值的结果。如果你的预测是不偏不倚的,你永远不会有正确判断一个极端案例的满意经历。 当你在法学院最好的学生成为最高法院法官时,或者当你认为非常有前途的创业公司最终成为重大商业成功时,你永远无法说:"我就知道!"鉴于证据的局限性,你永远不会预测 ,一个优秀的高中生将成为普林斯顿大学的全优生。出于同样的原因,风险资本家永远不会被告知,一个处于早期阶段的初创企业的成功概率是 "非常高"。
The objections to the principle of moderating intuitive predictions must be taken seriously, because absence of bias is not always what matters most. A preference for unbiased predictions is justified if all errors of prediction are treated alike, regardless of their direction. But there are situations in which one type of error is much worse than another. When a venture capitalist looks for “the next big thing,” the risk of missing the next Google or Facebook is far more important than the risk of making a modest investment in a start-up that ultimately fails. The goal of venture capitalists is to call the extreme cases correctly, even at the cost of overestimating the prospects of many other ventures. For a conservative banker making large loans, the risk of a single borrower going bankrupt may outweigh the risk of turning down several would-be clients who would fulfill their obligations. In such cases, the use of extreme language (“very good prospect,” “serious risk of default”) may have some justification for the comfort it provides, even if the information on which these judgments are based is of only modest validity.
必须认真对待对调节直觉预测的原则的反对意见,因为没有偏见并不总是最重要的。如果对所有的预测错误都一视同仁,无论其方向如何,对无偏见的 预测的偏爱是合理的。但在有些情况下,一种类型的错误要比另一种差得多。当风险资本家寻找 "下一个大事件 "时,错过下一个谷歌或Facebook的风险远比对一个最终失败的初创公司进行适度投资 的风险重要。 ,风险资本家的目标是正确判断极端情况,甚至以高估许多其他企业的前景为代价。对于一个保守的银行家来说,提供大额贷款,一个借款人破产的风险可能超过拒绝几个会履行其义务的潜在客户的风险。在这种情况下,使用极端的语言("前景非常好","严重的 违约风险")可能有一定的理由,因为它提供了安慰,即使这些判断所依据的信息只有一定的有效性。
For a rational person, predictions that are unbiased and moderate should not present a problem. After all, the rational venture capitalist knows that even the most promising start-ups have only a moderate chance of success. She views her job as picking the most promising bets from the bets that are available and does not feel the need to delude herself about the prospects of a start-up in which she plans to invest. Similarly, rational individuals predicting the revenue of a firm will not be bound to a single number—they should consider the range of uncertainty around the most likely outcome. A rational person will invest a large sum in an enterprise that is most likely to fail if the rewards of success are large enough, without deluding herself about the chances of success. However, we are not all rational, and some of us may need the security of distorted estimates to avoid paralysis. If you choose to delude yourself by accepting extreme predictions, however, you will do well to remain aware of your self-indulgence.
对于一个理性的人来说,不偏不倚和适度的预测不应该是一个问题。毕竟,理性的风险投资家知道,即使是最有前途的初创企业也只有适度的成功机会。她认为 ,她的工作就是从现有的赌注中挑选出最有前途的赌注,并不觉得有必要对她计划投资的初创企业的前景进行自我欺骗。同样,理性的个人预测公司的收入不会被一个单一的数字所束缚--他们应该考虑围绕最可能的结果的不确定性范围。 如果成功的回报足够大,一个理性的人将在一个最有可能失败的企业中投资一大笔 ,而不会对成功的机会产生错觉。然而,我们并不都是理性的,我们中的一些人可能需要扭曲的估计的安全感来避免瘫痪。 然而,如果你选择通过接受极端的预测来自欺欺人,那么你最好保持对自我放纵的意识。
Perhaps the most valuable contribution of the corrective procedures I propose is that they will require you to think about how much you know. I will use an example that is familiar in the academic world, but the analogies to other spheres of life are immediate. A department is about to hire a young professor and wants to choose the one whose prospects for scientific productivity are the best. The search committee has narrowed down the choice to two candidates:
也许我提出的纠正程序最宝贵的贡献是,它们将要求你思考你知道多少。我将使用一个在学术界很熟悉的例子,但对其他生活领域的类比是直接的。 一个部门即将雇用一名年轻教授,并希望选择一个科学生产力前景最好的人。 搜索委员会已将选择范围缩小到两名候选人。
Kim recently completed her graduate work. Her recommendations are spectacular and she gave a brilliant talk and impressed everyone in her interviews. She has no substantial track record of scientific productivity.
金最近完成了她的研究生工作。 她的推荐信非常精彩,她在面试中做了精彩的演讲,给大家留下了深刻印象。她没有实质性的科学成果记录。
Jane has held a postdoctoral position for the last three years. She has been very productive and her research record is excellent, but her talk and interviews were less sparkling than Kim’s.
Jane在过去三年中担任博士后职务。她的工作非常有成效,她的研究记录 ,但她的谈话和采访没有Kim那么闪光。
The intuitive choice favors Kim, because she left a stronger impression, and WYSIATI. But it is also the case that there is much less information about Kim than about Jane. We are back to the law of small numbers. In effect, you have a smaller sample of information from Kim than from Jane, and extreme outcomes are much more likely to be observed in small samples. There is more luck in the outcomes of small samples, and you should therefore regress your prediction more deeply toward the mean in your prediction of Kim’s future performance. When you allow for the fact that Kim is likely to regress more than Jane, you might end up selecting Jane although you were less impressed by her. In the context of academic choices, I would vote for Jane, but it would be a struggle to overcome my intuitive impression that Kim is more promising. Following our intuitions is more natural, and somehow more pleasant, than acting against them.
直观的选择有利于金,因为她给人留下了更深刻的印象,而WYSIATI。但情况也是如此,关于金的信息比关于简的信息少得多。 我们又回到了小数法则。实际上,你从金那里得到的信息样本比从简那里得到的要少,而极端的结果在小样本中更容易被观察到 。小样本的结果有更多的运气,因此你在预测金的未来表现时,应该更深入地向平均值回归。 当你允许金可能比简更多地回归这一事实时,你最终可能会选择简,尽管你对她的印象不那么深刻。在学术选择方面, ,我会投票给简,但要克服我的直觉印象,即金更有前途,那将是一场斗争。跟随我们的直觉比违背直觉行事更自然,而且在某种程度上更令人愉快。
You can readily imagine similar problems in different contexts, such as a venture capitalist choosing between investments in two start-ups that operate in different markets. One start-up has a product for which demand can be estimated with fair precision. The other candidate is more exciting and intuitively promising, but its prospects are less certain. Whether the best guess about the prospects of the second start-up is still superior when the uncertainty is factored in is a question that deserves careful consideration.
你可以很容易地想象不同背景下的类似问题,比如风险资本家在两个在不同市场运作的新创公司中选择投资。一个新创公司 ,其产品的需求可以被相当精确地估计。另一个候选公司更令人激动,直觉上很有希望,但其前景不太确定。当不确定性被考虑在内时,对第二个新创公司前景的最佳猜测是否仍有优势,这是一个值得认真考虑的问题。
A TWO-SYSTEMS VIEW OF REGRESSION
回归的双系统观点
Extreme predictions and a willingness to predict rare events from weak evidence are both manifestations of System 1. It is natural for the associative machinery to match the extremeness of predictions to the perceived extremeness of evidence on which it is based—this is how substitution works. And it is natural for System 1 to generate overconfident judgments, because confidence, as we have seen, is determined by the coherence of the best story you can tell from the evidence at hand. Be warned: your intuitions will deliver predictions that are too extreme and you will be inclined to put far too much faith in them.
极端预测和 ,愿意从薄弱的证据中预测罕见的事件,都是系统1的表现。联想机制自然会将预测的极端性与它所依据的证据的感知极端性相匹配--这就是替代的作用。系统1自然会产生过度自信的判断,因为正如我们所看到的,信心是由你能从手头的证据中说出的最佳故事的一致性 。请注意:你的直觉将提供过于极端的预测,你将倾向于对其过于信任。
Regression is also a problem for System 2. The very idea of regression to the mean is alien and difficult to communicate and comprehend. Galton had a hard time before he understood it. Many statistics teachers dread the class in which the topic comes up, and their students often end up with only a vague understanding of this crucial concept. This is a case where System 2 requires special training. Matching predictions to the evidence is not only something we do intuitively; it also seems a reasonable thing to do. We will not learn to understand regression from experience. Even when a regression is identified, as we saw in the story of the flight instructors, it will be given a causal interpretation that is almost always wrong.
回归也是系统2的一个问题。 向平均值回归的想法本身是陌生的,难以沟通和理解。高尔顿在理解它之前经历了一段艰难的时期。许多统计学教师 ,害怕在课堂上出现这个话题,他们的学生最终往往对这个关键概念只有模糊的理解。这是系统2需要特殊培训的情况。将预测与证据相匹配,不仅是 我们凭直觉做的事情;也似乎是一件合理的事情。 我们不会从经验中学习理解回归。即使是在确定了回归的情况下, ,正如我们在飞行教官的故事中看到的那样,它将被赋予一个几乎总是错误的因果解释。
SPEAKING OF INTUITIVE PREDICTIONS
说到直觉的预测
“That start-up achieved an outstanding proof of concept, but we shouldn’t expect them to do as well in the future. They are still a long way from the market and there is a lot of room for regression.”
"那家初创公司取得了出色的概念证明,但我们不应该期望他们在未来做得那么好。他们离市场还有很长的路要走,有很大的退步空间。"
“Our intuitive prediction is very favorable, but it is probably too high. Let’s take into account the strength of our evidence and regress the prediction toward the mean.”
"我们的直觉预测是非常有利的, ,但可能太高了。让我们考虑到我们证据的强度,并将预测向平均值回归。"
“The investment may be a good idea, even if the best guess is that it will fail. Let’s not say we really believe it is the next Google.”
"投资可能是一个好主意,即使最好的猜测是它会失败。让我们不要说我们真的相信它是下一个谷歌"。
“I read one review of that brand and it was excellent. Still, that could have been a fluke. Let’s consider only the brands that have a large number of reviews and pick the one that looks best.”
"我读了一篇关于该品牌的评论,它非常好。尽管如此,这可能是一种侥幸。让我们只考虑那些有 大量评论的品牌,并挑选看起来最好的一个"。
19
19
The Illusion of Understanding
理解的幻觉
The trader-philosopher-statistician Nassim Taleb could also be considered a psychologist. In The Black Swan , Taleb introduced the notion of a narrative fallacy to describe how flawed stories of the past shape our views of the world and our expectations for the future. Narrative fallacies arise inevitably from our continuous attempt to make sense of the world. The explanatory stories that people find compelling are simple; are concrete rather than abstract; assign a larger role to talent, stupidity, and intentions than to luck; and focus on a few striking events that happened rather than on the countless events that failed to happen. Any recent salient event is a candidate to become the kernel of a causal narrative. Taleb suggests that we humans constantly fool ourselves by constructing flimsy accounts of the past and believing they are true.
交易员-哲学家-统计学家纳西姆-塔勒布也可以被认为是一位心理学家。在 《黑天鹅》 中,塔勒布提出了 叙事谬误 的概念,以描述 ,过去有缺陷的故事如何塑造我们对世界的看法和对未来的期望。叙事谬误不可避免地产生于我们对世界的不断尝试。 人们认为有说服力的解释故事是简单的;是具体的,而不是抽象的;赋予天赋、愚蠢和意图更大的作用,而不是运气;关注少数发生的引人注目的事件,而不是关注 ,无数未能发生的事件。塔勒布认为,我们人类不断地通过构建对过去脆弱的描述来欺骗自己,并相信它们是真实的。
Good stories provide a simple and coherent account of people’s actions and intentions. You are always ready to interpret behavior as a manifestation of general propensities and personality traits—causes that you can readily match to effects. The halo effect discussed earlier contributes to coherence, because it inclines us to match our view of all the qualities of a person to our judgment of one attribute that is particularly significant . If we think a baseball pitcher is handsome and athletic, for example, we are likely to rate him better at throwing the ball , too. Halos can also be negative: if we think a player is ugly, we will probably underrate his athletic ability. The halo effect helps keep explanatory narratives simple and coherent by exaggerating the consistency of evaluations: good people do only good things and bad people are all bad. The statement “Hitler loved dogs and little children” is shocking no matter how many times you hear it, because any trace of kindness in someone so evil violates the expectations set up by the halo effect. Inconsistencies reduce the ease of our thoughts and the clarity of our feelings.
好的故事为人们的行为和意图提供了一个简单而连贯的说明。 你总是准备将行为解释为一般倾向和人格特征的表现 ,这些原因你可以很容易地与效果相匹配。前面讨论的光环效应有助于连贯性,因为它倾向于将我们对一个人的所有品质的看法与我们对 一个特别重要的属性 的判断相匹配。例如,如果我们认为一个棒球投手英俊潇洒,运动能力强,我们就有可能在 投球 方面给他更好的评价 。光环效应通过夸大评价的一致性来帮助保持解释叙事的简单性和连贯性:好人只做善事,坏人 ,都是坏人。"希特勒喜欢狗和小孩子 "这句话无论你听到多少次 ,都会感到震惊,因为在一个如此邪恶的人身上,任何一丝善意都违反了光环效应所设定的期望。 不一致的地方降低了我们思想的轻松和感受的清晰。
A compelling narrative fosters an illusion of inevitability. Consider the story of how Google turned into a giant of the technology industry. Two creative graduate students in the computer science department at Stanford University come up with a superior way of searching information on the Internet. They seek and obtain funding to start a company and make a series of decisions that work out well. Within a few years, the company they started is one of the most valuable stocks in America, and the two former graduate students are among the richest people on the planet. On one memorable occasion, they were lucky, which makes the story even more compelling: a year after founding Google, they were willing to sell their company for less than $1 million, but the buyer said the price was too high. Mentioning the single lucky incident actually makes it easier to underestimate the multitude of ways in which luck affected the outcome.
一个令人信服的叙述培养了一种不可避免的错觉。 考虑一下谷歌如何变成科技行业巨头的故事。斯坦福大学计算机科学 系的两名富有创造力的研究生想出了一种在互联网上搜索信息的卓越方法。在几年内,他们创办的公司成为美国最有价值的股票之一,这两位前研究生成为地球上最富有的人之一。在一个难忘的场合, ,他们是幸运的,这使故事更加引人入胜:在创立谷歌一年后,他们愿意以不到100万美元的价格 出售公司 ,但买家说价格太高。提及单一的幸运事件,实际上更容易低估运气影响结果的众多方式。
A detailed history would specify the decisions of Google’s founders, but for our purposes it suffices to say that almost every choice they made had a good outcome. A more complete narrative would describe the actions of the firms that Google defeated. The hapless competitors would appear to be blind, slow, and altogether inadequate in dealing with the threat that eventually overwhelmed them.
一段详细的历史将具体说明谷歌的 创始人的决定,但为了我们的目的,我们只需说他们所做的几乎每一个选择都有好的结果。更完整的叙述将描述谷歌所击败的公司的行动。 这些无助的竞争对手在应对最终压倒他们的威胁时显得盲目、迟钝,而且完全不充分。
I intentionally told this tale blandly, but you get the idea: there is a very good story here. Fleshed out in more detail, the story could give you the sense that you understand what made Google succeed; it would also make you feel that you have learned a valuable general lesson about what makes businesses succeed. Unfortunately, there is good reason to believe that your sense of understanding and learning from the Google story is largely illusory. The ultimate test of an explanation is whether it would have made the event predictable in advance. No story of Google’s unlikely success will meet that test, because no story can include the myriad of events that would have caused a different outcome. The human mind does not deal well with nonevents. The fact that many of the important events that did occur involve choices further tempts you to exaggerate the role of skill and underestimate the part that luck played in the outcome. Because every critical decision turned out well, the record suggests almost flawless prescience—but bad luck could have disrupted any one of the successful steps. The halo effect adds the final touches, lending an aura of invincibility to the heroes of the story.
我故意把这个故事讲得很平淡,但你会明白: ,这里有一个非常好的故事。如果更详细地阐述,这个故事可以让你感觉到,你明白是什么让谷歌成功;它也会让你感觉到,你已经学到了关于什么让企业成功的宝贵的一般经验。不幸的是,我们有充分的理由相信,你对谷歌故事的理解和学习的感觉在很大程度上是虚幻的。对一个解释的最终 测试是它是否会使事件事先可以预测。任何关于谷歌不太可能成功的故事都无法满足这一测试,因为没有任何故事能够包括无数可能导致不同结果的事件。 人类的思维并不能很好地处理非事件。 许多确实发生的重要事件涉及选择,这一事实进一步诱使你夸大 ,夸大技巧的作用,而低估运气在结果中发挥的作用。由于每个关键决定都结果良好,记录表明几乎没有缺陷的预知,但坏运气可能打乱任何一个成功步骤。光环效应为故事中的英雄们增添了最后的润色,使他们拥有了无敌的光环。
Like watching a skilled rafter avoiding one potential calamity after another as he goes down the rapids, the unfolding of the Google story is thrilling because of the constant risk of disaster. However, there is an instructive difference between the two cases. The skilled rafter has gone down rapids hundreds of times. He has learned to read the roiling water in front of him and to anticipate obstacles. He has learned to make the tiny adjustments of posture that keep him upright. There are fewer opportunities for young men to learn how to create a giant company, and fewer chances to avoid hidden rocks—such as a brilliant innovation by a competing firm. Of course there was a great deal of skill in the Google story, but luck played a more important role in the actual event than it does in the telling of it. And the more luck was involved, the less there is to be learned.
就像 ,看着一个熟练的漂流者在急流中避开一个又一个潜在的 灾难,谷歌故事的展开是惊心动魄的,因为灾难的风险不断。然而,这两种情况之间有一个很有启发性的区别。 熟练的漂流者已经从激流中走了数百次。他已经学会了读懂他面前的咆哮的水和预测障碍物。他已经学会了对 姿势的微小调整,使他保持直立。年轻人学习如何创建一个巨型公司的机会较少,避免暗礁的机会也较少--比如竞争公司的辉煌创新。当然,在谷歌的故事中,有大量的技巧,但运气在实际事件中发挥的作用比它在讲述中的作用更重要。而且,运气参与得越多, ,就越不值得学习。
At work here is that powerful WY SIATI rule. You cannot help dealing with the limited information you have as if it were all there is to know. You build the best possible story from the information available to you, and if it is a good story, you believe it. Paradoxically, it is easier to construct a coherent story when you know little, when there are fewer pieces to fit into the puzzle. Our comforting conviction that the world makes sense rests on a secure foundation: our almost unlimited ability to ignore our ignorance.
在这里起作用的是那个强大的WY SIATI规则。你不能不处理你所拥有的有限信息,就好像它是你所知道的全部。 你从你所拥有的信息中建立一个尽可能好的故事,如果它是一个好故事,你就相信它。矛盾的是,当你知之甚少时,要构建一个连贯的故事是比较容易的,因为要把更多的碎片装进 。我们对世界有意义的安慰性信念建立在一个安全的基础上:我们几乎有无限的能力来忽视我们的无知。
I have heard of too many people who “knew well before it happened that the 2008 financial crisis was inevitable.” This sentence contains a highly objectionable word, which should be removed from our vocabulary in discussions of major events. The word is, of course, knew . Some people thought well in advance that there would be a crisis, but they did not know it. They now say they knew it because the crisis did in fact happen. This is a misuse of an important concept. In everyday language, we apply the word know only when what was known is true and can be shown to be true. We can know something only if it is both true and knowable. But the people who thought there would be a crisis (and there are fewer of them than now remember thinking it) could not conclusively show it at the time. Many intelligent and well-informed people were keenly interested in the future of the economy and did not believe a catastrophe was imminent; I infer from this fact that the crisis was not knowable. What is perverse about the use of know in this context is not that some individuals get credit for prescience that they do not deserve. It is that the language implies that the world is more knowable than it is. It helps perpetuate a pernicious illusion.
我听说有太多的人 "在事情发生之前就知道2008年的金融危机是不可避免的"。这句话包含了一个非常令人反感的词,在讨论重大事件时应该从我们的词汇中删除。 这个词当然是 知道 。有些人事先就想到会有危机,但他们并不知道。他们现在说他们知道,因为危机确实发生了。 这是对一个重要概念的误用。在日常语言中,只有当已知的东西是真实的并且可以被证明是真实的时候,我们才会应用 知道 这个词。只有当某件事情既是真实的又是可知的,我们才能知道它。 但是认为会有危机的人(而且比现在记得的认为危机的人少)当时无法确凿地表明。许多聪明且消息灵通的人对经济的未来非常感兴趣,不认为灾难迫在眉睫;我从这一事实推断,危机是不可知的。在 这种情况下,使用 知道 是不正常的,并不是说有些人因为预知而得到了他们不应该得到的荣誉。它是指这种语言暗示世界比它更可知。 它帮助延续了一种有害的幻觉。
The core of the illusion is that we believe we understand the past, which implies that the future also should be knowable, but in fact we understand the past less than we believe we do. Know is not the only word that fosters this illusion. In common usage, the words intuition and premonition also are reserved for past thoughts that turned out to be true. The statement “I had a premonition that the marriage would not last, but I was wrong” sounds odd, as does any sentence about an intuition that turned out to be false. To think clearly about the future, we need to clean up the language that we use in labeling the beliefs we had in the past.
幻觉的核心是我们相信我们了解过去,这意味着未来也应该是可知的,但事实上我们对过去的了解不如我们认为的 。 知道 并不是促进 这种幻觉的唯一词汇。在通常的用法中, 直觉 和 预感 这两个词也是为过去的想法保留的,结果是真的。"我有一种预感,认为这段婚姻不会长久,但我错了 "这句话听起来很奇怪,任何关于直觉被证明是错误的句子也一样。为了清楚地思考未来,我们需要清理 ,我们在给过去的信念贴标签时使用的语言。
THE SOCIAL COSTS OF HINDSIGHT
后见之明的社会成本
The mind that makes up narratives about the past is a sense-making organ. When an unpredicted event occurs, we immediately adjust our view of the world to accommodate the surprise. Imagine yourself before a football game between two teams that have the same record of wins and losses. Now the game is over, and one team trashed the other. In your revised model of the world, the winning team is much stronger than the loser, and your view of the past as well as of the future has been altered by that new perception. Learning from surprises is a reasonable thing to do, but it can have some dangerous consequences.
编造关于过去的叙述的头脑是一个感觉制造器官。 当一个未预料到的事件发生时,我们立即调整我们对世界的看法以适应这个惊喜。想象一下自己在两支胜负记录相同的球队之间的足球比赛前。现在比赛 ,一支球队捣毁了另一支。在你修订的世界模型中,赢家比输家强得多,而你对过去以及未来的看法也被这种新的认知所改变。 从意外中学习是一件合理的事情,但它可能产生一些危险的后果。
A general limitation of the human mind is its imperfect ability to reconstruct past states of knowledge, or beliefs that have changed. Once you adopt a new view of the world (or of any part of it), you immediately lose much of your ability to recall what you used to believe before your mind changed.
人类思维的一个普遍局限性是其重建 过去的知识状态或已经改变的信念的能力不完善。一旦你对世界(或世界的任何部分)采取了新的观点,你就会立即失去回忆你在思维改变之前曾经相信的大部分能力。
Many psychologists have studied what happens when people change their minds. Choosing a topic on which minds are not completely made up—say, the death penalty—the experimenter carefully measures people’s attitudes. Next, the participants see or hear a persuasive pro or con message. Then the experimenter measures people’s attitudes again; they usually are closer to the persuasive message they were exposed to. Finally, the participants report the opinion they held beforehand. This task turns out to be surprisingly difficult. Asked to reconstruct their former beliefs, people retrieve their current ones instead—an instance of substitution—and many cannot believe that they ever felt differently .
许多心理学家研究了当人们改变他们的想法时会发生什么。 选择一个思想还没有完全定型的话题--例如,死刑--实验者 ,仔细测量人们的态度。接下来,参与者看到或听到一个有说服力的支持或反对信息。 然后,实验者再次测量人们的态度;他们通常更接近于他们所接触的说服性信息。 最后,参与者报告他们事先持有的意见。这项任务原来是出乎意料的困难。在被要求重建他们以前的信念时,人们 ,而找回他们现在的信念--这是一个替代的例子--许多人无法相信他们 曾经有过不同的感觉 。
Your inability to reconstruct past beliefs will inevitably cause you to underestimate the extent to which you were surprised by past events. Baruch Fischhoff first demonstrated this “I-knew-it-all-along” effect, or hindsight bias , when he was a student in Jerusalem. Together with Ruth Beyth (another of our students), Fischhoff conducted a survey before President Richard Nixon visited China and Russia in 1972. The respondents assigned probabilities to fifteen possible outcomes of Nixon’s diplomatic initiatives. Would Mao Zedong agree to meet with Nixon? Might the United States grant diplomatic recognition to China? After decades of enmity, could the United States and the Soviet Union agree on anything significant?
你无法重建过去的信念,将不可避免地导致你低估你对过去事件的惊讶程度。 巴鲁克-菲什霍夫(Baruch Fischhoff)在耶路撒冷当学生时,首次证明了这种 "我早就知道 "的效应,或 事后诸葛亮式的偏见 。Fischhoff与Ruth Beyth(我们的另一个学生)一起 ,在1972年理查德-尼克松总统访问中国和俄罗斯之前进行了一次调查。受访者为尼克松的外交 举措的15种可能结果分配了概率。 毛泽东会同意与尼克松会面吗?美国可能会对中国给予外交承认吗?经过几十年的敌对, 美国和苏联 能否在任何重大问题上达成一致 ?
After Nixon’s return from his travels, Fischhoff and Beyth asked the same people to recall the probability that they had originally assigned to each of the fifteen possible outcomes. The results were clear. If an event had actually occurred, people exaggerated the probability that they had assigned to it earlier. If the possible event had not come to pass, the participants erroneously recalled that they had always considered it unlikely. Further experiments showed that people were driven to overstate the accuracy not only of their original predictions but also of those made by others. Similar results have been found for other events that gripped public attention, such as the O. J. Simpson murder trial and the impeachment of President Bill Clinton. The tendency to revise the history of one’s beliefs in light of what actually happened produces a robust cognitive illusion.
在尼克松旅行归来后,菲斯霍夫和贝斯要求同样的人回忆他们最初为15种可能的结果中的每一种分配的概率。 结果很清楚。 如果一个事件真的发生了,人们会夸大他们先前分配给它的概率。如果可能的事件没有发生,参与者 ,错误地回忆起他们一直认为它是不可能的。进一步的实验表明,人们不仅被驱使夸大他们原来预测的准确性,而且也夸大别人的预测。类似的结果也出现在其他引起公众关注的事件中,如O.J.辛普森谋杀案的审判和对比尔-克林顿总统的弹劾。倾向于 ,根据实际发生的情况修改自己的信仰历史,这产生了一个强大的认知幻觉。
Hindsight bias has pernicious effects on the evaluations of decision makers. It leads observers to assess the quality of a decision not by whether the process was sound but by whether its outcome was good or bad. Consider a low-risk surgical intervention in which an unpredictable accident occurred that caused the patient’s death. The jury will be prone to believe, after the fact, that the operation was actually risky and that the doctor who ordered it should have known better. This outcome bias makes it almost impossible to evaluate a decision properly—in terms of the beliefs that were reasonable when the decision was made.
后见之明的偏见对决策者的评价有着有害的影响。它导致观察者在评估 一项决定的质量 时,不是看其过程是否合理,而是看其结果是好是坏。考虑一个低风险的手术干预,其中发生了一个不可预测的事故 ,导致病人死亡。陪审团在事后很容易相信,该手术实际上是有风险的,而且下达手术的医生应该知道得更清楚。这种结果偏差使我们几乎不可能正确地评价一个决定--从作出决定时的合理信念来看。
Hindsight is especially unkind to decision makers who act as agents for others—physicians, financial advisers, third-base coaches, CEOs, social workers, diplomats, politicians. We are prone to blame decision makers for good decisions that worked out badly and to give them too little credit for successful moves that appear obvious only after the fact. There is a clear outcome bias . When the outcomes are bad, the clients often blame their agents for not seeing the handwriting on the wall—forgetting that it was written in invisible ink that became legible only afterward. Actions that seemed prudent in foresight can look irresponsibly negligent in hindsight. Based on an actual legal case, students in California were asked whether the city of Duluth, Minnesota, should have shouldered the considerable cost of hiring a full-time bridge monitor to protect against the risk that debris might get caught and block the free flow of water. One group was shown only the evidence available at the time of the city’s decision; 24% of these people felt that Duluth should take on the expense of hiring a flood monitor. The second group was informed that debris had blocked the river, causing major flood damage; 56% of these people said the city should have hired the monitor , although they had been explicitly instructed not to let hindsight distort their judgment.
事后诸葛亮对那些作为他人代理人的决策者尤其不友好--医生、财务顾问、三垒教练、首席执行官、社会工作者、外交官、政治家。我们很容易为那些效果不好的好决定责备决策者,为那些事后才发现的成功举措给予他们太少的赞扬。有一个明显的 结果偏见 。 当结果不好时,客户往往责怪他们的代理人 ,没有看到墙上的字迹--忘记了它是用看不见的墨水写的,只有在事后才变得清晰可见。在预见中似乎很谨慎的行为,在事后看来可能是不负责任的过失。根据一个实际的法律案例,加利福尼亚的学生被问及明尼苏达州德卢斯市是否应该承担雇用全职桥梁监测员的大量费用,以 ,防止碎片可能被卷入并阻碍水流自由流动的风险。有一组人只看到了城市决策时的证据;其中24%的人认为德卢斯市应该承担雇佣洪水监测员的费用。 第二组人被告知瓦砾堵塞了河道,造成了重大的水灾损失;这些人中有56%的人说,市政府 应该雇用监测员 ,尽管他们被明确指示不要让事后诸葛亮扭曲了他们的判断。
The worse the consequence, the greater the hindsight bias. In the case of a catastrophe, such as 9/11, we are especially ready to believe that the officials who failed to anticipate it were negligent or blind. On July 10, 2001, the Central Intelligence Agency obtained information that al-Qaeda might be planning a major attack against the United States. George Tenet, director of the CIA, brought the information not to President George W. Bush but to National Security Adviser Condoleezza Rice. When the facts later emerged, Ben Bradlee, the legendary executive editor of The Washington Post , declared, “It seems to me elementary that if you’ve got the story that’s going to dominate history you might as well go right to the president.” But on July 10, no one knew—or could have known—that this tidbit of intelligence would turn out to dominate history.
后果越严重,后见之明的偏差就越大。在发生灾难的情况下,如9/11事件,我们特别愿意相信那些没有预见到灾难的官员是疏忽或盲目的。2001年7月10日,中央情报局获得情报,基地组织可能正在 ,计划对美国进行重大袭击。中央情报局局长乔治-特尼特(George Tenet)没有将该情报带给乔治-W-布什总统,而是带给了国家安全顾问康多莉扎-赖斯。当事实后来出现时, 《华盛顿邮报 》的传奇执行编辑本-布拉德利宣称:"在我看来,如果你有一个将主导历史的故事,你可能 ,也可以直接去找总统。"但在7月10日,没有人知道--或者说不可能知道--这个 小道消息 将变成主宰历史的因素。
Because adherence to standard operating procedures is difficult to second-guess, decision makers who expect to have their decisions scrutinized with hindsight are driven to bureaucratic solutions—and to an extreme reluctance to take risks . As malpractice litigation became more common, physicians changed their procedures in multiple ways: ordered more tests, referred more cases to specialists, applied conventional treatments even when they were unlikely to help. These actions protected the physicians more than they benefited the patients, creating the potential for conflicts of interest. Increased accountability is a mixed blessing.
由于对标准操作程序的遵守很难被猜测,那些预期自己的决定会被事后审查的决策者会被驱使采取官僚主义的解决方案,并且极度 不愿意承担风险 。随着 渎职诉讼变得越来越普遍,医生们以多种方式改变了他们的程序:订购更多的检查,将更多的病例转给专家,即使在不可能有帮助的情况下也采用常规治疗。 这些行动对医生的保护大于对病人的帮助,创造了利益冲突的可能。加强问责制是一个混合的祝福。
Although hindsight and the outcome bias generally foster risk aversion, they also bring undeserved rewards to irresponsible risk seekers, such as a general or an entrepreneur who took a crazy gamble and won. Leaders who have been lucky are never punished for having taken too much risk. Instead, they are believed to have had the flair and foresight to anticipate success, and the sensible people who doubted them are seen in hindsight as mediocre, timid, and weak. A few lucky gambles can crown a reckless leader with a halo of prescience and boldness.
虽然事后诸葛亮和结果偏差通常会助长风险规避,但它们也会给不负责任的风险寻求者带来不应有的回报,比如一个将军或一个企业家进行了一场疯狂的赌博并赢了。运气好的领导人从来不会因为承担过多的风险而受到惩罚。 相反,他们被认为具有预见成功的天赋和远见,而那些怀疑他们的明智的人 ,事后被视为平庸、胆小和软弱。几次幸运的赌博可以给一个鲁莽的领导人戴上先知先觉和大胆的光环。
RECIPES FOR SUCCESS
成功的秘诀
The sense-making machinery of System 1 makes us see the world as more tidy, simple, predictable, and coherent than it really is. The illusion that one has understood the past feeds the further illusion that one can predict and control the future. These illusions are comforting. They reduce the anxiety that we would experience if we allowed ourselves to fully acknowledge the uncertainties of existence. We all have a need for the reassuring message that actions have appropriate consequences, and that success will reward wisdom and courage. Many business books are tailor-made to satisfy this need.
系统1的意义制造机制使我们把世界看得比实际情况更整洁、更简单、更可预测、更连贯。"一个人已经理解了过去 "的错觉进一步助长了 "一个人可以预测 、 、控制未来 "的错觉。这些幻觉是令人欣慰的。他们减少了我们的焦虑,如果我们允许自己完全承认存在的不确定性,我们都需要一个令人放心的信息,即行动有适当的后果,成功会奖励智慧和勇气。许多商业书籍是为满足这一需求而量身定做的。
Do leaders and management practices influence the outcomes of firms in the market? Of course they do, and the effects have been confirmed by systematic research that objectively assessed the characteristics of CEOs and their decisions, and related them to subsequent outcomes of the firm. In one study, the CEOs were characterized by the strategy of the companies they had led before their current appointment , as well as by management rules and procedures adopted after their appointment. CEOs do influence performance, but the effects are much smaller than a reading of the business press suggests.
领导人和管理层 的做法是否影响企业在市场中的结果?当然,这些影响已经被系统的研究所证实,这些研究客观地评估了首席执行官的特征和他们的决策,并将它们与公司的后续结果联系起来。在一项研究中,首席执行官的特点是他们在被 任命之前 所领导的公司的战略,以及 ,他们被任命之后所采用的管理规则和程序。首席执行官确实影响了业绩,但其影响要比阅读商业媒体所暗示的小得多。
Researchers measure the strength of relationships by a correlation coefficient, which varies between 0 and 1. The coefficient was defined earlier (in relation to regression to the mean) by the extent to which two measures are determined by shared factors. A very generous estimate of the correlation between the success of the firm and the quality of its CEO might be as high as .30, indicating 30% overlap. To appreciate the significance of this number, consider the following question:
研究人员通过相关系数来衡量关系的强度,该系数在0和1之间变化。该系数在前面被定义为(与回归平均值有关),即两个测量 ,由共同的因素决定的程度。一个非常慷慨的估计,公司的成功和其CEO的质量之间的相关性可能高达0.30,表明30%的重叠。为了理解这个数字的意义,请考虑以下问题。
Suppose you consider many pairs of firms. The two firms in each pair are generally similar, but the CEO of one of them is better than the other. How often will you find that the firm with the stronger CEO is the more successful of the two?
假设你考虑许多对公司。 每对中的两个公司一般都很相似,但其中一个公司的首席执行官比另一个公司更好 。 你有多少次会发现 ,拥有更强大的首席执行官的公司是两者中更成功的?
In a well-ordered and predictable world, the correlation would be perfect (1), and the stronger CEO would be found to lead the more successful firm in 100% of the pairs. If the relative success of similar firms was determined entirely by factors that the CEO does not control (call them luck, if you wish), you would find the more successful firm led by the weaker CEO 50% of the time. A correlation of .30 implies that you would find the stronger CEO leading the stronger firm in about 60% of the pairs—an improvement of a mere 10 percentage points over random guessing, hardly grist for the hero worship of CEOs we so often witness.
在一个秩序良好、可预测的世界里,这种相关性将是完美的(1),在100%的配对中,较强的CEO将被发现领导更成功的公司。 如果类似公司的相对成功完全由CEO无法控制的因素决定(如果你愿意,可以把它们称为 运气),你会发现较弱的CEO领导的公司在50%的情况下更成功。0.30的相关性意味着你会发现,在大约60%的配对中,实力较强的CEO会领导实力较强的公司--比随机猜测只提高了10个百分点,对于我们经常看到的对CEO的英雄崇拜来说,这几乎是不可能的。
If you expected this value to be higher—and most of us do—then you should take that as an indication that you are prone to overestimate the predictability of the world you live in. Make no mistake: improving the odds of success from 1:1 to 3:2 is a very significant advantage, both at the racetrack and in business. From the perspective of most business writers, however, a CEO who has so little control over performance would not be particularly impressive even if her firm did well. It is difficult to imagine people lining up at airport bookstores to buy a book that enthusiastically describes the practices of business leaders who, on average, do somewhat better than chance. Consumers have a hunger for a clear message about the determinants of success and failure in business, and they need stories that offer a sense of understanding, however illusory.
如果你预计这个数值会更高--我们中的大多数 ,那么你应该把这看作是你容易高估你所生活的世界的 可预测性。不要搞错:把成功的几率从1:1提高到3:2是一个非常重要的优势,无论是在赛马场还是在商业领域。 然而,从大多数商业作家的角度来看,一个对业绩控制力如此之小的CEO,即使她的公司做得好,也不会有特别 。很难想象人们会在机场书店排队购买一本热情洋溢地描述商业领袖做法的书,而这些商业领袖的表现平均来说比机会要好一些。 消费者渴望得到关于商业成功和失败的决定因素的明确信息,他们需要那些提供理解感的故事,无论 ,都是虚幻的。
In his penetrating book The Halo Effect , Philip Rosenzweig, a business school professor based in Switzerland, shows how the demand for illusory certainty is met in two popular genres of business writing: histories of the rise (usually) and fall (occasionally) of particular individuals and companies, and analyses of differences between successful and less successful firms. He concludes that stories of success and failure consistently exaggerate the impact of leadership style and management practices on firm outcomes, and thus their message is rarely useful.
瑞士的商学院教授菲利普-罗森茨威格在 他的 《 光环效应 》 一书中 ,展示了两种流行的商业写作体裁是如何满足对虚幻确定性的需求的:特定个人和公司的崛起(通常)和衰落(偶尔)的历史,以及对成功和不太成功的公司之间差异的分析。他的结论是: ,成功和失败的故事始终夸大了领导风格和管理实践对公司结果的影响,因此,他们的信息很少有用。
To appreciate what is going on, imagine that business experts, such as other CEOs, are asked to comment on the reputation of the chief executive of a company. They are keenly aware of whether the company has recently been thriving or failing. As we saw earlier in the case of Google, this knowledge generates a halo. The CEO of a successful company is likely to be called flexible, methodical, and decisive. Imagine that a year has passed and things have gone sour. The same executive is now described as confused, rigid, and authoritarian. Both descriptions sound right at the time: it seems almost absurd to call a successful leader rigid and confused, or a struggling leader flexible and methodical.
为了理解发生了什么,想象一下,商业专家,如其他首席执行官,被要求对一家公司的首席执行官的声誉进行评论。他们敏锐地意识到,公司最近是 蓬勃发展还是失败。正如我们先前在谷歌的案例中看到的那样,这种知识产生了一种光环。一个成功公司的首席执行官很可能被称为灵活、有条不紊和果断的人。想象一下,一年过去了,事情已经变味了。同一个行政部门现在被描述为混乱、僵化和独裁。这两种描述在当时听起来都是正确的:把一个成功的 领导人称为僵化和混乱,或把一个挣扎中的领导人称为灵活和有条不紊,似乎都是荒谬的。
Indeed, the halo effect is so powerful that you probably find yourself resisting the idea that the same person and the same behaviors appear methodical when things are going well and rigid when things are going poorly. Because of the halo effect, we get the causal relationship backward: we are prone to believe that the firm fails because its CEO is rigid, when the truth is that the CEO appears to be rigid because the firm is failing. This is how illusions of understanding are born.
事实上,光环效应是如此强大,以至于你可能会发现自己抵制这样的想法:同一个人和同样的行为在事情进展顺利时显得有条不紊,而在事情进展不顺利时则显得僵化。由于光环效应,我们把因果关系弄反了:我们很容易相信 ,因为它的CEO是僵硬的,而事实是CEO显得僵硬是因为公司正在失败。理解的幻觉就是这样产生的。
The halo effect and outcome bias combine to explain the extraordinary appeal of books that seek to draw operational morals from systematic examination of successful businesses. One of the best-known examples of this genre is Jim Collins and Jerry I. Porras’s Built to Last . The book contains a thorough analysis of eighteen pairs of competing companies, in which one was more successful than the other. The data for these comparisons are ratings of various aspects of corporate culture, strategy, and management practices. “We believe every CEO, manager, and entrepreneur in the world should read this book,” the authors proclaim. “You can build a visionary company .”
光环效应和结果偏见结合在一起,解释了那些试图从对成功企业的系统检查中得出经营道德的书籍的非凡吸引力。 这种类型的最著名的例子之一是吉姆 柯林斯和杰里I.波拉斯的《 基业长青 》。该书包含了对18对竞争公司的 ,其中一个比另一个更成功的透彻分析。这些比较的数据是对企业文化、战略和管理实践各方面的评级。"我们相信世界上每一位首席执行官、经理和企业家都应该读一读这本书,"作者宣称。 "你可以建立 一个有远见的公司 "。
The basic message of Built to Last and other similar books is that good managerial practices can be identified and that good practices will be rewarded by good results. Both messages are overstated. The comparison of firms that have been more or less successful is to a significant extent a comparison between firms that have been more or less lucky. Knowing the importance of luck, you should be particularly suspicious when highly consistent patterns emerge from the comparison of successful and less successful firms. In the presence of randomness, regular patterns can only be mirages.
经久不衰 》和其他类似书籍的基本信息是,良好的管理实践可以被发现,而良好的实践将得到良好的回报。 这两个信息都被夸大了。 对成功与否的公司的比较,在很大程度上是对运气好坏的公司的比较。由于知道 运气的重要性,当成功和不太成功的公司的比较中出现高度一致的模式时,你应该特别怀疑。 在随机性的存在下,有规律的模式只能是海市蜃楼。
Because luck plays a large role, the quality of leadership and management practices cannot be inferred reliably from observations of success. And even if you had perfect foreknowledge that a CEO has brilliant vision and extraordinary competence, you still would be unable to predict how the company will perform with much better accuracy than the flip of a coin . On average, the gap in corporate profitability and stock returns between the outstanding firms and the less successful firms studied in Built to Last shrank to almost nothing in the period following the study. The average profitability of the companies identified in the famous In Search of Excellence dropped sharply as well within a short time. A study of Fortune ’s “Most Admired Companies” finds that over a twenty-year period, the firms with the worst ratings went on to earn much higher stock returns than the most admired firms.
因为运气起着很大的作用,领导和管理实践的质量不能从对成功的观察中可靠地推断出来。而且,即使你有完美的 ,知道一个CEO有卓越的远见和非凡的能力,你仍然无法预测公司将如何表现,其准确度比 掷硬币 高得多。平均来说,在 研究之后的这段时间里,杰出的公司和不太成功的公司在 公司 盈利能力和股票回报方面的差距几乎缩小到了零。在著名的 《寻找卓越 》中所确定的公司的平均利润率也在短时间内急剧下降。 对 《财富》杂志 "最受赞赏的公司 " 的研究发现,在20年的时间里,评级最差的公司后来的股票回报率远远高于最受赞赏的公司。
You are probably tempted to think of causal explanations for these observations: perhaps the successful firms became complacent, the less successful firms tried harder. But this is the wrong way to think about what happened. The average gap must shrink, because the original gap was due in good part to luck, which contributed both to the success of the top firms and to the lagging performance of the rest. We have already encountered this statistical fact of life: regression to the mean .
你可能很想为这些 :也许成功的公司变得自满,不太成功的公司则更努力。但这是对所发生情况的错误思考。 平均差距必须缩小,因为原来的差距在很大程度上是由运气造成的,它既有助于顶级公司的成功,也有助于其他公司的落后表现。 我们已经遇到了这种生活的统计事实: 向平均值回归 。
Stories of how businesses rise and fall strike a chord with readers by offering what the human mind needs: a simple message of triumph and failure that identifies clear causes and ignores the determinative power of luck and the inevitability of regression. These stories induce and maintain an illusion of understanding, imparting lessons of little enduring value to readers who are all too eager to believe them.
企业兴衰的故事通过提供人类思维所需要的东西而引起读者的共鸣:一个简单的胜利和失败的信息,找出明确的原因,忽略了运气的决定性力量和退步的不可避免性。这些故事诱导并维持一种理解的假象 ,向读者传授没有什么持久价值的教训 ,而他们都太渴望相信它们。
SPEAKING OF HINDSIGHT
说到后知后觉
“The mistake appears obvious, but it is just hindsight. You could not have known in advance.”
"这个错误看起来很明显,但这只是后知后觉。你不可能事先知道。"
“He’s learning too much from this success story, which is too tidy. He has fallen for a narrative fallacy.”
"他从这个成功的故事中学到了太多东西,这太整齐了。他上了一个叙事谬误的当。"
“She has no evidence for saying that the firm is badly managed. All she knows is that its stock has gone down. This is an outcome bias, part hindsight and part halo effect.”
"她没有证据说该公司管理不善。 她只知道该公司的股票已经下跌了。这是一种结果 偏见,部分是事后诸葛亮,部分是光环效应。"
“Let’s not fall for the outcome bias. This was a stupid decision even though it worked out well.”
"我们不要被结果的偏见所迷惑。这是一个愚蠢的决定,尽管它的结果很好。"
20
20
The Illusion of Validity
有效性的幻觉
System 1 is designed to jump to conclusions from little evidence—and it is not designed to know the size of its jumps. Because of WYSIATI, only the evidence at hand counts. Because of confidence by coherence, the subjective confidence we have in our opinions reflects the coherence of the story that System 1 and System 2 have constructed. The amount of evidence and its quality do not count for much, because poor evidence can make a very good story. For some of our most important beliefs we have no evidence at all, except that people we love and trust hold these beliefs. Considering how little we know, the confidence we have in our beliefs is preposterous—and it is also essential.
系统1被设计成从很少的证据中得出结论--而且它被设计成不知道其跳跃的大小。 因为所见即所得,只有手头的证据才算数。因为一致性的信心, ,我们对我们的观点的主观信心反映了系统1和系统2所构建的故事的一致性。证据的数量和质量并不重要,因为拙劣的证据可以构成一个非常好的故事。 对于我们一些最重要的信念,我们根本没有任何证据,除了我们所爱和信任的人持有这些信念。考虑到我们所知甚少, ,我们对自己的信念的信心是荒谬的,而且也是至关重要的。
THE ILLUSION OF VALIDITY
有效性的假象
Many decades ago I spent what seemed like a great deal of time under a scorching sun, watching groups of sweaty soldiers as they solved a problem. I was doing my national service in the Israeli Army at the time. I had completed an undergraduate degree in psychology, and after a year as an infantry officer was assigned to the army’s Psychology Branch, where one of my occasional duties was to help evaluate candidates for officer training. We used methods that had been developed by the British Army in World War II.
几十年前,我在烈日下花了似乎是大量的时间,看着一群群汗流浃背的士兵解决一个个问题。我已经完成了心理学的本科学位,在做了一年的 步兵军官后,被分配到军队的心理学部门,我在那里的一个偶尔的职责是帮助评估军官培训的候选人。我们使用的方法是英国军队在第二次世界大战中开发的。
One test, called the “leaderless group challenge,” was conducted on an obstacle field. Eight candidates, strangers to each other, with all insignia of rank removed and only numbered tags to identify them, were instructed to lift a long log from the ground and haul it to a wall about six feet high. The entire group had to get to the other side of the wall without the log touching either the ground or the wall, and without anyone touching the wall. If any of these things happened, they had to declare it and start again.
其中一项测试被称为 "无领导小组挑战",是在一个障碍物场地上进行的。八名候选人彼此陌生,去掉了所有的军衔标志, ,只有编号的标签可以识别他们,他们被指示 ,从地上抬起一根长木头,把它拖到约六英尺高的墙上。整个小组必须在木头不接触地面或墙面的情况下到达墙的另一边,而且不能有任何人触碰墙面。如果发生这些事情中的任何一件,他们必须宣布,并重新开始。
There was more than one way to solve the problem. A common solution was for the team to send several men to the other side by crawling over the pole as it was held at an angle, like a giant fishing rod, by other members of the group. Or else some soldiers would climb onto someone’s shoulders and jump across. The last man would then have to jump up at the pole, held up at an angle by the rest of the group, shinny his way along its length as the others kept him and the pole suspended in the air, and leap safely to the other side. Failure was common at this point, which required them to start all over again.
解决 问题的方法不止一种。一个常见的解决方案是,队伍中的其他成员以一定的角度举着杆子,像一个巨大的鱼竿一样,通过爬行将几个人送到另一边。 或者,一些士兵会爬上某人的肩膀,跳过去。然后,最后一个人必须跳到杆子上,由其余的人以一定的角度举起,沿着杆子的长度闪闪发光, ,其他的人让他和杆子悬浮在空中,然后安全地跳到另一边。在这一点上,失败是常见的,这需要他们重新开始。
As a colleague and I monitored the exercise, we made note of who took charge, who tried to lead but was rebuffed, how cooperative each soldier was in contributing to the group effort. We saw who seemed to be stubborn, submissive, arrogant, patient, hot-tempered, persistent, or a quitter. We sometimes saw competitive spite when someone whose idea had been rejected by the group no longer worked very hard. And we saw reactions to crisis: who berated a comrade whose mistake had caused the whole group to fail, who stepped forward to lead when the exhausted team had to start over. Under the stress of the event, we felt, each man’s true nature revealed itself. Our impression of each candidate’s character was as direct and compelling as the color of the sky.
当我和一位同事监督演习时,我们记下了谁负责,谁试图领导但被拒绝,每个士兵在为小组努力做出贡献时的合作程度。我们看到谁似乎很顽固、顺从、 傲慢、有耐心、脾气暴躁、坚持不懈或放弃。我们有时看到竞争的怨恨,当某人的想法被小组拒绝后,他不再非常努力。我们还看到了对危机的反应:谁斥责了一个因错误而导致整个小组失败的同志,谁在精疲力竭的团队不得不重新开始时挺身而出,发挥领导作用。在事件的压力下,我们感觉到,每个 人的真实本性显露出来。我们对每个候选人的性格印象就像天空的颜色一样直接和引人注目。
After watching the candidates make several attempts, we had to summarize our impressions of soldiers’ leadership abilities and determine, with a numerical score, who should be eligible for officer training. We spent some time discussing each case and reviewing our impressions. The task was not difficult, because we felt we had already seen each soldier’s leadership skills. Some of the men had looked like strong leaders, others had seemed like wimps or arrogant fools, others mediocre but not hopeless. Quite a few looked so weak that we ruled them out as candidates for officer rank. When our multiple observations of each candidate converged on a coherent story, we were completely confident in our evaluations and felt that what we had seen pointed directly to the future. The soldier who took over when the group was in trouble and led the team over the wall was a leader at that moment. The obvious best guess about how he would do in training, or in combat, was that he would be as effective then as he had been at the wall. Any other prediction seemed inconsistent with the evidence before our eyes.
在观看了候选人的几次尝试后,我们必须总结我们对士兵领导能力的印象,并用数字评分来确定谁应该有资格接受军官培训。 我们花了一些时间讨论每个案例,并回顾 我们的印象。这项任务并不困难,因为我们觉得我们已经看到了每个士兵的领导能力。有些人看起来像强有力的领导者,有些人看起来像懦夫或傲慢的傻瓜,有些人平庸但并非毫无希望。 相当多的人看起来如此软弱,以至于我们排除了他们作为军官级别候选人的可能性。当我们对每个候选人的多种观察汇聚成一个连贯的 ,我们对自己的评价完全有信心,并觉得我们所看到的直接指向了未来。在队伍遇到困难时接手并带领队伍越过城墙的士兵在那一刻就是一个领导者。 关于他在训练中,或在战斗中的表现,明显的最佳猜测是,他那时会像在城墙上一样有效。任何其他预测似乎 ,与我们眼前的证据不一致。
Because our impressions of how well each soldier had performed were generally coherent and clear, our formal predictions were just as definite. A single score usually came to mind and we rarely experienced doubts or formed conflicting impressions. We were quite willing to declare, “This one will never make it,” “That fellow is mediocre, but he should do okay,” or “He will be a star.” We felt no need to question our forecasts, moderate them, or equivocate. If challenged, however, we were prepared to admit, “But of course anything could happen.” We were willing to make that admission because, despite our definite impressions about individual candidates, we knew with certainty that our forecasts were largely useless.
因为 ,我们对每个士兵的表现的印象总体上是连贯而清晰的,我们的正式预测也同样是明确的。一个单一的分数通常出现在脑海中,我们很少经历怀疑或形成矛盾的印象。 我们很愿意宣布,"这个人永远不会成功","那个家伙很平庸,但他应该 ,"或 "他将成为一个明星"。我们觉得没有必要质疑我们的预测,缓和它们,或者含糊其辞。然而,如果受到质疑,我们准备承认,"但当然任何事情都可能发生"。我们愿意做出这样的承认,因为尽管我们对个别候选人有明确的印象,但我们肯定地知道,我们的预测基本上是无用的。
The evidence that we could not forecast success accurately was overwhelming. Every few months we had a feedback session in which we learned how the cadets were doing at the officer-training school and could compare our assessments against the opinions of commanders who had been monitoring them for some time. The story was always the same: our ability to predict performance at the school was negligible. Our forecasts were better than blind guesses, but not by much.
证据表明,我们不能 ,不能准确预测成功。每隔几个月,我们就会举行一次反馈会议,了解学员在军官培训学校的表现,并将我们的评估与已经监测了一段时间的指挥官的意见进行比较。情况总是一样的:我们预测学校表现的能力可以忽略不计。我们的预测比盲目猜测好 ,但不会好很多。
We were downcast for a while after receiving the discouraging news. But this was the army. Useful or not, there was a routine to be followed and orders to be obeyed. Another batch of candidates arrived the next day. We took them to the obstacle field, we faced them with the wall, they lifted the log, and within a few minutes we saw their true natures revealed, as clearly as before. The dismal truth about the quality of our predictions had no effect whatsoever on how we evaluated candidates and very little effect on the confidence we felt in our judgments and predictions about individuals.
在收到这个令人沮丧的消息后,我们沮丧了好一阵子。但这是军队。无论是否有用,都有一个需要遵循的程序和需要服从的命令。第二天又来了一批候选人。 我们把他们带到障碍场,我们把他们和墙面对面,他们举起了木头,几分钟内我们看到他们的真实本性 ,和以前一样清晰。关于我们预测质量的令人沮丧的事实对我们如何评价候选人没有任何影响,对我们对个人的判断和预测的信心也影响甚微。
What happened was remarkable. The global evidence of our previous failure should have shaken our confidence in our judgments of the candidates, but it did not. It should also have caused us to moderate our predictions, but it did not. We knew as a general fact that our predictions were little better than random guesses, but we continued to feel and act as if each of our specific predictions was valid. I was reminded of the Müller-Lyer illusion, in which we know the lines are of equal length yet still see them as being different. I was so struck by the analogy that I coined a term for our experience: the illusion of validity .
所发生的事情是了不起的。我们以前失败的全球证据应该动摇我们对候选人的判断的信心,但它没有 。我们知道,作为一个普遍的事实,我们的预测比随机猜测好不了多少,但我们仍然觉得我们的每一个具体预测都是有效的,并付诸行动。我想起了穆勒-莱尔的错觉,在这种错觉中,我们知道线条的长度是相等的,但仍然把它们看成是不同的。 这个比喻给我留下了深刻的印象,我为我们的经验创造了一个术语: 有效性的错觉 。
I had discovered my first cognitive illusion.
我发现了我的第一个认知幻觉。
Decades later, I can see many of the central themes of my thinking—and of this book—in that old story. Our expectations for the soldiers’ future performance were a clear instance of substitution, and of the representativeness heuristic in particular. Having observed one hour of a soldier’s behavior in an artificial situation, we felt we knew how well he would face the challenges of officer training and of leadership in combat. Our predictions were completely nonregressive—we had no reservations about predicting failure or outstanding success from weak evidence. This was a clear instance of WYSIATI. We had compelling impressions of the behavior we observed and no good way to represent our ignorance of the factors that would eventually determine how well the candidate would perform as an officer.
几十年后,我在这个老故事中看到了我思考的许多中心主题--以及这本书的中心主题。我们对士兵未来表现的期望 ,这是一个明显的替代实例,尤其是代表性启发式。在观察了一个士兵在人为情况下的一个 小时的行为后,我们觉得我们知道他将如何面对军官训练和战斗中的领导力的挑战。我们的预测是完全非回归性的--我们对从薄弱的证据中预测失败或突出的成功毫无保留。我们对所观察到的行为有令人信服的印象,但没有 好的方式来表示我们对最终决定候选人作为官员的表现的因素的无知。
Looking back, the most striking part of the story is that our knowledge of the general rule—that we could not predict—had no effect on our confidence in individual cases. I can see now that our reaction was similar to that of Nisbett and Borgida’s students when they were told that most people did not help a stranger suffering a seizure. They certainly believed the statistics they were shown, but the base rates did not influence their judgment of whether an individual they saw on the video would or would not help a stranger. Just as Nisbett and Borgida showed, people are often reluctant to infer the particular from the general.
回过头来看,这个故事最引人注目的部分是,我们对一般规则的了解--我们无法预测--对我们在个别情况下的信心没有影响。 我现在可以看到,我们的反应与尼斯贝特和博吉达的学生相似,当时 ,他们被告知大多数人不会帮助一个遭受癫痫发作的陌生人。他们当然相信他们所看到的统计数据,但基本比率并不影响他们对他们在视频中看到的某个人是否会帮助陌生人的判断。正如Nisbett和Borgida所表明的那样,人们往往不愿意从一般事物中推断出特殊事物。
Subjective confidence in a judgment is not a reasoned evaluation of the probability that this judgment is correct. Confidence is a feeling, which reflects the coherence of the information and the cognitive ease of processing it. It is wise to take admissions of uncertainty seriously, but declarations of high confidence mainly tell you that an individual has constructed a coherent story in his mind, not necessarily that the story is true.
对一个判断的主观信心 ,并不是对这个判断正确的概率的合理评价。信心是一种感觉,它反映了信息的连贯性和处理信息的认知难度。 认真对待不确定性的承认是明智的,但高信心的声明主要告诉你,一个人在他心中构建了一个连贯的故事,不一定 这个故事是真的。
THE ILLUSION OF STOCK-PICKING SKILL
挑选股票的技巧的假象
In 1984, Amos and I and our friend Richard Thaler visited a Wall Street firm. Our host, a senior investment manager, had invited us to discuss the role of judgment biases in investing. I knew so little about finance that I did not even know what to ask him, but I remember one exchange. “When you sell a stock,” I asked, “who buys it?” He answered with a wave in the vague direction of the window, indicating that he expected the buyer to be someone else very much like him. That was odd: What made one person buy and the other sell? What did the sellers think they knew that the buyers did not?
1984年,阿莫斯和我以及我们的朋友理查德-塞勒访问了一家华尔街公司。我们的主人,一位高级投资经理,邀请我们讨论判断偏差在投资中的作用。我对金融知之甚少,甚至不知道该问他什么,但我记得有一次交流。"当你卖出一只股票时,"我问,"谁买的?"他回答说:" ,向窗户的模糊方向挥了挥手,表示他预计买主会是和他非常相似的其他人。这很奇怪:是什么让一个人买,另一个人卖?卖家认为他们知道什么而买家不知道?
Since then, my questions about the stock market have hardened into a larger puzzle: a major industry appears to be built largely on an illusion of skill . Billions of shares are traded every day, with many people buying each stock and others selling it to them. It is not unusual for more than 100 million shares of a single stock to change hands in one day. Most of the buyers and sellers know that they have the same information; they exchange the stocks primarily because they have different opinions. The buyers think the price is too low and likely to rise, while the sellers think the price is high and likely to drop. The puzzle is why buyers and sellers alike think that the current price is wrong. What makes them believe they know more about what the price should be than the market does? For most of them, that belief is an illusion.
从那时起,我对股市的疑问就硬生生地变成了一个更大的难题:一个主要行业似乎主要建立在 技能的幻觉 上。每天都有数十亿股的交易,每只股票都有很多人买入,也有很多人卖给他们。一只股票在一天内换手超过1亿 ,这是很正常的。大多数买家和卖家都知道他们拥有相同的信息;他们交换股票主要是因为他们有不同的意见。买家认为价格太低, 可能会上涨,而卖家认为价格很高,可能会下跌。谜题是为什么买家和卖家都认为目前的价格是错误的。 是什么让他们相信自己比市场更了解价格应该是什么?对他们中的大多数人来说,这种信念是一种幻觉。
In its broad outlines, the standard theory of how the stock market works is accepted by all the participants in the industry. Everybody in the investment business has read Burton Malkiel’s wonderful book A Random Walk Down Wall Street . Malkiel’s central idea is that a stock’s price incorporates all the available knowledge about the value of the company and the best predictions about the future of the stock. If some people believe that the price of a stock will be higher tomorrow, they will buy more of it today. This, in turn, will cause its price to rise. If all assets in a market are correctly priced, no one can expect either to gain or to lose by trading. Perfect prices leave no scope for cleverness, but they also protect fools from their own folly. We now know, however, that the theory is not quite right. Many individual investors lose consistently by trading, an achievement that a dart-throwing chimp could not match. The first demonstration of this startling conclusion was collected by Terry Odean, a finance professor at UC Berkeley who was once my student.
在其大致轮廓中,关于股票市场如何运作的标准理论被该行业的所有参与者所接受 。 投资行业的每个人都读过伯顿-马尔基尔的精彩著作 《华尔街的随机漫步》 。马尔基尔的中心思想是,一只股票的价格包含了所有关于公司价值的现有知识和对股票未来的最佳预测。如果有些人相信某只股票的价格明天会更高,他们今天就会买更多的 。这反过来又会导致其价格上涨。如果一个市场中的所有资产都被正确定价,那么没有人可以期望通过交易获得收益或损失。完美的价格没有给聪明人留下任何余地,但它们也保护傻瓜不受自己的愚蠢行为影响。然而,我们现在知道,这个理论并不完全正确。许多个人投资者通过交易持续亏损,这种成就是投掷飞镖的 黑猩猩无法比拟的。 这个惊人结论 的第一个证明是由加州大学伯克利分校的金融学教授特里-奥德恩收集的,他曾经是我的学生。
Odean began by studying the trading records of 10,000 brokerage accounts of individual investors spanning a seven-year period. He was able to analyze every transaction the investors executed through that firm, nearly 163,000 trades. This rich set of data allowed Odean to identify all instances in which an investor sold some of his holdings in one stock and soon afterward bought another stock. By these actions the investor revealed that he (most of the investors were men) had a definite idea about the future of the two stocks: he expected the stock that he chose to buy to do better than the stock he chose to sell.
Odean首先研究了1万个个人投资者的经纪账户的交易记录,时间跨度为7年。 他能够分析投资者通过该公司执行的每一笔交易,近16.3万笔交易。 这套丰富的数据使奥德恩能够识别出投资者卖出他所持有的某只股票的一部分并在不久后买入另一只股票的所有情况。通过这些行动,投资者透露出他(大多数投资者是男性)对这两只股票的未来有明确的想法:他预计他选择买入的股票会比他选择卖出的股票表现更好。
To determine whether those ideas were well founded, Odean compared the returns of the stock the investor had sold and the stock he had bought in its place, over the course of one year after the transaction. The results were unequivocally bad. On average, the shares that individual traders sold did better than those they bought, by a very substantial margin: 3.2 percentage points per year, above and beyond the significant costs of executing the two trades.
为了确定 ,这些想法是否有根有据,Odean比较了投资者卖出的股票和他买入的股票在交易后一年内的收益。 结果是明确的坏事。平均而言,个人交易者卖出的股票比他们买入的股票表现更好,幅度非常大:每年3.2个百分点,高于并超过 执行这两种交易的巨大成本。
It is important to remember that this is a statement about averages: some individuals did much better, others did much worse. However, it is clear that for the large majority of individual investors, taking a shower and doing nothing would have been a better policy than implementing the ideas that came to their minds. Later research by Odean and his colleague Brad Barber supported this conclusion. In a paper titled “Trading Is Hazardous to Your Wealth,” they showed that, on average, the most active traders had the poorest results, while the investors who traded the least earned the highest returns. In another paper, titled “Boys Will Be Boys,” they showed that men acted on their useless ideas significantly more often than women, and that as a result women achieved better investment results than men.
,重要的是要记住,这是一个关于平均数的声明:有些人做得更好,有些人做得更差。然而,很明显,对于绝大多数个人投资者来说,洗个澡,什么都不做,会比实施他们脑海中的想法要好。后来Odean 和他的同事Brad Barber的研究支持了这一结论。在一篇题为 "交易对你的财富有危害 "的论文中,他们表明,平均而言,最活跃的交易者的结果最差,而交易最少的投资者获得的回报最高。在另一篇题为 "男孩就是男孩 "的论文中,他们表明, 男性对他们的无用想法采取行动 的频率明显高于女性,并且 ,因此女性取得了比男性更好的投资结果。
Of course, there is always someone on the other side of each transaction; in general, these are financial institutions and professional investors, who are ready to take advantage of the mistakes that individual traders make in choosing a stock to sell and another stock to buy. Further research by Barber and Odean has shed light on these mistakes. Individual investors like to lock in their gains by selling “winners,” stocks that have appreciated since they were purchased, and they hang on to their losers. Unfortunately for them, recent winners tend to do better than recent losers in the short run, so individuals sell the wrong stocks. They also buy the wrong stocks. Individual investors predictably flock to companies that draw their attention because they are in the news. Professional investors are more selective in responding to news . These findings provide some justification for the label of “smart money” that finance professionals apply to themselves.
当然,每笔交易的另一方总是有人;一般来说,这些人是金融机构和专业投资者,他们准备利用个人交易者在选择卖出一只股票和购买另一只股票时所犯的错误。Barber和Odean的进一步研究揭示了 ,这些错误。个人投资者喜欢通过 卖出 "赢家",即 购买后升值的股票来锁定收益,而对输家则坚持持有。对他们来说,不幸的是,从短期来看,最近的赢家往往比最近的输家做得更好,所以个人卖错了股票。他们也买错了股票。可以预见的是,个人投资者会涌向那些因为出现在新闻中而吸引他们注意力的公司 。专业投资者 对新闻的反应 更具选择性。这些发现为金融专业人员给自己贴上 "聪明钱 "的标签提供了一定的理由。
Although professionals are able to extract a considerable amount of wealth from amateurs , few stock pickers, if any, have the skill needed to beat the market consistently, year after year. Professional investors, including fund managers, fail a basic test of skill: persistent achievement. The diagnostic for the existence of any skill is the consistency of individual differences in achievement. The logic is simple: if individual differences in any one year are due entirely to luck, the ranking of investors and funds will vary erratically and the year-to-year correlation will be zero. Where there is skill, however, the rankings will be more stable. The persistence of individual differences is the measure by which we confirm the existence of skill among golfers, car salespeople, orthodontists, or speedy toll collectors on the turnpike.
尽管专业人员能够 从业余人员 那里攫取相当多的 财富 ,但很少有选股者(如果有的话)拥有年复一年持续击败市场 。专业投资者,包括基金经理,未能通过对技能的基本测试:持续的成就。任何技能存在的诊断是成就的个体差异的一致性。逻辑很简单:如果任何一年的个体差异完全是由于运气,投资者和基金的排名将无规律地变化,年与年 的相关性将为零。个人差异的持久性是我们确认高尔夫球手、汽车销售员、牙齿矫正师或高速公路上快速收费员之间存在技能的措施。
Mutual funds are run by highly experienced and hardworking professionals who buy and sell stocks to achieve the best possible results for their clients. Nevertheless, the evidence from more than fifty years of research is conclusive: for a large majority of fund managers, the selection of stocks is more like rolling dice than like playing poker. Typically at least two out of every three mutual funds underperform the overall market in any given year.
共同基金是由经验丰富、勤奋工作的专业人士经营的,他们买卖股票是为了给他们的 客户取得最好的结果。然而,50多年来的研究证据是确凿的:对绝大多数基金经理来说,选择股票更像是掷骰子而不是打扑克。通常情况下,每三个共同基金中至少有两个在任何一年的 表现低于整体市场 。
More important, the year-to-year correlation between the outcomes of mutual funds is very small, barely higher than zero. The successful funds in any given year are mostly lucky; they have a good roll of the dice. There is general agreement among researchers that nearly all stock pickers, whether they know it or not—and few of them do—are playing a game of chance. The subjective experience of traders is that they are making sensible educated guesses in a situation of great uncertainty. In highly efficient markets, however, educated guesses are no more accurate than blind guesses.
更重要的是,共同基金的结果 ,年与年之间的相关性非常小,几乎不高于零。在任何一年里,成功的基金大多是幸运的;他们有一个很好的掷骰子的机会。研究人员普遍认为,几乎所有的选股者,不管他们是否知道,而且很少有人知道,都在玩一种机会游戏。交易员的主观体验是,他们在一个 ,在一个有很大不确定性的情况下,做出了明智的有教育意义的猜测。然而,在高度有效的市场中,受过教育的猜测并不比盲目猜测更准确。
Some years ago I had an unusual opportunity to examine the illusion of financial skill up close. I had been invited to speak to a group of investment advisers in a firm that provided financial advice and other services to very wealthy clients. I asked for some data to prepare my presentation and was granted a small treasure: a spreadsheet summarizing the investment outcomes of some twenty-five anonymous wealth advisers, for each of eight consecutive years. Each adviser’s score for each year was his (most of them were men) main determinant of his year-end bonus. It was a simple matter to rank the advisers by their performance in each year and to determine whether there were persistent differences in skill among them and whether the same advisers consistently achieved better returns for their clients year after year.
几年前,我有一个不寻常的机会,近距离地检查财务技能的幻觉。我曾被邀请向一家为非常富有的客户提供财务咨询和其他服务的公司的投资顾问小组发表演讲。我要求提供一些数据,以 准备我的演讲,并得到了一个小宝贝:一个总结了大约二十五个匿名财富顾问连续八年的投资结果的电子表格。 每个顾问每年的得分是他(其中大多数是男性)年终奖金的主要决定因素。按每年的业绩对顾问进行排名,并确定 ,他们之间是否存在 持续的技能差异 ,以及同一顾问是否年复一年地为客户取得更好的回报,这是一个简单的问题。
To answer the question, I computed correlation coefficients between the rankings in each pair of years: year 1 with year 2, year 1 with year 3, and so on up through year 7 with year 8. That yielded 28 correlation coefficients, one for each pair of years. I knew the theory and was prepared to find weak evidence of persistence of skill. Still, I was surprised to find that the average of the 28 correlations was .01. In other words, zero. The consistent correlations that would indicate differences in skill were not to be found. The results resembled what you would expect from a dice-rolling contest, not a game of skill.
为了回答这个问题,我计算了每对年份的排名之间的相关系数:第1年与第2年,第1年与第3年,以此类推,直到第7年与第8年。这产生了28个相关系数,每一对年份都有一个。我知道这个理论,并准备找到技能持久性的微弱证据。不过,我还是惊讶地发现,28个关联的平均数是0.01,换句话说,是零。结果类似于你所期望的掷骰子比赛,而不是技能游戏。
No one in the firm seemed to be aware of the nature of the game that its stock pickers were playing. The advisers themselves felt they were competent professionals doing a serious job, and their superiors agreed. On the evening before the seminar, Richard Thaler and I had dinner with some of the top executives of the firm, the people who decide on the size of bonuses. We asked them to guess the year-to-year correlation in the rankings of individual advisers. They thought they knew what was coming and smiled as they said “not very high” or “performance certainly fluctuates.” It quickly became clear, however, that no one expected the average correlation to be zero.
公司中似乎没有人意识到其选股者所玩的游戏的性质。顾问们自己觉得他们是有能力的 ,是在认真工作的专业人士,他们的上级也同意。在研讨会的前一天晚上,理查德-塞勒和我与公司的一些高层管理人员共进晚餐,他们是决定奖金大小的人。我们让他们猜测个别顾问的排名中每年 的相关性。他们认为自己知道会发生什么,并微笑着说 "不是很高 "或 "业绩肯定会有波动"。然而,很快就可以看出,没有人期望平均相关性为零。
Our message to the executives was that, at least when it came to building portfolios, the firm was rewarding luck as if it were skill. This should have been shocking news to them, but it was not. There was no sign that they disbelieved us. How could they? After all, we had analyzed their own results, and they were sophisticated enough to see the implications, which we politely refrained from spelling out. We all went on calmly with our dinner, and I have no doubt that both our findings and their implications were quickly swept under the rug and that life in the firm went on just as before. The illusion of skill is not only an individual aberration; it is deeply ingrained in the culture of the industry. Facts that challenge such basic assumptions—and thereby threaten people’s livelihood and self-esteem—are simply not absorbed. The mind does not digest them. This is particularly true of statistical studies of performance, which provide base-rate information that people generally ignore when it clashes with their personal impressions from experience.
我们向高管们传达的信息是,至少在建立投资组合方面,该公司在奖励运气,就像奖励技能一样。 这对他们来说应该是一个令人震惊的消息,但事实并非如此。没有迹象表明他们不相信我们,他们怎么可能呢?毕竟,我们已经分析了他们自己的结果,而且他们也足够成熟,能够看出其中的含义,我们礼貌地避免说出这些含义。 我们都平静地继续我们的晚餐,而且我毫不怀疑,我们的发现和它们的含义都很快被扫到了 ,公司的生活也像以前一样继续。对技能的幻想不仅是个人的反常行为;它在行业文化中根深蒂固。 挑战这种基本假设的事实--从而威胁到人们的生计和自尊--根本没有被吸收。心灵无法消化它们。这一点在对表现的统计研究中尤为明显, ,这些研究提供了基础的信息,当这些信息与人们从经验中得到的个人印象发生冲突时,人们通常会忽略这些信息。
The next morning, we reported the findings to the advisers, and their response was equally bland. Their own experience of exercising careful judgment on complex problems was far more compelling to them than an obscure statistical fact. When we were done, one of the executives I had dined with the previous evening drove me to the airport. He told me, with a trace of defensiveness, “I have done very well for the firm and no one can take that away from me.” I smiled and said nothing. But I thought, “Well, I took it away from you this morning. If your success was due mostly to chance, how much credit are you entitled to take for it?”
第二天早上,我们向顾问们报告了这些发现,他们的反应同样平淡无奇。他们自己对复杂问题进行仔细判断的经验远比晦涩难懂的统计事实更让他们信服。 当我们完成后,前一天晚上和我一起吃饭的一位 高管开车送我去机场。他告诉我,带着一丝防御性,"我为公司做得非常好,没有人可以从我这里夺走。"我笑了笑,没有说什么。但我想,"好吧,今天早上我把它从你那里拿走了。如果你的成功主要是由于机会,那么你有多少功劳可以拿?"
WHAT SUPPORTS THE ILLUSIONS OF SKILL AND VALIDITY?
是什么支持了幻觉 的技能和有效性?
Cognitive illusions can be more stubborn than visual illusions. What you learned about the Müller-Lyer illusion did not change the way you see the lines, but it changed your behavior. You now know that you cannot trust your impression of the length of lines that have fins appended to them, and you also know that in the standard Müller-Lyer display you cannot trust what you see. When asked about the length of the lines, you will report your informed belief, not the illusion that you continue to see. In contrast, when my colleagues and I in the army learned that our leadership assessment tests had low validity, we accepted that fact intellectually, but it had no impact on either our feelings or our subsequent actions. The response we encountered in the financial firm was even more extreme. I am convinced that the message that Thaler and I delivered to both the executives and the portfolio managers was instantly put away in a dark corner of memory where it would cause no damage.
认知错觉可能比视觉错觉更顽固。 你所了解的穆勒-莱尔错觉并没有改变你看待线条的方式,但它改变了你的行为。你现在知道,你不能相信 ,你对附加了鳍的线条长度的印象,你也知道在标准的穆勒-莱尔显示中,你不能相信你 。当被问及线条的长度时,你会报告你知情的信念,而不是你继续看到的幻象。相比之下,当我和我在军队的同事得知我们的领导力评估测试的有效性很低时,我们在理智上接受了这个事实,但它对我们的感受和随后的行动都没有影响。我们在金融公司 ,遇到的反应甚至更为极端。我相信,泰勒和我向高管和投资组合经理传达的信息,立即被收进了记忆的黑暗角落,不会造成任何损害。
Why do investors, both amateur and professional, stubbornly believe that they can do better than the market, contrary to an economic theory that most of them accept, and contrary to what they could learn from a dispassionate evaluation of their personal experience? Many of the themes of previous chapters come up again in the explanation of the prevalence and persistence of an illusion of skill in the financial world.
为什么投资者,无论是业余的还是专业的,都固执地认为他们可以比市场做得更好,这与他们大多数人接受的经济理论相反,也与 ,他们可以从个人经验的冷静评估中学到的东西相反?前几章的许多主题在解释金融界普遍存在和持续存在的技能幻觉时再次出现了。
The most potent psychological cause of the illusion is certainly that the people who pick stocks are exercising high-level skills. They consult economic data and forecasts, they examine income statements and balance sheets, they evaluate the quality of top management, and they assess the competition. All this is serious work that requires extensive training, and the people who do it have the immediate (and valid) experience of using these skills. Unfortunately, skill in evaluating the business prospects of a firm is not sufficient for successful stock trading, where the key question is whether the information about the firm is already incorporated in the price of its stock. Traders apparently lack the skill to answer this crucial question, but they appear to be ignorant of their ignorance. As I had discovered from watching cadets on the obstacle field, subjective confidence of traders is a feeling, not a judgment. Our understanding of cognitive ease and associative coherence locates subjective confidence firmly in System 1.
造成这种错觉的最有力的心理原因当然是,挑选股票的人在行使高级技能。他们查阅经济数据 和预测,他们检查利润表和资产负债表,他们评估高层管理人员的质量,他们评估竞争。所有这些都是严肃的工作,需要广泛的培训,而从事这些工作的人都有使用这些技能的直接(和有效)经验。不幸的是,评估公司商业前景的技巧并不足以成功地进行股票 ,其中的关键问题是关于公司的信息是否已经纳入其股票价格中。交易员显然缺乏回答这个关键问题的技巧,但他们似乎对自己的无知一无所知。正如我在障碍赛场上观察学员时发现的那样,交易者的主观信心是一种感觉,而不是一种判断。我们对认知 易和联想一致性的理解将主观信心牢牢地定位在系统1。
Finally, the illusions of validity and skill are supported by a powerful professional culture. We know that people can maintain an unshakable faith in any proposition, however absurd, when they are sustained by a community of like-minded believers. Given the professional culture of the financial community, it is not surprising that large numbers of individuals in that world believe themselves to be among the chosen few who can do what they believe others cannot.
最后,有效性和技能的幻觉得到了强大的专业文化的支持。 我们知道,当人们被一个志同道合的信徒群体所支持时,他们可以对任何命题保持不可动摇的信念,无论多么荒谬。鉴于金融界的职业文化, ,在这个世界上有大量的人认为自己是被选中的少数人之一,可以做他们认为别人做不到的事情,这一点并不奇怪。
THE ILLUSIONS OF PUNDITS
学者的幻觉
The idea that the future is unpredictable is undermined every day by the ease with which the past is explained. As Nassim Taleb pointed out in The Black Swan , our tendency to construct and believe coherent narratives of the past makes it difficult for us to accept the limits of our forecasting ability. Everything makes sense in hindsight, a fact that financial pundits exploit every evening as they offer convincing accounts of the day’s events. And we cannot suppress the powerful intuition that what makes sense in hindsight today was predictable yesterday. The illusion that we understand the past fosters overconfidence in our ability to predict the future.
未来不可预测的观点每天都被过去的解释所破坏。正如纳西姆-塔勒布在 《黑天鹅》 中指出的那样,我们倾向于构建和相信连贯的叙述 ,这使我们难以接受我们预测能力的局限性。一切都是事后诸葛亮,金融专家每天晚上都会利用这一事实,对当天的事件进行令人信服的描述。 我们无法抑制强大的直觉,即今天事后诸葛亮的事情在昨天是可以预测的。 我们了解过去的错觉助长了我们对预测未来能力的过度自信, 。
The often-used image of the “march of history” implies order and direction. Marches, unlike strolls or walks, are not random. We think that we should be able to explain the past by focusing on either large social movements and cultural and technological developments or the intentions and abilities of a few great men. The idea that large historical events are determined by luck is profoundly shocking, although it is demonstrably true. It is hard to think of the history of the twentieth century, including its large social movements, without bringing in the role of Hitler, Stalin, and Mao Zedong. But there was a moment in time, just before an egg was fertilized, when there was a fifty-fifty chance that the embryo that became Hitler could have been a female. Compounding the three events, there was a probability of one-eighth of a twentieth century without any of the three great villains and it is impossible to argue that history would have been roughly the same in their absence. The fertilization of these three eggs had momentous consequences, and it makes a joke of the idea that long-term developments are predictable.
经常使用的 "历史前进 "的形象意味着秩序和方向。 行进,与漫步或散步不同,不是随机的。我们认为,我们应该能够通过关注大型社会运动和文化及技术发展,或者关注少数伟人的意图和能力来解释过去。大型历史事件 ,是由运气决定的,这种想法令人深感震惊,尽管它是明显的事实。如果不把希特勒、斯大林和毛泽东的作用带进来,就很难想到二十世纪的历史,包括其大型社会运动。但是,就在卵子受精之前,有一个时刻,成为希特勒的胚胎有50%的机会可能是一个 的女性。综合这三个事件,二十世纪有八分之一的概率没有这三个大坏蛋中的任何一个,不可能说没有他们,历史就会大致相同。这三个卵子的受精产生了重大的后果,它使长期发展可以预测的想法成为一个笑话。
Yet the illusion of valid prediction remains intact, a fact that is exploited by people whose business is prediction—not only financial experts but pundits in business and politics, too. Television and radio stations and newspapers have their panels of experts whose job it is to comment on the recent past and foretell the future. Viewers and readers have the impression that they are receiving information that is somehow privileged, or at least extremely insightful. And there is no doubt that the pundits and their promoters genuinely believe they are offering such information. Philip Tetlock, a psychologist at the University of Pennsylvania, explored these so-called expert predictions in a landmark twenty-year study, which he published in his 2005 book Expert Political Judgment: How Good Is It? How Can We Know? Tetlock has set the terms for any future discussion of this topic.
然而, 有效预测的假象仍然完好无损,这一事实被那些以预测为业的人所利用--不仅是金融专家,还有商业和政治方面的专家。电视台、广播电台和报纸都有自己的专家小组,他们的工作是对最近的过去进行评论并预测未来。观众和读者的印象是,他们收到的信息在某种程度上是 特权的,或者至少是极富洞察力的。毫无疑问,这些专家和他们的推动者真正相信他们提供的是这样的信息。 宾夕法尼亚大学的心理学家菲利普-特洛克在一项具有里程碑意义的二十年研究中探讨了这些所谓的专家预测,他在2005年出版的《 专家政治判断:它有多好?我们怎么能知道呢? 泰特洛克已经为今后关于这个话题的任何讨论设定了条件。
Tetlock interviewed 284 people who made their living “commenting or offering advice on political and economic trends.” He asked them to assess the probabilities that certain events would occur in the not too distant future, both in areas of the world in which they specialized and in regions about which they had less knowledge. Would Gorbachev be ousted in a coup? Would the United States go to war in the Persian Gulf? Which country would become the next big emerging market? In all, Tetlock gathered more than 80,000 predictions. He also asked the experts how they reached their conclusions, how they reacted when proved wrong, and how they evaluated evidence that did not support their positions. Respondents were asked to rate the probabilities of three alternative outcomes in every case: the persistence of the status quo, more of something such as political freedom or economic growth, or less of that thing.
特洛克 ,采访了284位以 "评论或提供政治和经济趋势建议 "为生的人。他要求他们评估某些事件在不远的将来发生的概率,包括在他们擅长的世界领域和他们不太了解的地区。 戈尔巴乔夫会在政变中被赶下台吗?美国会在波斯湾开战吗?哪个国家会成为下一个大的新兴市场? 泰特洛克总共收集了8万多个预测。 他还询问了专家们是如何得出结论的,当被证明是错误的时候他们是如何反应的,以及他们是如何评估不支持其立场的证据的。受访者被要求对每种情况下的三种替代结果的概率进行评分:维持现状,增加诸如政治自由或经济增长之类的东西,或者减少这种东西。
The results were devastating. The experts performed worse than they would have if they had simply assigned equal probabilities to each of the three potential outcomes. In other words, people who spend their time, and earn their living, studying a particular topic produce poorer predictions than dart-throwing monkeys who would have distributed their choices evenly over the options. Even in the region they knew best, experts were not significantly better than nonspecialists.
其结果是毁灭性的。专家们的表现比他们简单地给三个潜在结果中的每一个分配相同的概率要差。换句话说,那些花时间、 ,并以此为生的人产生的预测比投掷飞镖的猴子要差,因为后者会将他们的选择平均分配到各个选项上。 即使在他们最熟悉的地区,专家们也没有明显好于非专家们。
Those who know more forecast very slightly better than those who know less. But those with the most knowledge are often less reliable. The reason is that the person who acquires more knowledge develops an enhanced illusion of her skill and becomes unrealistically overconfident. “We reach the point of diminishing marginal predictive returns for knowledge disconcertingly quickly,” Tetlock writes. “In this age of academic hyperspecialization , there is no reason for supposing that contributors to top journals—distinguished political scientists, area study specialists, economists, and so on—are any better than journalists or attentive readers of The New York Times in ‘reading’ emerging situations.” The more famous the forecaster, Tetlock discovered, the more flamboyant the forecasts. “Experts in demand,” he writes, “were more overconfident than their colleagues who eked out existences far from the limelight.”
那些知道得多的人比那些知道得少的人预测得非常好。但拥有最多知识的人往往不那么可靠。 原因是获得更多知识的人对自己的技能产生了更大的幻想,变得不切实际地过度自信。"我们很快就达到了知识的边际预测收益递减点,令人不安。 "在这个学术过度专业化的时代 ,没有理由认为顶级期刊的撰稿人--杰出的政治 科学家、地区研究专家、经济学家等等--在'阅读'新形势方面比记者或 《纽约时报》 的细心读者更出色。"泰特洛克发现,越是有名的预测者,预测就越是浮夸。"有需求的专家,"他写道,"比起那些远离风口浪尖勉强生存的同事,他们更加自信。"
Tetlock also found that experts resisted admitting that they had been wrong, and when they were compelled to admit error, they had a large collection of excuses: they had been wrong only in their timing, an unforeseeable event had intervened, or they had been wrong but for the right reasons. Experts are just human in the end. They are dazzled by their own brilliance and hate to be wrong. Experts are led astray not by what they believe, but by how they think, says Tetlock. He uses the terminology from Isaiah Berlin’s essay on Tolstoy, “The Hedgehog and the Fox.” Hedgehogs “know one big thing” and have a theory about the world; they account for particular events within a coherent framework, bristle with impatience toward those who don’t see things their way, and are confident in their forecasts. They are also especially reluctant to admit error. For hedgehogs, a failed prediction is almost always “off only on timing” or “very nearly right.” They are opinionated and clear, which is exactly what television producers love to see on programs. Two hedgehogs on different sides of an issue, each attacking the idiotic ideas of the adversary, make for a good show.
泰特洛克还 ,发现专家们抵制承认他们的错误,当他们被迫承认错误时,他们有一大堆借口:他们只是在时间上出了错,一个不可预见的事件介入了,或者他们错了,但理由是正确的。 专家们最终只是人类。 他们被自己的光辉所迷惑,讨厌错误。 泰特洛克说,专家们被引入歧途 ,不是因为他们相信什么,而是因为 他们如何思考。他使用了以赛亚-伯林关于托尔斯泰的文章 "刺猬与狐狸 "中的术语。刺猬 "知道一件大事",对世界有一套理论;他们在一个连贯的框架内解释特定的事件,对那些不按他们的方式看问题的人感到不耐烦,并对他们的预测充满信心。他们 ,也特别不愿意承认错误。对刺猬来说,一个失败的预测几乎总是 "只在时间上有偏差 "或 "非常接近正确"。他们有主见,思路清晰,这正是电视制作人喜欢在节目中看到的东西。两只刺猬站在一个问题的不同侧面,各自攻击对手的白痴想法,是一场好戏。
Foxes, by contrast, are complex thinkers. They don’t believe that one big thing drives the march of history (for example, they are unlikely to accept the view that Ronald Reagan single-handedly ended the cold war by standing tall against the Soviet Union). Instead the foxes recognize that reality emerges from the interactions of many different agents and forces, including blind luck, often producing large and unpredictable outcomes. It was the foxes who scored best in Tetlock’s study, although their performance was still very poor. But they are less likely than hedgehogs to be invited to participate in television debates.
相比之下,狐狸是复杂的思考者。 他们不相信有一件大事推动了历史的发展(例如,他们不太可能接受罗纳德-里根通过站在高处对抗苏联而单枪匹马地结束了冷战的观点)。相反,狐狸们认识到,现实是由许多不同的代理人和力量的相互作用产生的,包括盲目的运气,往往会产生巨大和不可预测的结果。 在特洛克的研究中,狐狸们得分最高,尽管他们的表现仍然非常糟糕。但他们比刺猬更不可能被邀请参加电视辩论。
IT IS NOT THE EXPERTS’ FAULT—THE WORLD IS DIFFICULT
这不是专家的错,世界很困难
The main point of this chapter is not that people who attempt to predict the future make many errors; that goes without saying. The first lesson is that errors of prediction are inevitable because the world is unpredictable. The second is that high subjective confidence is not to be trusted as an indicator of accuracy (low confidence could be more informative).
本章的主要观点不是说试图预测未来的人犯了很多错误;这一点不言而喻。第一个教训是, 预测的错误是不可避免的,因为世界是不可预测的。第二个教训是,高的主观信心是不可信的,不能作为准确性的指标(低的信心可能更有参考价值)。
Short-term trends can be forecast, and behavior and achievements can be predicted with fair accuracy from previous behaviors and achievements. But we should not expect performance in officer training and in combat to be predictable from behavior on an obstacle field—behavior both on the test and in the real world is determined by many factors that are specific to the particular situation. Remove one highly assertive member from a group of eight candidates and everyone else’s personalities will appear to change. Let a sniper’s bullet move by a few centimeters and the performance of an officer will be transformed. I do not deny the validity of all tests—if a test predicts an important outcome with a validity of .20 or .30, the test should be used. But you should not expect more. You should expect little or nothing from Wall Street stock pickers who hope to be more accurate than the market in predicting the future of prices. And you should not expect much from pundits making long-term forecasts—although they may have valuable insights into the near future. The line that separates the possibly predictable future from the unpredictable distant future is yet to be drawn.
短期趋势可以预测,行为和成就可以从以前的行为和成就中相当准确地预测出来。但是我们不应该期望在军官 训练和战斗中的表现可以从障碍场上的行为中预测出来--在测试和现实世界中的行为都是由许多特定情况下的因素决定的。 从一组八个候选人中除去一个高度自信的成员,其他人的个性都会出现变化。让狙击手的子弹移动几厘米,军官的表现 ,就会发生变化。我不否认所有测试的有效性--如果一个测试预测了一个重要的结果,其有效性为0.20或0.30,那么就应该使用这个测试。 但你不应该期望更多。你应该对华尔街的选股者抱有很少的期望,或者什么都不期望,他们希望在预测未来价格方面比 市场更准确。你也不应该对作出 长期预测的专家抱有很大期望--尽管他们可能对不久的将来有宝贵的洞察力。将可能的可预测的未来与不可预测的遥远的未来分开的界限还没有划定。
SPEAKING OF ILLUSORY SKILL
虚幻的技巧
“He knows that the record indicates that the development of this illness is mostly unpredictable. How can he be so confident in this case? Sounds like an illusion of validity.”
"他知道记录显示,这种疾病的发展大多是不可预测的。 在这种情况下,他怎么能如此自信?听起来像是一种有效性的错觉"。
“She has a coherent story that explains all she knows, and the coherence makes her feel good.”
"她有一个连贯的故事,解释了她所知道的一切,而这种连贯性让她感觉很好。"
“What makes him believe that he is smarter than the market? Is this an illusion of skill?”
"是什么让他相信自己比市场更聪明?这是一种技能的幻觉吗?"
“She is a hedgehog. She has a theory that explains everything, and it gives her the illusion that she understands the world.”
"她是一只刺猬。她有一个可以解释一切的理论,这让她产生了一种错觉,以为自己了解这个世界。"
“The question is not whether these experts are well trained. It is whether their world is predictable.”
"问题不在于这些专家是否训练有素,而在于他们的 世界是否可以预测。"
21
21
Intuitions vs. Formulas
直觉与公式
Paul Meehl was a strange and wonderful character, and one of the most versatile psychologists of the twentieth century. Among the departments in which he had faculty appointments at the University of Minnesota were psychology, law, psychiatry, neurology, and philosophy. He also wrote on religion, political science, and learning in rats. A statistically sophisticated researcher and a fierce critic of empty claims in clinical psychology, Meehl was also a practicing psychoanalyst. He wrote thoughtful essays on the philosophical foundations of psychological research that I almost memorized while I was a graduate student. I never met Meehl, but he was one of my heroes from the time I read his Clinical vs . Statistical Prediction : A Theoretical Analysis and a Review of the Evidence .
保罗-米尔是一个奇怪而奇妙的人物,也是二十世纪最多才多艺的心理学家之一。他在明尼苏达大学 ,在其任教的部门中包括心理学、法律、精神病学、神经学和哲学。米尔是一位统计学上的成熟研究者,也是临床心理学中对空洞主张的激烈批评者,他也是一位执业的精神分析学家。他写了关于心理学研究的哲学基础的深思熟虑的文章,在我还是 研究生的时候,我几乎背下来了。我从未见过Meehl,但从我读到他的《 临床》一书时 起,他就是我的英雄之一。 理论分析和证据回顾 。
In the slim volume that he later called “my disturbing little book,” Meehl reviewed the results of 20 studies that had analyzed whether clinical predictions based on the subjective impressions of trained professionals were more accurate than statistical predictions made by combining a few scores or ratings according to a rule. In a typical study, trained counselors predicted the grades of freshmen at the end of the school year. The counselors interviewed each student for forty-five minutes. They also had access to high school grades, several aptitude tests, and a four-page personal statement. The statistical algorithm used only a fraction of this information: high school grades and one aptitude test. Nevertheless, the formula was more accurate than 11 of the 14 counselors. Meehl reported generally similar results across a variety of other forecast outcomes, including violations of parole, success in pilot training, and criminal recidivism.
在这本后来被他称为 "我令人不安的小书 "的薄薄的书中,他说Meehl回顾了20项研究的结果,这些研究分析了基于训练有素的专业人员的主观印象的 临床预测 ,是否比根据规则将一些分数或评级结合起来的 统计 预测更准确。在一个典型的研究中,训练有素的辅导员在学年结束时预测新生的成绩。 辅导员对每个学生进行了四十五分钟的访谈。他们还可以获得高中成绩、几项能力测试和四页的个人陈述。统计 算法只使用了这些信息的一小部分:高中成绩和一项能力测试。尽管如此,该公式还是比14位咨询师中的11位更准确。Meehl报告说,在其他各种预测结果中, ,包括违反假释、试点培训的成功和刑事再犯,结果大致相似。
Not surprisingly, Meehl’s book provoked shock and disbelief among clinical psychologists, and the controversy it started has engendered a stream of research that is still flowing today, more than fifty years after its publication. The number of studies reporting comparisons of clinical and statistical predictions has increased to roughly two hundred, but the score in the contest between algorithms and humans has not changed. About 60% of the studies have shown significantly better accuracy for the algorithms. The other comparisons scored a draw in accuracy, but a tie is tantamount to a win for the statistical rules, which are normally much less expensive to use than expert judgment. No exception has been convincingly documented.
毫不奇怪,Meehl的书在临床心理学家中引起了震惊和不相信 ,而它所引发的争论也引发了一股研究热潮,在其出版50多年后的今天仍在流传。报告比较临床和统计预测的研究数量已经增加到大约200项,但在算法和人类之间的竞争中,分数并没有改变。大约60%的研究 ,显示算法的准确性明显更好。其他的比较在准确度上取得了平局,但平局就相当于统计规则的胜利,而统计规则的使用成本通常要比专家判断低得多。没有任何例外被令人信服地记录下来。
The range of predicted outcomes has expanded to cover medical variables such as the longevity of cancer patients, the length of hospital stays, the diagnosis of cardiac disease, and the susceptibility of babies to sudden infant death syndrome; economic measures such as the prospects of success for new businesses, the evaluation of credit risks by banks, and the future career satisfaction of workers; questions of interest to government agencies, including assessments of the suitability of foster parents, the odds of recidivism among juvenile offenders, and the likelihood of other forms of violent behavior; and miscellaneous outcomes such as the evaluation of scientific presentations, the winners of football games, and the future prices of Bordeaux wine. Each of these domains entails a significant degree of uncertainty and unpredictability. We describe them as “low-validity environments.” In every case, the accuracy of experts was matched or exceeded by a simple algorithm.
预测结果的范围已经扩大到包括医疗变量,如癌症患者的寿命、 住院时间、心脏疾病的诊断以及婴儿对婴儿猝死综合症的敏感性;经济措施,如新企业的成功前景、银行的信贷风险评估以及工人未来的职业满意度。政府机构感兴趣的问题,包括评估寄养父母的合适性、少年犯重犯的几率 ,以及其他形式的暴力行为的可能性;以及杂项结果,如科学报告的评估、足球比赛的获胜者,以及波尔多葡萄酒的未来价格。这些领域中的每一个都包含了相当程度的不确定性和不可预测性。 我们把它们描述为 "低效力环境"。在每一种情况下, ,专家的准确性都被一个简单的算法所匹配或超过了。
As Meehl pointed out with justified pride thirty years after the publication of his book, “There is no controversy in social science which shows such a large body of qualitatively diverse studies coming out so uniformly in the same direction as this one.”
正如米尔在他的书出版30年后有理由自豪地指出的那样, " 在社会科学中, 没有任何争论 能像这本书一样,显示出如此庞大的、质量不同的研究机构如此一致地朝着同一方向发展"。
The Princeton economist and wine lover Orley Ashenfelter has offered a compelling demonstration of the power of simple statistics to outdo world-renowned experts. Ashenfelter wanted to predict the future value of fine Bordeaux wines from information available in the year they are made. The question is important because fine wines take years to reach their peak quality, and the prices of mature wines from the same vineyard vary dramatically across different vintages; bottles filled only twelve months apart can differ in value by a factor of 10 or more . An ability to forecast future prices is of substantial value, because investors buy wine, like art, in the anticipation that its value will appreciate.
普林斯顿大学的经济学家和葡萄酒爱好者Orley Ashenfelter为 ,令人信服地证明了简单的统计数字能够超越世界知名专家的力量。Ashenfelter想通过波尔多葡萄酒酿造当年的信息来预测其未来价值。这个问题很重要,因为优质葡萄酒需要多年时间才能达到其巅峰品质,同一葡萄园的成熟葡萄酒在不同年份的价格差异很大; ,相隔仅12个月灌装的瓶子 ,价值可能相差 10倍或更多 。预测未来价格的能力具有实质性的价值,因为投资者购买葡萄酒,就像购买艺术品一样,是为了预期其价值会升值。
It is generally agreed that the effect of vintage can be due only to variations in the weather during the grape-growing season. The best wines are produced when the summer is warm and dry, which makes the Bordeaux wine industry a likely beneficiary of global warming. The industry is also helped by wet springs, which increase quantity without much effect on quality. Ashenfelter converted that conventional knowledge into a statistical formula that predicts the price of a wine—for a particular property and at a particular age—by three features of the weather: the average temperature over the summer growing season, the amount of rain at harvest-time, and the total rainfall during the previous winter. His formula provides accurate price forecasts years and even decades into the future. Indeed, his formula forecasts future prices much more accurately than the current prices of young wines do. This new example of a “Meehl pattern” challenges the abilities of the experts whose opinions help shape the early price. It also challenges economic theory, according to which prices should reflect all the available information, including the weather. Ashenfelter’s formula is extremely accurate—the correlation between his predictions and actual prices is above .90.
人们普遍认为,年份的影响只可能是由于葡萄种植季节的天气变化造成的。最好的葡萄酒是在夏天温暖干燥的时候生产的 ,这使得波尔多葡萄酒业可能成为全球变暖的受益者。该行业还得到了湿润的春天的帮助,它增加了数量,但对质量没有太大的影响。 Ashenfelter将这些传统知识转化为一个统计公式,通过天气的三个特征来预测葡萄酒的价格--对于特定的财产和特定的年龄: ,夏季生长季节的平均温度,收获时的雨量,以及前一个冬天的总雨量。他的公式提供了未来几年甚至几十年的准确价格预测。事实上,他的公式对未来价格的预测比目前年轻葡萄酒的价格要准确得多。 这个 "Meehl模式 "的新例子挑战了专家的能力 ,他们的意见有助于形成早期价格。这也是对经济理论的挑战,根据经济理论,价格应该反映所有可用信息,包括天气。Ashenfelter的公式非常准确--他的预测和实际价格之间的相关性在0.90以上。
Why are experts inferior to algorithms? One reason, which Meehl suspected, is that experts try to be clever, think outside the box, and consider complex combinations of features in making their predictions. Complexity may work in the odd case, but more often than not it reduces validity. Simple combinations of features are better. Several studies have shown that human decision makers are inferior to a prediction formula even when they are given the score suggested by the formula! They feel that they can overrule the formula because they have additional information about the case, but they are wrong more often than not. According to Meehl, there are few circumstances under which it is a good idea to substitute judgment for a formula. In a famous thought experiment, he described a formula that predicts whether a particular person will go to the movies tonight and noted that it is proper to disregard the formula if information is received that the individual broke a leg today. The name “broken-leg rule” has stuck. The point, of course, is that broken legs are very rare—as well as decisive.
为什么专家不如算法?Meehl怀疑的一个原因是,专家们 ,试图变得聪明,跳出框框,并在作出预测时考虑复杂的特征组合。 复杂可能在奇特的情况下起作用,但更多的时候会降低有效性。 简单的特征组合会更好。一些研究表明,人类决策者不如预测公式,即使他们得到了公式所建议的分数也是如此。他们 ,觉得他们可以推翻公式,因为他们有关于案件的额外信息,但他们往往是错的。根据Meehl的说法,在少数情况下,用判断来代替公式是一个好主意。在一个著名的思想实验中,他描述了一个预测某个人今晚是否会去看电影的公式,并指出,如果收到该人今天摔断腿的信息,不考虑这个公式是 。"摔断腿规则 "这个名字一直流传至今。当然,问题的关键在于,断腿是非常罕见的,也是决定性的。
Another reason for the inferiority of expert judgment is that humans are incorrigibly inconsistent in making summary judgments of complex information. When asked to evaluate the same information twice, they frequently give different answers. The extent of the inconsistency is often a matter of real concern. Experienced radiologists who evaluate chest X-rays as “normal” or “abnormal” contradict themselves 20% of the time when they see the same picture on separate occasions. A study of 101 independent auditors who were asked to evaluate the reliability of internal corporate audits revealed a similar degree of inconsistency. A review of 41 separate studies of the reliability of judgments made by auditors, pathologists, psychologists, organizational managers, and other professionals suggests that this level of inconsistency is typical, even when a case is reevaluated within a few minutes. Unreliable judgments cannot be valid predictors of anything.
专家判断力低下的另一个原因是,人类在对复杂信息进行总结性判断时不可救药地不一致。当被要求对相同的 信息进行两次评估时,他们经常 ,给出不同的答案。不一致的程度往往是一个真正令人担忧的问题。 有经验的放射科医生 将胸部X射线评价为 "正常 "或 "不正常",当他们在不同场合看到同一张照片时,有20%的时间是自相矛盾的。一项对101名独立审计师的研究,他们被要求评价 企业内部审计 的可靠性, ,显示出类似程度的不一致。对审计师、病理学家、心理学家、组织管理者和其他专业人士所做判断的可靠性的 41项独立研究 的审查表明,这种程度的不一致是典型的,即使是在几分钟内对一个案例进行重新评估。 不可靠的判断不可能成为任何有效的预测因素。
The widespread inconsistency is probably due to the extreme context dependency of System 1. We know from studies of priming that unnoticed stimuli in our environment have a substantial influence on our thoughts and actions. These influences fluctuate from moment to moment. The brief pleasure of a cool breeze on a hot day may make you slightly more positive and optimistic about whatever you are evaluating at the time. The prospects of a convict being granted parole may change significantly during the time that elapses between successive food breaks in the parole judges’ schedule. Because you have little direct knowledge of what goes on in your mind, you will never know that you might have made a different judgment or reached a different decision under very slightly different circumstances. Formulas do not suffer from such problems. Given the same input, they always return the same answer. When predictability is poor—which it is in most of the studies reviewed by Meehl and his followers—inconsistency is destructive of any predictive validity.
广泛的不一致 ,可能是由于系统1的极端环境依赖性。我们从对引物的研究中知道,我们环境中未被注意到的刺激物对我们的思想和行动有很大的影响。 这些影响在不同的时刻是波动的。在炎热的日子里,凉风带来的短暂愉悦可能会让你对当时正在评估的任何事情稍微积极乐观一些。在假释法官的日程安排中,在 连续的食物休息 之间的时间里,罪犯获得假释的 前景可能会发生重大变化。因为你对自己的想法几乎没有直接了解,所以你永远不会知道,在非常细微的不同情况下,你可能会做出不同的判断或达成不同的决定。公式不会受到 ,在相同的输入下,它们总是返回相同的答案。 当可预测性很差时--在Meehl和他的追随者审查的大多数研究中,可预测性很差--不一致对任何预测的有效性都有破坏性。
The research suggests a surprising conclusion: to maximize predictive accuracy, final decisions should be left to formulas, especially in low-validity environments. In admission decisions for medical schools, for example, the final determination is often made by the faculty members who interview the candidate. The evidence is fragmentary, but there are solid grounds for a conjecture: conducting an interview is likely to diminish the accuracy of a selection procedure, if the interviewers also make the final admission decisions. Because interviewers are overconfident in their intuitions, they will assign too much weight to their personal impressions and too little weight to other sources of information, lowering validity . Similarly, the experts who evaluate the quality of immature wine to predict its future have a source of information that almost certainly makes things worse rather than better: they can taste the wine. In addition, of course, even if they have a good understanding of the effects of the weather on wine quality, they will not be able to maintain the consistency of a formula.
该研究提出了一个令人惊讶的结论:为了最大限度地提高预测的准确性,最后的决定应该留给公式,特别是在低效力的环境中。 例如,在医学院的录取决定中,最后的决定往往是由面试候选人的教员作出。证据是零散的,但有确凿的理由进行猜想:如果面试官也做出最后的录取决定,那么进行面试很可能会降低选拔程序的准确性。因为面试官对自己的直觉过于自信 ,他们会对个人印象赋予过多的权重,而对其他信息来源的权重过低, 降低了有效性 。 同样,评估未成熟葡萄酒质量以预测其未来的专家有一个信息来源,几乎肯定会使事情变得更糟而不是更好:他们可以品尝葡萄酒。 当然,此外,即使他们 ,对天气对葡萄酒质量的影响有很好的了解,他们也无法保持配方的一致性。
The most important development in the field since Meehl’s original work is Robyn Dawes’s famous article “The Robust Beauty of Improper Linear Models in Decision Making.” The dominant statistical practice in the social sciences is to assign weights to the different predictors by following an algorithm, called multiple regression, that is now built into conventional software. The logic of multiple regression is unassailable: it finds the optimal formula for putting together a weighted combination of the predictors. However, Dawes observed that the complex statistical algorithm adds little or no value. One can do just as well by selecting a set of scores that have some validity for predicting the outcome and adjusting the values to make them comparable (by using standard scores or ranks). A formula that combines these predictors with equal weights is likely to be just as accurate in predicting new cases as the multiple-regression formula that was optimal in the original sample. More recent research went further: formulas that assign equal weights to all the predictors are often superior, because they are not affected by accidents of sampling .
自Meehl的原创工作以来,该领域最重要的发展是Robyn Dawes的著名文章 "决策中不正确线性模型的稳健之美"。社会科学中的主流统计实践是通过遵循一种称为多元回归的算法为不同的 预测因素分配权重,这种算法现在已经内置于常规软件中。多元回归的逻辑是无可非议的:它找到了将预测因子加权组合的最佳公式。 然而,道斯观察到,复杂的统计算法几乎没有增加任何价值。人们可以通过选择一组对预测结果有一定有效性的分数 ,并调整数值使其具有可比性(通过使用标准分数或等级)。一个将这些预测因素以相等的权重结合起来的公式,在预测新案例时可能与在原始样本中最理想的多元回归公式一样准确。最近的研究更进一步:为所有预测因素分配同等权重的公式 ,往往更有优势,因为它们 不会受到抽样事故的影响 。
The surprising success of equal-weighting schemes has an important practical implication: it is possible to develop useful algorithms without any prior statistical research. Simple equally weighted formulas based on existing statistics or on common sense are often very good predictors of significant outcomes. In a memorable example, Dawes showed that marital stability is well predicted by a formula:
平均加权方案的惊人成功有一个重要的实际意义:有可能在没有任何事先统计研究的情况下开发出有用的算法。基于现有统计数据或常识的简单平均加权公式往往能很好地预测重要的 。在一个令人难忘的例子中,道斯表明,婚姻稳定性可以通过一个公式很好地预测。
frequency of lovemaking minus frequency of quarrels
做爱频率减去争吵频率
You don’t want your result to be a negative number.
你不希望你的结果是一个负数。
The important conclusion from this research is that an algorithm that is constructed on the back of an envelope is often good enough to compete with an optimally weighted formula, and certainly good enough to outdo expert judgment. This logic can be applied in many domains, ranging from the selection of stocks by portfolio managers to the choices of medical treatments by doctors or patients.
这项研究的重要结论是,在信封背面构建的算法往往足以与最佳加权公式竞争,当然 ,足以超越专家判断。这种逻辑可以应用于许多领域,从投资组合经理的股票选择到医生或病人的医疗选择。
A classic application of this approach is a simple algorithm that has saved the lives of hundreds of thousands of infants. Obstetricians had always known that an infant who is not breathing normally within a few minutes of birth is at high risk of brain damage or death. Until the anesthesiologist Virginia Apgar intervened in 1953, physicians and midwives used their clinical judgment to determine whether a baby was in distress. Different practitioners focused on different cues. Some watched for breathing problems while others monitored how soon the baby cried. Without a standardized procedure, danger signs were often missed, and many newborn infants died.
这种方法的一个经典应用是一个简单的算法,它挽救了数十万婴儿的生命。产科医生一直都知道,在出生后几 分钟内没有正常呼吸的婴儿 ,有很大的脑损伤或死亡风险。在麻醉师弗吉尼亚-阿普加于1953年介入之前,医生和助产士用他们的临床判断来确定婴儿是否处于困境。不同的医生关注不同的线索。 一些人观察呼吸问题,而另一些人则监测婴儿的哭声有多快。由于没有一个标准化的程序, ,危险信号往往被错过,许多新生儿死亡。
One day over breakfast, a medical resident asked how Dr. Apgar would make a systematic assessment of a newborn. “That’s easy,” she replied. “You would do it like this.” Apgar jotted down five variables (heart rate, respiration, reflex, muscle tone, and color) and three scores (0, 1, or 2, depending on the robustness of each sign). Realizing that she might have made a breakthrough that any delivery room could implement, Apgar began rating infants by this rule one minute after they were born. A baby with a total score of 8 or above was likely to be pink, squirming, crying, grimacing, with a pulse of 100 or more—in good shape. A baby with a score of 4 or below was probably bluish, flaccid, passive, with a slow or weak pulse—in need of immediate intervention. Applying Apgar’s score, the staff in delivery rooms finally had consistent standards for determining which babies were in trouble, and the formula is credited for an important contribution to reducing infant mortality. The Apgar test is still used every day in every delivery room. Atul Gawande’s recent A Checklist Manifesto provides many other examples of the virtues of checklists and simple rules.
有一天吃早餐时,一位住院医生 问阿普加医生 如何对新生儿进行系统评估。"那很容易,"她回答。"你会这样做。"阿普加记下了五个变量(心率、呼吸、反射、肌肉张力和颜色)和三个分数(0、1或2,取决于每个迹象的稳健性)。 ,意识到她可能已经取得了任何产房都可以实施的突破,阿普加开始在婴儿出生一分钟后按照这个规则进行评级。总分在8分或以上的婴儿可能是粉红色的,蠕动着,哭泣着,面无表情,脉搏在100或以上--状态良好。得分在4分或以下的婴儿可能脸色发青,身体松弛,处于被动状态,脉搏缓慢或微弱-- ,需要立即进行干预。应用阿普加的评分,产房的工作人员终于有了一致的标准来判断哪些婴儿有问题,这个公式被认为是对降低婴儿死亡率的重要贡献。 阿普加测试仍然每天在每个产房里使用。Atul Gawande最近发表的 《清单宣言》 提供了许多其他例子,说明了 清单的优点 和简单规则。
THE HOSTILITY TO ALGORITHMS
对算法的敌意
From the very outset, clinical psychologists responded to Meehl’s ideas with hostility and disbelief. Clearly, they were in the grip of an illusion of skill in terms of their ability to make long-term predictions. On reflection, it is easy to see how the illusion came about and easy to sympathize with the clinicians’ rejection of Meehl’s research.
从一开始,临床心理学家就对Meehl的观点作出了敌意和不相信的反应。 显然,他们在进行长期预测的能力方面陷入了一种技能的幻觉。 经过反思,我们很容易看到这种幻觉是如何产生的,也很容易对临床医生拒绝Meehl的研究表示同情。
The statistical evidence of clinical inferiority contradicts clinicians’ everyday experience of the quality of their judgments. Psychologists who work with patients have many hunches during each therapy session, anticipating how the patient will respond to an intervention, guessing what will happen next. Many of these hunches are confirmed, illustrating the reality of clinical skill.
临床劣势的统计证据与临床医生对其判断质量的日常经验相矛盾。 与病人打交道的心理学家在每次治疗过程中都会有许多预感,预测病人对某项干预措施的反应,猜测接下来会发生什么。 这些预感中有许多得到了证实,说明了临床技能的现实。
The problem is that the correct judgments involve short-term predictions in the context of the therapeutic interview, a skill in which therapists may have years of practice. The tasks at which they fail typically require long-term predictions about the patient’s future. These are much more difficult, even the best formulas do only modestly well, and they are also tasks that the clinicians have never had the opportunity to learn properly—they would have to wait years for feedback, instead of receiving the instantaneous feedback of the clinical session. However, the line between what clinicians can do well and what they cannot do at all well is not obvious, and certainly not obvious to them. They know they are skilled, but they don’t necessarily know the boundaries of their skill. Not surprisingly, then, the idea that a mechanical combination of a few variables could outperform the subtle complexity of human judgment strikes experienced clinicians as obviously wrong.
问题是,正确的判断涉及治疗性访谈背景下的短期预测,治疗师可能有多年的实践经验。他们失败的任务通常需要对病人的未来进行长期预测。这些都要困难得多,即使是最好的公式也只能做得一般,而且这些任务也是临床医生从来没有 机会好好学习的--他们需要等待多年的反馈,而不是接受临床环节的即时反馈。 然而,临床医生能做好的事情和他们完全不能做好的事情之间的界限并不明显,当然对他们也不明显。 他们知道自己有技术,但他们不一定知道自己技术的界限。因此,毫不奇怪, ,认为几个变量的机械组合可以胜过人类判断的微妙复杂性的想法,让有经验的临床医生感到明显错误。
The debate about the virtues of clinical and statistical prediction has always had a moral dimension. The statistical method, Meehl wrote, was criticized by experienced clinicians as “mechanical, atomistic, additive, cut and dried, artificial, unreal, arbitrary, incomplete, dead, pedantic, fractionated, trivial, forced, static, superficial, rigid, sterile, academic, pseudoscientific and blind.” The clinical method, on the other hand, was lauded by its proponents as “dynamic, global, meaningful, holistic, subtle, sympathetic, configural, patterned, organized, rich, deep, genuine, sensitive, sophisticated, real, living, concrete, natural, true to life, and understanding.”
关于临床预测和统计预测的优点的辩论一直有一个道德层面。 Meehl写道,统计方法被有经验的临床医生批评为 "机械的、原子化的、加法的、干脆的、 人工的、不真实的、任意的、不完整的、死的、迂腐的、零碎的、琐碎的、强迫的、静态的、表面的、僵硬的、无菌的、学术的、伪科学的和盲目的。"另一方面,临床方法被其支持者称赞为 "动态的、全球的、有意义的、整体的、微妙的、有同情心的、配置的、模式化的、有组织的、丰富的、深刻的、真正的、敏感的、复杂的、真实的、活生生的、具体的、 自然的、真实的和理解的"。
This is an attitude we can all recognize. When a human competes with a machine, whether it is John Henry a-hammerin’ on the mountain or the chess genius Garry Kasparov facing off against the computer Deep Blue, our sympathies lie with our fellow human. The aversion to algorithms making decisions that affect humans is rooted in the strong preference that many people have for the natural over the synthetic or artificial. Asked whether they would rather eat an organic or a commercially grown apple, most people prefer the “all natural” one. Even after being informed that the two apples taste the same, have identical nutritional value, and are equally healthful, a majority still prefer the organic fruit . Even the producers of beer have found that they can increase sales by putting “All Natural” or “No Preservatives” on the label.
当人类与机器竞争时,无论是约翰-亨利在山上的锤击,还是国际象棋天才加里-卡斯帕罗夫与电脑 "深蓝 "的对决,我们都同情我们的同伴。对算法做出影响人类的决定的厌恶,根源在于 ,许多人对自然的强烈偏爱超过了合成或人工的偏爱。当被问及是愿意吃有机苹果还是商业化种植的苹果时,大多数人更喜欢 "全天然 "的苹果。 即使被告知这两种苹果味道相同,营养价值相同,同样有益健康,大多数人仍然喜欢 有机水果 。 甚至啤酒生产商也发现, ,他们可以通过在标签上标注 "全天然 "或 "无防腐剂 "来提高销量。
The deep resistance to the demystification of expertise is illustrated by the reaction of the European wine community to Ashenfelter’s formula for predicting the price of Bordeaux wines. Ashenfelter’s formula answered a prayer: one might thus have expected that wine lovers everywhere would be grateful to him for demonstrably improving their ability to identify the wines that later would be good. Not so. The response in French wine circles, wrote The New York Times , ranged “somewhere between violent and hysterical.” Ashenfelter reports that one oenophile called his findings “ludicrous and absurd.” Another scoffed, “It is like judging movies without actually seeing them.”
欧洲葡萄酒界对Ashenfelter预测波尔多葡萄酒价格的公式的反应,说明了对专业知识去神秘化的深刻抵制。Ashenfelter的公式回答了一个祈祷:因此,人们可能期望各地的葡萄酒爱好者会感谢他 ,因为他明显地提高了他们识别 ,后来会有好酒的能力。并非如此。 纽约时报 》写道,法国葡萄酒界的反应是 "介于暴力和歇斯底里之间"。Ashenfelter报道说,一位嗜酒者称他的发现是 "可笑的和荒谬的"。另一个人嘲笑说:"这就像没有真正看过电影就对其进行评判一样"。
The prejudice against algorithms is magnified when the decisions are consequential. Meehl remarked, “I do not quite know how to alleviate the horror some clinicians seem to experience when they envisage a treatable case being denied treatment because a ‘blind, mechanical’ equation misclassifies him.” In contrast, Meehl and other proponents of algorithms have argued strongly that it is unethical to rely on intuitive judgments for important decisions if an algorithm is available that will make fewer mistakes. Their rational argument is compelling, but it runs against a stubborn psychological reality: for most people, the cause of a mistake matters. The story of a child dying because an algorithm made a mistake is more poignant than the story of the same tragedy occurring as a result of human error, and the difference in emotional intensity is readily translated into a moral preference.
对算法的偏见 ,当决定是有影响的时候就会被放大。 Meehl说:"当一些临床医生设想一个可以治疗的病例因为一个'盲目的、机械的'方程式而被拒绝治疗时,我不太知道如何减轻他们的恐惧。"与此相反,Meehl和其他算法的支持者强烈认为,如果有一种算法可以减少错误,那么依靠直觉判断 ,进行重要的决策是不道德的。他们的理性论点是令人信服的,但它违背了一个顽固的心理现实:对大多数人来说,错误的原因很重要。一个孩子因为算法出错而死亡的故事比同样的悲剧因人为错误而发生的故事更令人感慨,而情感强度的差异 ,很容易转化为道德上的偏好。
Fortunately, the hostility to algorithms will probably soften as their role in everyday life continues to expand. Looking for books or music we might enjoy, we appreciate recommendations generated by software. We take it for granted that decisions about credit limits are made without the direct intervention of any human judgment. We are increasingly exposed to guidelines that have the form of simple algorithms, such as the ratio of good and bad cholesterol levels we should strive to attain. The public is now well aware that formulas may do better than humans in some critical decisions in the world of sports: how much a professional team should pay for particular rookie players, or when to punt on fourth down. The expanding list of tasks that are assigned to algorithms should eventually reduce the discomfort that most people feel when they first encounter the pattern of results that Meehl described in his disturbing little book.
幸运的是,随着算法在日常生活中的作用不断扩大,对算法的敌意可能会减弱。在寻找我们可能喜欢的书籍或音乐时,我们欣赏由软件产生的推荐。我们认为理所当然的是,关于信用额度的决定是在没有任何人类判断的直接干预下做出的。 我们越来越多地接触到具有简单算法形式的指南,例如我们应该努力达到的好坏胆固醇水平的比例。公众现在很清楚,在体育界的一些关键决策中,公式可能比人做得更好:一支职业球队应该为特定的新秀球员支付多少钱,或者什么时候在第四次进攻时弃权。不断扩大的 ,分配给算法的任务清单应该最终减少大多数人在第一次遇到Meehl在其令人不安的小书中描述的结果模式时的不适感。
LEARNING FROM MEEHL
向Meehl学习
In 1955, as a twenty-one-year-old lieutenant in the Israeli Defense Forces, I was assigned to set up an interview system for the entire army. If you wonder why such a responsibility would be forced upon someone so young, bear in mind that the state of Israel itself was only seven years old at the time; all its institutions were under construction, and someone had to build them. Odd as it sounds today, my bachelor’s degree in psychology probably qualified me as the best-trained psychologist in the army. My direct supervisor, a brilliant researcher, had a degree in chemistry.
1955年,作为以色列国防军的一名21岁的中尉,我被指派为整个军队建立一个面试系统。如果你想知道为什么这样一个 的责任会强加在一个如此年轻的人身上,请记住,当时以色列国本身只有七年的历史;它的所有机构都在建设中,必须有人来建设它们。奇怪的是 ,尽管今天听起来很奇怪,我的心理学学士学位可能使我有资格成为军队中受过最好训练的心理学家。我的直接主管是一位出色的研究员,他的学位 ,是化学专业的。
An interview routine was already in place when I was given my mission. Every soldier drafted into the army completed a battery of psychometric tests, and each man considered for combat duty was interviewed for an assessment of personality. The goal was to assign the recruit a score of general fitness for combat and to find the best match of his personality among various branches: infantry, artillery, armor, and so on. The interviewers were themselves young draftees, selected for this assignment by virtue of their high intelligence and interest in dealing with people. Most were women, who were at the time exempt from combat duty. Trained for a few weeks in how to conduct a fifteen-to twenty-minute interview, they were encouraged to cover a range of topics and to form a general impression of how well the recruit would do in the army.
当我接到任务时,面试程序已经到位。 每个应征入伍的士兵都要完成一系列的心理测试,每个被考虑参加战斗任务的人都要接受面试,以评估其性格。 目的是给新兵分配一个总体上适合战斗的分数,并在各个部门中找到与他性格最匹配的人: 步兵、炮兵、装甲兵,等等。采访者本身就是年轻的被征召者,他们凭借高智商和与人打交道的兴趣被选为这项任务。 大多数是女性,她们在当时被免除战斗任务。他们接受了几个星期的培训,学习如何进行15到20分钟的面试,鼓励他们涵盖一系列的话题,并对新兵在军队中的表现形成一个总体 。
Unfortunately, follow-up evaluations had already indicated that this interview procedure was almost useless for predicting the future success of recruits. I was instructed to design an interview that would be more useful but would not take more time. I was also told to try out the new interview and to evaluate its accuracy. From the perspective of a serious professional, I was no more qualified for the task than I was to build a bridge across the Amazon.
不幸的是,后续评估已经表明,这种面试程序对于预测新兵未来的成功几乎毫无用处。 我被指示设计一种更有用但不会花费更多时间的面试。 我还被告知要尝试新的面试并评估其准确性。从一个严肃的专业人士的角度来看 ,我没有资格完成这项任务,就像我在亚马逊河上建造一座桥一样。
Fortunately, I had read Paul Meehl’s “little book,” which had appeared just a year earlier. I was convinced by his argument that simple, statistical rules are superior to intuitive “clinical” judgments. I concluded that the then current interview had failed at least in part because it allowed the interviewers to do what they found most interesting, which was to learn about the dynamics of the interviewee’s mental life. Instead, we should use the limited time at our disposal to obtain as much specific information as possible about the interviewee’s life in his normal environment. Another lesson I learned from Meehl was that we should abandon the procedure in which the interviewers’ global evaluations of the recruit determined the final decision. Meehl’s book suggested that such evaluations should not be trusted and that statistical summaries of separately evaluated attributes would achieve higher validity.
幸运的是,我读过保罗-米勒的 "小书",这本书在一年前刚刚出版。我的结论是,当时的访谈至少有一部分是失败的,因为 ,它允许访谈者做他们认为最有趣的事情,也就是了解被访谈者的精神生活动态。相反,我们应该利用我们所掌握的有限时间,尽可能多地获得关于被访谈者在正常环境中生活的具体信息。我从Meehl那里学到的另一个教训是,我们应该放弃由面试官对招聘人员的 总体评价来决定最终决定的程序。Meehl的书中建议,不应该相信这种评价,对单独评价的属性进行统计汇总会取得更高的有效性。
I decided on a procedure in which the interviewers would evaluate several relevant personality traits and score each separately. The final score of fitness for combat duty would be computed according to a standard formula, with no further input from the interviewers. I made up a list of six characteristics that appeared relevant to performance in a combat unit, including “responsibility,” “sociability,” and “masculine pride.” I then composed, for each trait, a series of factual questions about the individual’s life before his enlistment, including the number of different jobs he had held, how regular and punctual he had been in his work or studies, the frequency of his interactions with friends, and his interest and participation in sports, among others. The idea was to evaluate as objectively as possible how well the recruit had done on each dimension.
我决定采用一个程序,由面试官评估几个相关的性格特征,并分别打分。最终的适合战斗的分数 ,将根据一个标准的公式计算出来,面试官无需进一步输入。我编了一个清单,列出了六个似乎与战斗单位的表现有关的特征 ,包括 "责任感"、"交际能力 "和 "男性的自豪感"。然后,我为每个特质编写了一系列关于个人入伍前生活的事实问题,包括他所从事的不同工作的数量 ,他在工作或学习中的规律性和准时性如何,他与朋友交往的频率,以及他对体育的兴趣和参与情况等等。我们的想法是尽可能客观地评估新兵在每个方面的表现如何。
By focusing on standardized, factual questions, I hoped to combat the halo effect, where favorable first impressions influence later judgments. As a further precaution against halos, I instructed the interviewers to go through the six traits in a fixed sequence, rating each trait on a five-point scale before going on to the next. And that was that. I informed the interviewers that they need not concern themselves with the recruit’s future adjustment to the military. Their only task was to elicit relevant facts about his past and to use that information to score each personality dimension. “Your function is to provide reliable measurements,” I told them. “Leave the predictive validity to me,” by which I meant the formula that I was going to devise to combine their specific ratings.
通过专注于标准化的事实性问题,我希望能消除光环效应,即有利的 第一印象会影响后来的判断。作为对光环的进一步预防,我指示面试官按照固定的顺序浏览六个特征,在进入下一个问题之前对每个特征进行五点评分。 就这样,我告诉面试官,他们不需要关心新兵未来对军队的适应。他们唯一的任务是 ,了解他过去的相关事实,并利用这些信息对每个人格维度进行评分。"你们的职能是提供可靠的测量结果。"我告诉他们。"把预测有效性留给我,"我指的是我要设计的公式,以结合他们的具体评级。
The interviewers came close to mutiny. These bright young people were displeased to be ordered, by someone hardly older than themselves, to switch off their intuition and focus entirely on boring factual questions. One of them complained, “You are turning us into robots!” So I compromised. “Carry out the interview exactly as instructed,” I told them, “and when you are done, have your wish: close your eyes, try to imagine the recruit as a soldier, and assign him a score on a scale of 1 to 5.”
面试官们几乎要叛变了。 这些聪明的年轻人不高兴的是,被一个几乎不比他们大的人命令关闭他们的直觉,完全专注于无聊的事实问题。 其中一个人抱怨说:"你正在把我们变成机器人!"所以我妥协了。"完全按照指示进行采访。"我告诉他们,"当你们完成后,请实现你们的愿望:闭上眼睛,试着把新兵想象成一个士兵,并在1到5的范围内给他打分。"
Several hundred interviews were conducted by this new method, and a few months later we collected evaluations of the soldiers’ performance from the commanding officers of the units to which they had been assigned. The results made us happy. As Meehl’s book had suggested, the new interview procedure was a substantial improvement over the old one. The sum of our six ratings predicted soldiers’ performance much more accurately than the global evaluations of the previous interviewing method, although far from perfectly. We had progressed from “completely useless” to “moderately useful.”
用这种新方法进行了几百次访谈,几个月后,我们从这些士兵被分配到的单位的指挥官那里收集了对他们表现的评价。 结果让我们很高兴。正如Meehl的书中所说,新的面试程序比旧的面试程序有了很大的改进。我们的六个评级的总和预测了士兵的表现 ,比以前的面试方法的总体评价要准确得多,尽管远非完美。我们已经从 "完全无用 "发展到 "适度有用"。
The big surprise to me was that the intuitive judgment that the interviewers summoned up in the “close your eyes” exercise also did very well, indeed just as well as the sum of the six specific ratings. I learned from this finding a lesson that I have never forgotten: intuition adds value even in the justly derided selection interview, but only after a disciplined collection of objective information and disciplined scoring of separate traits. I set a formula that gave the “close your eyes” evaluation the same weight as the sum of the six trait ratings. A more general lesson that I learned from this episode was do not simply trust intuitive judgment—your own or that of others—but do not dismiss it, either.
让我大吃一惊的是,面试官在 "闭上眼睛 "的练习中唤起的直觉判断也做得非常好,确实与六个具体评分的总和一样好。 我从这一发现中学到了一个我从未忘记的教训:即使在 被公正地嘲笑的选拔面试中,直觉也会增加价值,但只有在有规律地收集客观信息和有规律地对单独的特质进行打分之后才行。 我设定了一个公式,给 "闭上眼睛 "的评价与六个特质评分之和以相同的权重。 我从这一事件中学到的一个更普遍的教训是,不要简单地相信直觉的判断--你自己的或别人的--但也不要否定它。
Some forty-five years later, after I won a Nobel Prize in economics, I was for a short time a minor celebrity in Israel. On one of my visits, someone had the idea of escorting me around my old army base, which still housed the unit that interviews new recruits. I was introduced to the commanding officer of the Psychological Unit, and she described their current interviewing practices, which had not changed much from the system I had designed; there was, it turned out, a considerable amount of research indicating that the interviews still worked well. As she came to the end of her description of how the interviews are conducted, the officer added, “And then we tell them, ‘Close your eyes.’”
大约四十五年后,在我获得诺贝尔经济学奖之后,我在以色列曾一度成为小有名气的人。在我的一次访问中,有人出主意护送我参观我以前的军队基地,那里仍然有面试新兵的单位。我被介绍给心理股的指挥官 ,她描述了他们目前的面试做法,与我设计的系统相比没有什么变化;事实证明,有相当多的研究表明,面试仍然很有效。当她对如何进行面谈的描述接近尾声时,这位警官补充说:"然后我们告诉他们,'闭上你的眼睛'"。
DO IT YOURSELF
自己动手
The message of this chapter is readily applicable to tasks other than making manpower decisions for an army. Implementing interview procedures in the spirit of Meehl and Dawes requires relatively little effort but substantial discipline. Suppose that you need to hire a sales representative for your firm. If you are serious about hiring the best possible person for the job, this is what you should do. First, select a few traits that are prerequisites for success in this position (technical proficiency, engaging personality, reliability, and so on). Don’t overdo it—six dimensions is a good number. The traits you choose should be as independent as possible from each other, and you should feel that you can assess them reliably by asking a few factual questions. Next, make a list of those questions for each trait and think about how you will score it, say on a 1–5 scale. You should have an idea of what you will call “very weak” or “very strong.”
本章的信息很容易适用于为军队做人力决策以外的任务。本着Meehl和Dawes的精神实施访谈程序需要相对较少的努力,但需要大量的纪律。假设你需要为你的公司雇用一名销售代表。如果你是认真的,想为这个工作雇用最好的人,这个 ,就是你应该做的。首先,选择一些在这个职位上成功的先决条件的特征(技术能力、吸引人的个性、可靠性,等等)。 不要过分追求,六个维度是一个好数字。你选择的特质应该尽可能地相互独立,而且你应该觉得你可以通过问一些事实性的问题来可靠地评估它们。接下来,将每个特质的这些问题列成一个 ,并思考你将如何打分,比如用1-5分来打分。你应该有一个想法,你将把什么称为 "非常弱 "或 "非常强"。
These preparations should take you half an hour or so, a small investment that can make a significant difference in the quality of the people you hire. To avoid halo effects, you must collect the information on one trait at a time, scoring each before you move on to the next one. Do not skip around. To evaluate each candidate, add up the six scores. Because you are in charge of the final decision, you should not do a “close your eyes.” Firmly resolve that you will hire the candidate whose final score is the highest, even if there is another one whom you like better—try to resist your wish to invent broken legs to change the ranking. A vast amount of research offers a promise: you are much more likely to find the best candidate if you use this procedure than if you do what people normally do in such situations, which is to go into the interview unprepared and to make choices by an overall intuitive judgment such as “I looked into his eyes and liked what I saw.”
这些准备工作应该花费你半个小时左右的时间,这种小的投资可以使你雇用的人的质量有很大的不同。为了避免光环效应,你必须一次收集一个特质的信息 ,在你进入下一个特质之前,对每一个特质进行评分。不要跳来跳去。 要评估每个候选人,把六个人的分数加起来。因为你负责最后的决定,你不应该做 "闭目养神"。坚定地下定决心,你将雇用最终得分最高的候选人,甚至 ,如果有另一个你更喜欢的人--尽量克制你想发明断腿 ,以改变排名的愿望。大量的研究提供了一个承诺:如果你使用这个程序,你更有可能找到最好的候选人,而不是像人们在这种情况下通常所做的那样,即毫无准备地进入面试,并通过整体的直觉判断做出选择,例如 "我看着他的眼睛,喜欢我看到的东西"。
SPEAKING OF JUDGES VS. FORMULAS
说到法官与公式的关系
“Whenever we can replace human judgment by a formula, we should at least consider it.”
"每当 ,我们可以用一个公式来取代人类的判断,我们至少应该考虑一下。"
“He thinks his judgments are complex and subtle, but a simple combination of scores could probably do better.”
"他认为他的判断是复杂而微妙的,但简单的分数组合可能可以做得更好。"
“Let’s decide in advance what weight to give to the data we have on the candidates’ past performance. Otherwise we will give too much weight to our impression from the interviews.”
"让我们事先决定对我们所掌握的关于候选人过去表现的数据给予何种权重。否则,我们就会过于重视我们从采访中得到的印象"。
22
22
Expert Intuition: When Can We Trust It?
专家的直觉:我们何时能相信它?
Professional controversies bring out the worst in academics. Scientific journals occasionally publish exchanges, often beginning with someone’s critique of another’s research, followed by a reply and a rejoinder. I have always thought that these exchanges are a waste of time. Especially when the original critique is sharply worded, the reply and the rejoinder are often exercises in what I have called sarcasm for beginners and advanced sarcasm. The replies rarely concede anything to a biting critique, and it is almost unheard of for a rejoinder to admit that the original critique was misguided or erroneous in any way. On a few occasions I have responded to criticisms that I thought were grossly misleading, because a failure to respond can be interpreted as conceding error, but I have never found the hostile exchanges instructive. In search of another way to deal with disagreements, I have engaged in a few “adversarial collaborations,” in which scholars who disagree on the science agree to write a jointly authored paper on their differences, and sometimes conduct research together. In especially tense situations, the research is moderated by an arbiter .
专业的争议带来了学术界最糟糕的情况。科学期刊偶尔会发表交流文章,通常以某人对另一个人的研究的批评开始,然后是 ,进行回复和反驳。特别是当最初的批评措辞尖锐时,答复和反驳往往是我所说的对初学者的讽刺和高级讽刺的练习。 答复很少对尖锐的批评做出任何让步,而且几乎闻所未闻的是,反驳者承认最初的批评是 误导或有任何错误的地方。有几次,我对那些我认为严重误导的批评做出了回应,因为不回应会被解释为承认错误,但我从来没有发现那些敌对的交流是有指导意义的。 为了寻找另一种处理分歧的方式,我参与了一些 "对抗性合作",在这些合作中,对科学有分歧的学者 ,同意就他们的分歧写一篇共同撰写的论文,有时还一起开展研究。在特别紧张的情况下,研究由 一名仲裁员主持 。
My most satisfying and productive adversarial collaboration was with Gary Klein, the intellectual leader of an association of scholars and practitioners who do not like the kind of work I do. They call themselves students of Naturalistic Decision Making, or NDM, and mostly work in organizations where they often study how experts work. The NDMers adamantly reject the focus on biases in the heuristics and biases approach. They criticize this model as overly concerned with failures and driven by artificial experiments rather than by the study of real people doing things that matter. They are deeply skeptical about the value of using rigid algorithms to replace human judgment, and Paul Meehl is not among their heroes. Gary Klein has eloquently articulated this position over many years.
我最满意和最有成效的对抗性合作是与加里-克莱因(Gary Klein)的合作,他是一个不喜欢我所做工作的学者和从业者协会的知识领袖。他们称自己为自然决策的学生 ,或称NDM,大多在组织中工作,他们经常研究专家如何工作。NDM人士坚决反对启发式和偏见方法中对偏见的关注。 他们批评这种模式过分关注失败,并由人工 实验驱动,而不是通过研究真实的人做重要的事情。他们对 ,用僵化的算法来取代人类的判断的价值深表怀疑,而保罗-米尔并不在他们的英雄之列。加里-克莱因多年来雄辩地 阐述了这一立场 。
This is hardly the basis for a beautiful friendship, but there is more to the story. I had never believed that intuition is always misguided. I had also been a fan of Klein’s studies of expertise in firefighters since I first saw a draft of a paper he wrote in the 1970s, and was impressed by his book Sources of Power , much of which analyzes how experienced professionals develop intuitive skills. I invited him to join in an effort to map the boundary that separates the marvels of intuition from its flaws. He was intrigued by the idea and we went ahead with the project—with no certainty that it would succeed. We set out to answer a specific question: When can you trust an experienced professional who claims to have an intuition? It was obvious that Klein would be more disposed to be trusting, and I would be more skeptical. But could we agree on principles for answering the general question?
这很难成为一段美好友谊的基础,但故事还有更多的内容。 我从未相信直觉总是被误导。自从我第一次 ,看到他在1970年代写的一篇论文的草稿,我也一直是克莱因对消防员专业知识的研究的粉丝,他的《 力量之源 》一书给我留下了深刻的印象,其中大部分内容分析了经验丰富的专业人士如何发展直觉技能。我邀请他加入一个努力,绘制出将直觉的奇迹与缺陷分开的边界。我们设置了 ,以回答一个具体的问题:你什么时候可以相信一个声称有直觉的有经验的专业人士? 很明显,克莱因会更倾向于信任,而我则会更怀疑。但我们能否就回答一般问题的原则达成一致?
Over seven or eight years we had many discussions, resolved many disagreements, almost blew up more than once, wrote many drafts, became friends, and eventually published a joint article with a title that tells the story: “Conditions for Intuitive Expertise: A Failure to Disagree.” Indeed, we did not encounter real issues on which we disagreed—but we did not really agree.
在七八年的时间里,我们进行了许多讨论,解决了许多分歧,不止一次地差点炸开了锅,写了 许多草稿,成为了朋友,并最终联合发表了一篇文章,标题可以说明问题:"直觉专长的条件:分歧的失败"。事实上,我们没有遇到真正的问题,我们在这些问题上有分歧,但我们并没有真正达成一致。
MARVELS AND FLAWS
奇迹和缺陷
Malcolm Gladwell’s bestseller Blink appeared while Klein and I were working on the project, and it was reassuring to find ourselves in agreement about it. Gladwell’s book opens with the memorable story of art experts faced with an object that is described as a magnificent example of a kouros , a sculpture of a striding boy. Several of the experts had strong visceral reactions: they felt in their gut that the statue was a fake but were not able to articulate what it was about it that made them uneasy. Everyone who read the book—millions did—remembers that story as a triumph of intuition. The experts agreed that they knew the sculpture was a fake without knowing how they knew—the very definition of intuition. The story appears to imply that a systematic search for the cue that guided the experts would have failed, but Klein and I both rejected that conclusion. From our point of view, such an inquiry was needed, and if it had been conducted properly (which Klein knows how to do), it would probably have succeeded.
Malcolm Gladwell的畅销书 《Blink》 在Klein和我从事这个项目的时候出现了,发现 ,我们对它的看法一致,这让我们感到欣慰。 Gladwell的书以一个令人难忘的故事开篇,艺术专家面对一个被描述为 Kouros 的宏伟例子的物体,一个大步流星的男孩的雕塑。有几位专家有强烈的内脏反应:他们从直觉上感觉到那座雕像是假的,但却无法说清楚是什么让他们感到不安。每个人都 ,数百万人都读过这本书,他们记得这个故事是直觉的胜利。 专家们一致认为,他们知道那座雕塑是假的,但却不知道他们是如何知道的--这正是直觉的定义。这个故事似乎暗示,系统地寻找引导专家的线索会失败,但克莱因和我都拒绝了这个结论。从我们的观点来看,这样的调查是需要的, ,如果调查进行得当(克莱因知道如何做),它可能会成功。
Although many readers of the kouros example were surely drawn to an almost magical view of expert intuition, Gladwell himself does not hold that position. In a later chapter he describes a massive failure of intuition: Americans elected President Harding, whose only qualification for the position was that he perfectly looked the part. Square jawed and tall, he was the perfect image of a strong and decisive leader. People voted for someone who looked strong and decisive without any other reason to believe that he was. An intuitive prediction of how Harding would perform as president arose from substituting one question for another. A reader of this book should expect such an intuition to be held with confidence.
虽然 ,许多读者对库罗斯的例子肯定会被一种几乎神奇的专家直觉观点所吸引,但格拉德威尔本人并不持这种立场。在后面一章中,他描述了直觉的巨大失败:美国人选出了哈定总统,他担任这个职位的唯一资格 ,就是他看起来非常合适。方下巴和高个子,他是一个强大和果断的领导者的完美形象。通过用一个问题代替另一个问题,对哈丁作为总统的表现有了一个直观的预测。本书的读者应该期待这样的直觉 ,并自信地持有。
INTUITION AS RECOGNITION
作为识别的直觉
The early experiences that shaped Klein’s views of intuition were starkly different from mine. My thinking was formed by observing the illusion of validity in myself and by reading Paul Meehl’s demonstrations of the inferiority of clinical prediction. In contrast, Klein’s views were shaped by his early studies of fireground commanders (the leaders of firefighting teams). He followed them as they fought fires and later interviewed the leader about his thoughts as he made decisions. As Klein described it in our joint article, he and his collaborators
塑造克莱因对直觉的看法的早期经历与我截然不同。我的想法是通过观察自己身上的有效性错觉和阅读Paul Meehl关于临床预测劣势的证明而形成的。 相比之下,Klein的观点是通过他早期对火场指挥官(消防队的领导 )的研究而形成的。他在他们灭火的时候跟踪他们,后来采访了领导,了解他在做决定时的想法。 正如Klein在我们的联合文章中描述的,他和他的合作者
investigated how the commanders could make good decisions without comparing options. The initial hypothesis was that commanders would restrict their analysis to only a pair of options, but that hypothesis proved to be incorrect. In fact, the commanders usually generated only a single option, and that was all they needed. They could draw on the repertoire of patterns that they had compiled during more than a decade of both real and virtual experience to identify a plausible option, which they considered first. They evaluated this option by mentally simulating it to see if it would work in the situation they were facing …. If the course of action they were considering seemed appropriate, they would implement it. If it had shortcomings, they would modify it. If they could not easily modify it, they would turn to the next most plausible option and run through the same procedure until an acceptable course of action was found.
最初的假设是,指挥官会将他们的分析限制在一对选项上, ,但这一假设被证明是不正确的。事实上,指挥官通常只产生一个选项,这就是他们所需要的。 他们可以利用他们在十多年的真实和虚拟经验中汇编的模式来确定一个合理的选项,他们首先考虑这个选项。他们通过精神上的模拟来评估这个方案,看它 ,在他们面临的情况下是否可行....。如果他们考虑的行动方案似乎是适当的,他们就会实施。 如果它有缺点,他们就会修改它。如果他们不能轻易修改,他们会转向下一个最合理的选择,并通过同样的程序,直到找到一个可接受的行动方案。
Klein elaborated this description into a theory of decision making that he called the recognition-primed decision (RPD) model, which applies to firefighters but also describes expertise in other domains, including chess. The process involves both System 1 and System 2. In the first phase, a tentative plan comes to mind by an automatic function of associative memory—System 1. The next phase is a deliberate process in which the plan is mentally simulated to check if it will work—an operation of System 2. The model of intuitive decision making as pattern recognition develops ideas presented some time ago by Herbert Simon, perhaps the only scholar who is recognized and admired as a hero and founding figure by all the competing clans and tribes in the study of decision making. I quoted Herbert Simon’s definition of intuition in the introduction, but it will make more sense when I repeat it now: “The situation has provided a cue; this cue has given the expert access to information stored in memory, and the information provides the answer. Intuition is nothing more and nothing less than recognition .”
克莱因将这一描述阐述为一个 决策理论,他称之为识别激励决策(RPD)模型,它适用于消防员,但也描述了其他领域的专业知识,包括国际象棋。 过程涉及系统1和系统2。在第一阶段,通过联想记忆的自动功能--系统1,一个暂定的计划出现在脑海中。下一阶段是一个深思熟虑的过程,在这个过程中,计划被 ,在心理上进行模拟,以检查它是否能成功--系统2的操作。直觉决策作为模式识别的模型发展了赫伯特-西蒙(Herbert Simon)一段时间前提出的观点,他也许是唯一被决策研究中所有竞争的部族和部落认可和 钦佩的英雄 和创始人的学者。我在 介绍中引用了赫伯特-西蒙对直觉的定义,但我现在重复一下会更有意义:"情况已经提供了一个线索;这个线索让专家获得了储存在记忆中的信息,而信息提供了答案。 直觉不外乎是 识别 ,也 不外乎是识别 。"
This strong statement reduces the apparent magic of intuition to the everyday experience of memory. We marvel at the story of the firefighter who has a sudden urge to escape a burning house just before it collapses, because the firefighter knows the danger intuitively, “without knowing how he knows.” However, we also do not know how we immediately know that a person we see as we enter a room is our friend Peter. The moral of Simon’s remark is that the mystery of knowing without knowing is not a distinctive feature of intuition; it is the norm of mental life.
这种强烈的陈述将直觉的明显魔力降低到记忆的日常经验中。我们惊叹于消防员的故事 ,他在燃烧的房屋倒塌前突然有了逃离的冲动,因为消防员凭直觉知道危险, "却不知道自己是如何知道的" 。然而,我们也不知道我们是如何立即知道我们进入一个房间时看到的人是我们的朋友彼得。西蒙这番话的寓意是,不知道的神秘感不是直觉的独特特征 ;它是精神生活的规范。
ACQUIRING SKILL
获得的技能
How does the information that supports intuition get “stored in memory”? Certain types of intuitions are acquired very quickly. We have inherited from our ancestors a great facility to learn when to be afraid. Indeed, one experience is often sufficient to establish a long-term aversion and fear. Many of us have the visceral memory of a single dubious dish that still leaves us vaguely reluctant to return to a restaurant. All of us tense up when we approach a spot in which an unpleasant event occurred, even when there is no reason to expect it to happen again. For me, one such place is the ramp leading to the San Francisco airport, where years ago a driver in the throes of road rage followed me from the freeway, rolled down his window, and hurled obscenities at me. I never knew what caused his hatred, but I remember his voice whenever I reach that point on my way to the airport.
支持直觉的信息如何被 "储存在记忆中"?某些类型的直觉获得得非常快。 我们从祖先那里继承了学习何时害怕的巨大便利。 事实上,一次经历往往足以建立长期的厌恶和恐惧。我们中的许多人都有这样的内脏记忆: 一道可疑的菜肴,它仍然让我们隐约不愿再去一家餐馆。当我们接近一个发生过不愉快事件的地方时,我们所有人都会紧张起来,即使没有理由期待它再次发生。 对我来说,这样一个地方是通往旧金山机场的斜坡,多年前,一名司机在路怒症的折磨下,从高速公路上跟踪我,摇下他的 窗,并对我进行了污言秽语。我从来不知道是什么导致了他的仇恨,但每当我在去机场的路上到达那个点时,我就会想起他的声音。
My memory of the airport incident is conscious and it fully explains the emotion that comes with it. On many occasions, however, you may feel uneasy in a particular place or when someone uses a particular turn of phrase without having a conscious memory of the triggering event. In hindsight, you will label that unease an intuition if it is followed by a bad experience. This mode of emotional learning is closely related to what happened in Pavlov’s famous conditioning experiments, in which the dogs learned to recognize the sound of the bell as a signal that food was coming. What Pavlov’s dogs learned can be described as a learned hope. Learned fears are even more easily acquired.
我对机场事件的记忆是有意识的,它完全解释了随之而来的情绪。 然而,在许多场合,你可能在一个特定的地方或当某人使用一个特定的短语时感到不安,而对触发的事件没有有意识的 。事后看来, ,如果这种不安之后有不好的经历,你会给这种不安贴上直觉的标签。这种情感学习模式与巴甫洛夫著名的条件反射实验中发生的情况密切相关,在该实验中,狗学会了将铃声识别为食物即将到来的信号。巴甫洛夫的狗学到的东西可以被描述为一种学习的希望。 ,学习的恐惧甚至更容易获得。
Fear can also be learned—quite easily, in fact—by words rather than by experience. The fireman who had the “sixth sense” of danger had certainly had many occasions to discuss and think about types of fires he was not involved in, and to rehearse in his mind what the cues might be and how he should react. As I remember from experience, a young platoon commander with no experience of combat will tense up while leading troops through a narrowing ravine, because he was taught to identify the terrain as favoring an ambush. Little repetition is needed for learning.
恐惧也可以通过语言而不是通过经验来学习--事实上相当容易。拥有危险 "第六感 "的消防员肯定有很多机会讨论和思考他没有参与的火灾类型,并在脑海中预演可能的线索和他应该如何反应。 根据我的经验记忆,一个没有战斗经验的年轻排长 ,在带领部队通过一个狭窄的峡谷时,会紧张起来,因为他被教导要识别地形有利于伏击的情况。学习需要的重复次数很少。
Emotional learning may be quick, but what we consider as “expertise” usually takes a long time to develop. The acquisition of expertise in complex tasks such as high-level chess, professional basketball, or firefighting is intricate and slow because expertise in a domain is not a single skill but rather a large collection of miniskills. Chess is a good example. An expert player can understand a complex position at a glance, but it takes years to develop that level of ability. Studies of chess masters have shown that at least 10,000 hours of dedicated practice (about 6 years of playing chess 5 hours a day) are required to attain the highest levels of performance. During those hours of intense concentration, a serious chess player becomes familiar with thousands of configurations, each consisting of an arrangement of related pieces that can threaten or defend each other.
情感学习可能很快,但我们认为的 "专业知识 "通常需要很长的时间来发展。在复杂的任务中,如高水平的国际象棋、专业的 篮球或消防等,专业知识的获得是错综复杂和缓慢的,因为一个领域的专业知识不是单一的技能,而是大量的迷你技能的集合。国际象棋就是一个很好的例子。一个专家级的棋手可以对复杂的局面一目了然,但需要多年的时间来培养这种能力。 对国际象棋大师的研究表明,要达到最高水平,至少需要 10,000小时 的专门练习(大约6年,每天下5小时的 )。在那几个小时的高度集中中,一个严肃的棋手会熟悉数以千计的配置,每个配置都是由可以互相威胁或防御的相关棋子组成的。
Learning high-level chess can be compared to learning to read. A first grader works hard at recognizing individual letters and assembling them into syllables and words, but a good adult reader perceives entire clauses. An expert reader has also acquired the ability to assemble familiar elements in a new pattern and can quickly “recognize” and correctly pronounce a word that she has never seen before. In chess, recurrent patterns of interacting pieces play the role of letters, and a chess position is a long word or a sentence.
学习高水平的国际象棋可以与学习阅读相比。一个一年级学生在识别单个 字母并将其组合成音节和单词方面很努力,但一个优秀的成人读者可以感知整个句子。一个专业的读者还获得了将熟悉的元素组合成新模式的能力,可以迅速 "识别 "并正确发音一个她以前从未见过的单词。在国际象棋中,相互作用的棋子的重复模式扮演着字母的角色,而一个国际象棋的位置就是 一个长的单词或句子。
A skilled reader who sees it for the first time will be able to read the opening stanza of Lewis Carroll’s “Jabberwocky” with perfect rhythm and intonation, as well as pleasure:
一个熟练的读者如果第一次看到它,就能以完美的节奏和语调以及愉悦的心情来阅读刘易斯-卡罗尔的《炸药包》的开头一节。
’Twas brillig, and the slithy toves
灿烂的光芒和狭长的手套
Did gyre and gimble in the wabe:
在瓦贝做了陀螺和金波。
All mimsy were the borogoves,
所有 ,模仿的是Borogoves。
And the mome raths outgrabe.
而且,这些人还会被赶出家门。
Acquiring expertise in chess is harder and slower than learning to read because there are many more letters in the “alphabet” of chess and because the “words” consist of many letters. After thousands of hours of practice, however, chess masters are able to read a chess situation at a glance. The few moves that come to their mind are almost always strong and sometimes creative. They can deal with a “word” they have never encountered, and they can find a new way to interpret a familiar one.
获得国际象棋的专业知识比 学习阅读更难、更慢,因为国际象棋的 "字母表 "中的字母更多,而且 "单词 "由许多字母组成。然而,经过数千小时的练习,国际象棋大师们能够一目了然地阅读棋局。他们想到的几个动作几乎都很有力,有时还很有创意。他们可以处理一个他们从未遇到过的 "词",他们可以找到 ,以新的方式解释一个熟悉的词。
THE ENVIRONMENT OF SKILL
技能的环境
Klein and I quickly found that we agreed both on the nature of intuitive skill and on how it is acquired. We still needed to agree on our key question: When can you trust a self-confident professional who claims to have an intuition?
克莱因和我很快发现,我们在直觉技能的性质和如何获得直觉技能方面都达成了一致。 我们仍然需要在我们的关键问题上达成一致:你什么时候可以相信一个自称有直觉的专业人士?
We eventually concluded that our disagreement was due in part to the fact that we had different experts in mind. Klein had spent much time with fireground commanders, clinical nurses, and other professionals who have real expertise. I had spent more time thinking about clinicians, stock pickers, and political scientists trying to make unsupportable long-term forecasts. Not surprisingly, his default attitude was trust and respect; mine was skepticism. He was more willing to trust experts who claim an intuition because, as he told me, true experts know the limits of their knowledge. I argued that there are many pseudo-experts who have no idea that they do not know what they are doing (the illusion of validity), and that as a general proposition subjective confidence is commonly too high and often uninformative.
我们最终得出结论,我们的分歧部分是由于我们心中有不同的 专家。克莱恩花了很多时间与火场指挥官、临床护士和其他拥有真正专业知识的专业人士在一起。我花了更多的时间思考临床医生、选股者和政治科学家试图做出不靠谱的长期预测。 毫不奇怪,他的默认态度是信任和尊重;我的态度是怀疑。他更愿意相信那些 、声称有直觉的专家,因为正如他告诉我的,真正的专家知道他们知识的极限。我认为,有很多伪专家根本不知道他们不知道自己在做什么(有效性的错觉),作为一个普遍的命题,主观的信心普遍过高,而且往往不具参考意义。
Earlier I traced people’s confidence in a belief to two related impressions: cognitive ease and coherence. We are confident when the story we tell ourselves comes easily to mind, with no contradiction and no competing scenario. But ease and coherence do not guarantee that a belief held with confidence is true. The associative machine is set to suppress doubt and to evoke ideas and information that are compatible with the currently dominant story. A mind that follows WY SIATI will achieve high confidence much too easily by ignoring what it does not know. It is therefore not surprising that many of us are prone to have high confidence in unfounded intuitions. Klein and I eventually agreed on an important principle: the confidence that people have in their intuitions is not a reliable guide to their validity. In other words, do not trust anyone—including yourself—to tell you how much you should trust their judgment.
早些时候,我把人们对一个信念的信心追溯到两个相关的印象: 认知上的轻松和连贯性。当我们告诉自己的故事很容易想到,没有矛盾,没有竞争的情景时,我们就有信心。但是,轻松和连贯性并不能保证自信地持有的信念是真实的。联想机器被设定为抑制怀疑,并唤起与当前主导故事相一致的想法和信息。一个遵循 WY SIATI的头脑会通过忽视它不知道的东西而太容易获得高信心。因此,我们中的许多人容易对毫无根据的直觉有很高的信心,这并不奇怪。克莱因和我最终在一个重要的原则上达成了一致:人们对自己直觉的 ,并不是对其有效性的可靠指导。换句话说,不要相信任何人--包括你自己-- ,告诉你应该相信他们的判断。
If subjective confidence is not to be trusted, how can we evaluate the probable validity of an intuitive judgment? When do judgments reflect true expertise? When do they display an illusion of validity? The answer comes from the two basic conditions for acquiring a skill:
如果主观自信不可信,我们如何评价直觉判断的可能有效性? 判断何时反映真正的专业知识? 何时显示出有效性的幻觉?答案来自于获得技能的两个基本条件。
an environment that is sufficiently regular to be predictable
一个有足够规律的、可预测的环境
an opportunity to learn these regularities through prolonged practice
有机会 ,通过长期的实践来学习这些规律性的东西。
When both these conditions are satisfied, intuitions are likely to be skilled. Chess is an extreme example of a regular environment, but bridge and poker also provide robust statistical regularities that can support skill. Physicians, nurses, athletes, and firefighters also face complex but fundamentally orderly situations. The accurate intuitions that Gary Klein has described are due to highly valid cues that the expert’s System 1 has learned to use, even if System 2 has not learned to name them. In contrast, stock pickers and political scientists who make long-term forecasts operate in a zero-validity environment. Their failures reflect the basic unpredictability of the events that they try to forecast.
当这两个条件都满足时,直觉很可能是有技巧的。 国际象棋是规则环境的一个极端例子,但桥牌和扑克也提供了强大的统计规则,可以支持技巧。医生、护士、运动员和消防员也面临着复杂但基本有序的情况。加里-克莱因所描述的准确直觉 ,是由于专家的系统1已经学会了使用高度有效的线索,即使系统2还没有学会命名它们。 相反,进行长期预测的选股者和政治学家在一个零有效性的环境中运作。 他们的失败反映了他们试图预测的事件的基本不可预测性。
Some environments are worse than irregular. Robin Hogarth described “wicked” environments, in which professionals are likely to learn the wrong lessons from experience. He borrows from Lewis Thomas the example of a physician in the early twentieth century who often had intuitions about patients who were about to develop typhoid. Unfortunately, he tested his hunch by palpating the patient’s tongue, without washing his hands between patients. When patient after patient became ill, the physician developed a sense of clinical infallibility. His predictions were accurate—but not because he was exercising professional intuition!
有些环境比 不规则。罗宾-霍加斯描述了 "邪恶 "的环境,在这种环境中,专业人士很可能从经验中吸取错误的教训。他从刘易斯-托马斯那里借用了二十世纪初一位医生的例子,这位医生经常对即将发生伤寒的病人有直觉。不幸的是,他通过触摸病人的舌头来测试他的预感,在病人之间没有洗手, 。当一个又一个病人发病时,医生产生了一种临床上的无懈可击的感觉。他的预测是准确的--但不是因为他在行使专业的直觉。
Meehl’s clinicians were not inept and their failure was not due to lack of talent. They performed poorly because they were assigned tasks that did not have a simple solution. The clinicians’ predicament was less extreme than the zero-validity environment of long-term political forecasting, but they operated in low-validity situations that did not allow high accuracy. We know this to be the case because the best statistical algorithms, although more accurate than human judges, were never very accurate. Indeed, the studies by Meehl and his followers never produced a “smoking gun” demonstration, a case in which clinicians completely missed a highly valid cue that the algorithm detected. An extreme failure of this kind is unlikely because human learning is normally efficient. If a strong predictive cue exists, human observers will find it, given a decent opportunity to do so. Statistical algorithms greatly outdo humans in noisy environments for two reasons: they are more likely than human judges to detect weakly valid cues and much more likely to maintain a modest level of accuracy by using such cues consistently.
Meehl的临床医生并非无能,他们的失败也不是因为缺乏天赋。他们的表现很差,因为他们被分配的任务没有一个简单的解决方案。临床医生的困境 ,没有长期政治预测的零效力环境那么极端,但他们在低效力的情况下操作,不允许有高的准确性。我们知道这是事实,因为最好的统计算法,虽然比人类法官更准确,但从来没有非常准确。事实上,Meehl及其追随者的 研究从未产生过 "吸烟枪 "证明, ,其中临床医生完全错过了算法检测到的一个高度有效的线索。这种极端的失败是不可能的,因为人类的学习通常是高效的。如果存在一个强有力的预测线索,只要有一个合适的机会,人类观察者就会发现它。统计算法在嘈杂的环境中大大超过了人类,原因有二:它们比人类 ,更有可能发现弱效的线索,而且更有可能通过持续使用这种线索来保持适度的准确性。
It is wrong to blame anyone for failing to forecast accurately in an unpredictable world. However, it seems fair to blame professionals for believing they can succeed in an impossible task. Claims for correct intuitions in an unpredictable situation are self-delusional at best, sometimes worse. In the absence of valid cues, intuitive “hits” are due either to luck or to lies. If you find this conclusion surprising, you still have a lingering belief that intuition is magic. Remember this rule: intuition cannot be trusted in the absence of stable regularities in the environment.
指责任何人在一个不可预测的世界中未能准确预测是不对的。 然而,指责专业人士相信他们能在一个不可能的任务中取得成功似乎是公平的。在不可预测的情况下声称有正确的直觉,充其量是自欺欺人的 ,有时甚至更糟。在没有有效线索的情况下,直觉的 "命中 "要么是运气,要么是谎言。 如果你觉得这个结论令人惊讶,那么你仍然有一个挥之不去的信念,即直觉是魔术。记住这个规则:在环境中没有稳定的规律性的情况下,不能相信直觉。
FEEDBACK AND PRACTICE
反馈和实践
Some regularities in the environment are easier to discover and apply than others. Think of how you developed your style of using the brakes on your car. As you were mastering the skill of taking curves, you gradually learned when to let go of the accelerator and when and how hard to use the brakes. Curves differ, and the variability you experienced while learning ensures that you are now ready to brake at the right time and strength for any curve you encounter. The conditions for learning this skill are ideal, because you receive immediate and unambiguous feedback every time you go around a bend: the mild reward of a comfortable turn or the mild punishment of some difficulty in handling the car if you brake either too hard or not quite hard enough. The situations that face a harbor pilot maneuvering large ships are no less regular, but skill is much more difficult to acquire by sheer experience because of the long delay between actions and their noticeable outcomes. Whether professionals have a chance to develop intuitive expertise depends essentially on the quality and speed of feedback, as well as on sufficient opportunity to practice.
环境中的一些规律性的东西比其他的更容易发现 和应用。想想你是如何发展你的汽车刹车使用风格的。当你在掌握走弯道的技巧时,你逐渐学会了什么时候放开油门,什么时候和多大程度上使用刹车。曲线不同,你在学习时经历的变化确保你现在已经准备好在正确的时间和强度下为你遇到的任何曲线刹车。 学习这种技能的条件是理想的,因为你每次转弯时都会得到即时和明确的反馈:如果你刹车太猛或不够猛,就会得到舒适转弯的轻微奖励,或者得到一些操控汽车的困难的轻微惩罚。 港口飞行员操纵大船所面临的情况也是如此,但由于行动和明显的结果之间有较长的延迟,技能更难通过单纯的经验获得。 专业人员是否有机会发展直观的专业知识,主要取决于反馈的质量和速度,以及足够的实践机会。
Expertise is not a single skill; it is a collection of skills, and the same professional may be highly expert in some of the tasks in her domain while remaining a novice in others. By the time chess players become experts, they have “seen everything” (or almost everything), but chess is an exception in this regard. Surgeons can be much more proficient in some operations than in others. Furthermore, some aspects of any professional’s tasks are much easier to learn than others. Psychotherapists have many opportunities to observe the immediate reactions of patients to what they say. The feedback enables them to develop the intuitive skill to find the words and the tone that will calm anger, forge confidence, or focus the patient’s attention. On the other hand, therapists do not have a chance to identify which general treatment approach is most suitable for different patients. The feedback they receive from their patients’ long-term outcomes is sparse, delayed, or (usually) nonexistent, and in any case too ambiguous to support learning from experience.
专业知识不是一种单一的技能;它是一种技能的集合,同一个专业人员可能在其领域的某些任务中是高度 专家,而在其他任务中仍然是新手。当棋手成为专家时, ,他们已经 "看到了一切"(或几乎一切),但国际象棋在这方面是个例外。外科医生在某些操作上可能比其他操作要熟练得多。 此外,任何专业人员的任务的某些方面比其他方面要容易学习得多。心理治疗师有 很多机会来观察病人对他们所说的话的直接反应。这些反馈使他们能够发展直觉技能,找到能够平息愤怒、锻造信心或集中病人注意力的话语和语气。 另一方面,治疗师没有机会确定哪种一般的治疗方法最适合不同的病人。他们从病人的长期结果中得到的反馈 ,是稀少的、延迟的或(通常)不存在的,而且在任何情况下都太含糊,无法支持从经验中学习。
Among medical specialties, anesthesiologists benefit from good feedback, because the effects of their actions are likely to be quickly evident. In contrast, radiologists obtain little information about the accuracy of the diagnoses they make and about the pathologies they fail to detect. Anesthesiologists are therefore in a better position to develop useful intuitive skills. If an anesthesiologist says, “I have a feeling something is wrong,” everyone in the operating room should be prepared for an emergency.
在医学专业中,麻醉师受益于良好的反馈,因为他们的行动效果可能很快就会显现出来。相比之下,放射科医生获得的关于他们做出的诊断的准确性 ,以及关于他们未能发现的病症的信息很少。因此,麻醉科医生更有条件发展有用的直觉技能。如果麻醉师说:"我感觉有些不对劲",手术室里的每个人都应该准备好应对紧急情况。
Here again, as in the case of subjective confidence, the experts may not know the limits of their expertise. An experienced psychotherapist knows that she is skilled in working out what is going on in her patient’s mind and that she has good intuitions about what the patient will say next. It is tempting for her to conclude that she can also anticipate how well the patient will do next year, but this conclusion is not equally justified. Short-term anticipation and long-term forecasting are different tasks, and the therapist has had adequate opportunity to learn one but not the other. Similarly, a financial expert may have skills in many aspects of his trade but not in picking stocks, and an expert in the Middle East knows many things but not the future. The clinical psychologist, the stock picker, and the pundit do have intuitive skills in some of their tasks, but they have not learned to identify the situations and the tasks in which intuition will betray them. The unrecognized limits of professional skill help explain why experts are often overconfident.
在这里,与主观信心的情况一样,专家们可能不知道他们的专业知识的局限性。一个有经验的 心理治疗师知道,她能熟练地弄清病人的想法,而且她对病人接下来会说什么有很好的直觉。她很想得出结论,她也能预测病人明年会做得多好,但这个结论并不同样合理。短期预测和长期预测是不同的任务, ,治疗师有足够的机会学习一个,但不是另一个。同样,一个金融专家可能在其行业的许多方面有技能,但在挑选股票方面却没有,一个中东专家知道许多事情,但不知道未来。临床心理学家、选股者和专家学者在某些任务中确实有直觉技能,但他们没有学会识别 ,以及直觉会背叛他们的任务。未被认识到的专业技能的局限性有助于解释为什么专家经常过度自信。
EVALUATING VALIDITY
评估有效性
At the end of our journey, Gary Klein and I agreed on a general answer to our initial question: When can you trust an experienced professional who claims to have an intuition? Our conclusion was that for the most part it is possible to distinguish intuitions that are likely to be valid from those that are likely to be bogus. As in the judgment of whether a work of art is genuine or a fake, you will usually do better by focusing on its provenance than by looking at the piece itself. If the environment is sufficiently regular and if the judge has had a chance to learn its regularities, the associative machinery will recognize situations and generate quick and accurate predictions and decisions. You can trust someone’s intuitions if these conditions are met.
在我们的旅程结束时,加里-克莱因和我对我们最初的问题达成了一个一般性的答案:你什么时候可以相信一个有经验的专业人士,他 ,声称有一种直觉。我们的结论是,在大多数情况下, ,有可能区分可能是有效的直觉和可能是假的直觉。就像判断一件艺术品是真品还是赝品一样,关注它的出处通常会比看作品本身做得更好。 如果环境有足够的规律性,如果判断者有机会学习它的规律性,联想机器 ,就能识别情况并产生快速而准确的预测和决定。如果满足这些条件,你可以相信某人的直觉。
Unfortunately, associative memory also generates subjectively compelling intuitions that are false. Anyone who has watched the chess progress of a talented youngster knows well that skill does not become perfect all at once, and that on the way to near perfection some mistakes are made with great confidence. When evaluating expert intuition you should always consider whether there was an adequate opportunity to learn the cues, even in a regular environment.
不幸的是,联想记忆也会产生主观上令人信服的直觉,而这些直觉是错误的。任何观察过天才少年的国际象棋进展的人都清楚,技能不会一下子变得完美,在通往 接近完美的道路上,有些错误是很有把握的。在评估专家的直觉时,你应该始终考虑是否有足够的机会学习提示,即使是在常规环境下。
In a less regular, or low-validity, environment, the heuristics of judgment are invoked. System 1 is often able to produce quick answers to difficult questions by substitution, creating coherence where there is none. The question that is answered is not the one that was intended, but the answer is produced quickly and may be sufficiently plausible to pass the lax and lenient review of System 2. You may want to forecast the commercial future of a company, for example, and believe that this is what you are judging, while in fact your evaluation is dominated by your impressions of the energy and competence of its current executives. Because substitution occurs automatically, you often do not know the origin of a judgment that you (your System 2) endorse and adopt. If it is the only one that comes to mind, it may be subjectively undistinguishable from valid judgments that you make with expert confidence. This is why subjective confidence is not a good diagnostic of accuracy: judgments that answer the wrong question can also be made with high confidence.
在一个不太正规的,或者说低效的环境中,判断的启发式方法被调用。 系统1往往能够通过替换来快速产生对困难问题的答案,在没有连贯性的地方创造 。被回答的问题并不是原意的问题,但答案很快就产生了,而且可能足够可信,以通过系统2的宽松和懒惰的审查。例如,你可能想预测一家公司的商业未来,并认为这就是你的判断,而事实上你的评价是由你对其现任高管的精力和能力的印象 。由于替代是自动发生的,你往往不知道你(你的系统2)认可和采用的判断的起源。如果它是唯一想到的,它可能在主观上与你以专家的信心做出的有效判断没有区别。这就是为什么主观置信度不能很好地诊断准确性的原因: ,回答错误的判断也可能是高置信度的。
You may be asking, Why didn’t Gary Klein and I come up immediately with the idea of evaluating an expert’s intuition by assessing the regularity of the environment and the expert’s learning history—mostly setting aside the expert’s confidence? And what did we think the answer could be? These are good questions because the contours of the solution were apparent from the beginning. We knew at the outset that fireground commanders and pediatric nurses would end up on one side of the boundary of valid intuitions and that the specialties studied by Meehl would be on the other, along with stock pickers and pundits.
你可能会问,为什么加里-克莱因和我没有立即想到通过评估环境的规律性和专家的学习历史来评估专家的直觉--主要是把专家的信心放在一边?而我们认为答案可能是什么?这些问题很好,因为 ,解决方案的轮廓从一开始就很明显。我们一开始就知道,火场指挥官和儿科护士最终会站在有效直觉边界的一边,而Meehl研究的专业会在另一边,还有选股人和专家学者。
It is difficult to reconstruct what it was that took us years, long hours of discussion, endless exchanges of drafts and hundreds of e-mails negotiating over words, and more than once almost giving up. But this is what always happens when a project ends reasonably well: once you understand the main conclusion, it seems it was always obvious.
,很难重构是什么让我们花了数年时间,长时间的讨论,无休止的交流 ,以及数百封电子邮件的文字谈判,而且不止一次地几乎放弃了。但是,当一个项目合理地结束时,总会发生这种情况:一旦你理解了主要的结论,似乎它总是显而易见的。
As the title of our article suggests, Klein and I disagreed less than we had expected and accepted joint solutions of almost all the substantive issues that were raised. However, we also found that our early differences were more than an intellectual disagreement. We had different attitudes, emotions, and tastes, and those changed remarkably little over the years. This is most obvious in the facts that we find amusing and interesting. Klein still winces when the word bias is mentioned, and he still enjoys stories in which algorithms or formal procedures lead to obviously absurd decisions. I tend to view the occasional failures of algorithms as opportunities to improve them. On the other hand, I find more pleasure than Klein does in the comeuppance of arrogant experts who claim intuitive powers in zero-validity situations. In the long run, however, finding as much intellectual agreement as we did is surely more important than the persistent emotional differences that remained.
正如我们文章的标题所示,克莱因和我的分歧比我们预期的要小,并接受了几乎所有实质性问题的共同解决方案,这些问题 。然而,我们也发现,我们早期的分歧不仅仅是智力上的分歧。 我们有不同的态度、情感和品味,而这些在多年来变化明显不大。 这在我们认为好笑和有趣的事实中最为明显。提到 偏见 这个词,Klein仍然会感到害怕,他仍然喜欢那些算法或正式程序 ,导致明显荒谬的决定的故事。我倾向于将算法的偶尔失败视为改进它们的机会。另一方面,我比Klein更喜欢在零效力的情况下对那些声称有直觉能力的傲慢的专家进行惩罚。 然而,从长远来看,像我们这样找到智力上的一致,肯定比持续存在的情感 ,差异更重要。
SPEAKING OF EXPERT INTUITION
说到专家的直觉
“How much expertise does she have in this particular task? How much practice has she had?”
"她在这项特殊的任务中拥有多少专业知识?她有多少练习?"
“Does he really believe that the environment of start-ups is sufficiently regular to justify an intuition that goes against the base rates?”
"他真的相信,创业的环境有足够的规律性,可以证明违背基准利率的直觉是正确的吗?"
“She is very confident in her decision, but subjective confidence is a poor index of the accuracy of a judgment.”
"她对自己的决定非常有信心,但主观信心是衡量判断准确性的一个糟糕的指标。"
“Did he really have an opportunity to learn? How quick and how clear was the feedback he received on his judgments?”
"他真的有机会学习吗?他对自己的判断收到的反馈有多快,有多清楚?"
23
23
The Outside View
外界的看法
A few years after my collaboration with Amos began, I convinced some officials in the Israeli Ministry of Education of the need for a curriculum to teach judgment and decision making in high schools. The team that I assembled to design the curriculum and write a textbook for it included several experienced teachers, some of my psychology students, and Seymour Fox, then dean of the Hebrew University’s School of Education, who was an expert in curriculum development.
在我与阿莫斯的合作开始几年后,我说服了以色列教育部的一些官员,认为有必要在高中开设教授判断和决策的课程。 ,我组建了一个团队来设计课程并为其编写教科书,其中包括几位经验丰富的教师、我的一些心理学学生,以及当时希伯来大学教育学院院长西摩-福克斯,他是课程开发方面的专家。
After meeting every Friday afternoon for about a year, we had constructed a detailed outline of the syllabus, had written a couple of chapters, and had run a few sample lessons in the classroom. We all felt that we had made good progress. One day, as we were discussing procedures for estimating uncertain quantities, the idea of conducting an exercise occurred to me. I asked everyone to write down an estimate of how long it would take us to submit a finished draft of the textbook to the Ministry of Education. I was following a procedure that we already planned to incorporate into our curriculum: the proper way to elicit information from a group is not by starting with a public discussion but by confidentially collecting each person’s judgment. This procedure makes better use of the knowledge available to members of the group than the common practice of open discussion. I collected the estimates and jotted the results on the blackboard. They were narrowly centered around two years; the low end was one and a half, the high end two and a half years.
在大约一年的时间里,我们每周五下午开会,构建了一个详细的教学大纲,写了几个章节, ,并在课堂上上了几节样板课。我们都觉得我们已经取得了良好的进展。有一天,当我们讨论估计不确定数量的程序时,我想到了进行一次练习的想法。我让每个人写下我们向教育部提交教科书成品草案所需的估计时间。我是按照我们已经计划纳入课程的程序 :从一个群体中获取信息的正确方法不是从公开讨论开始,而是秘密地收集每个人的判断。与公开讨论的常见做法相比,这种程序能更好地利用小组成员所掌握的知识。我收集了估计值,并在黑板上记下了结果。 ,它们都是以两年为中心狭长的;低端是一年半,高端是两年半。
Then I had another idea. I turned to Seymour, our curriculum expert, and asked whether he could think of other teams similar to ours that had developed a curriculum from scratch. This was a time when several pedagogical innovations like “new math” had been introduced, and Seymour said he could think of quite a few. I then asked whether he knew the history of these teams in some detail, and it turned out that he was familiar with several. I asked him to think of these teams when they had made as much progress as we had. How long, from that point, did it take them to finish their textbook projects?
然后我有了另一个想法。我转向我们的课程专家西摩, ,问他是否能想到其他与我们类似的团队,从零开始开发课程。这时已经出现了一些像 "新数学 "这样的教学创新,西摩说 ,他能想到不少。我接着问他是否对这些团队的历史有一些详细的了解,结果发现他对几个团队都很熟悉。我让他想一想,当这些团队取得和我们一样多的进步时。从那时起,他们花了多长时间来完成他们的教科书项目?
He fell silent. When he finally spoke, it seemed to me that he was blushing, embarrassed by his own answer: “You know, I never realized this before, but in fact not all the teams at a stage comparable to ours ever did complete their task. A substantial fraction of the teams ended up failing to finish the job.”
他陷入了沉默。当他终于开口时,在我看来,他脸红了, ,为自己的回答感到尴尬:"你知道,我以前从未意识到这一点,但事实上,并非所有处于与我们相当阶段的团队都完成过任务。相当一部分团队最终没能完成工作。"
This was worrisome; we had never considered the possibility that we might fail. My anxiety rising, I asked how large he estimated that fraction was. “About 40%,” he answered. By now, a pall of gloom was falling over the room. The next question was obvious: “Those who finished,” I asked. “How long did it take them?” “I cannot think of any group that finished in less than seven years,” he replied, “nor any that took more than ten.”
这是令人担忧的;我们从未考虑过我们可能会失败的可能性。我的焦虑上升,我问他估计这部分有多大。"大约40%," ,他回答。现在,房间里已经笼罩了一层阴霾。下一个问题很明显:"那些完成的人,"我问。"他们花了多长时间?""我想不出有哪个小组是在不到七年的时间里完成的,"他回答说,"也没有哪个小组是花了十年以上的。"
I grasped at a straw: “When you compare our skills and resources to those of the other groups, how good are we? How would you rank us in comparison with these teams?” Seymour did not hesitate long this time. “We’re below average,” he said, “but not by much.” This came as a complete surprise to all of us—including Seymour, whose prior estimate had been well within the optimistic consensus of the group. Until I prompted him, there was no connection in his mind between his knowledge of the history of other teams and his forecast of our future.
我抓住了一根稻草:"当你把我们的技能和资源与其他团体的技能和资源相比较时,我们有多好?与这些球队相比,你会如何 我们的排名?"西摩这次没有犹豫太久。"我们低于平均水平,"他说,"但不是很多。"这让我们所有人都大吃一惊--包括西摩,他之前的估计完全在小组的乐观共识范围内。 在我提示他之前,他的脑海中没有将他对其他球队历史的了解与他对我们未来的预测 。
Our state of mind when we heard Seymour is not well described by stating what we “knew.” Surely all of us “knew” that a minimum of seven years and a 40% chance of failure was a more plausible forecast of the fate of our project than the numbers we had written on our slips of paper a few minutes earlier. But we did not acknowledge what we knew. The new forecast still seemed unreal, because we could not imagine how it could take so long to finish a project that looked so manageable. No crystal ball was available to tell us the strange sequence of unlikely events that were in our future. All we could see was a reasonable plan that should produce a book in about two years, conflicting with statistics indicating that other teams had failed or had taken an absurdly long time to complete their mission. What we had heard was base-rate information, from which we should have inferred a causal story: if so many teams failed, and if those that succeeded took so long, writing a curriculum was surely much harder than we had thought. But such an inference would have conflicted with our direct experience of the good progress we had been making. The statistics that Seymour provided were treated as base rates normally are—noted and promptly set aside.
我们听到西摩时的心境,并不能用陈述我们 "知道 "的事情来很好地描述。当然,我们所有人都 "知道",与我们几分钟前写在纸条上的数字相比,至少七年的时间和40%的失败几率是对我们项目命运的更可信的预测。但我们并没有承认我们所知道的事情。新的预测仍然显得 不真实,因为我们无法想象怎么会花这么长时间来完成一个看起来很容易管理的项目。没有水晶球可以告诉我们,在我们的未来有一连串奇怪的不可能的事件。我们所看到的是一个合理的计划,应该在两年左右出书,与表明其他团队失败或花了荒谬的长 时间来完成任务的统计数字相冲突。我们所听到的是基数信息,我们应该从中推断出一个因果 故事:如果这么多团队失败了,如果那些成功的团队花了这么长时间,编写课程肯定比我们想象的难得多。但这样的推断会与我们一直以来取得的良好进展的直接经验相冲突。西摩 ,所提供的统计数据被视为通常注意到的基本比率,并迅速搁置。
We should have quit that day. None of us was willing to invest six more years of work in a project with a 40% chance of failure. Although we must have sensed that persevering was not reasonable, the warning did not provide an immediately compelling reason to quit. After a few minutes of desultory debate, we gathered ourselves together and carried on as if nothing had happened. The book was eventually completed eight(!) years later. By that time I was no longer living in Israel and had long since ceased to be part of the team, which completed the task after many unpredictable vicissitudes. The initial enthusiasm for the idea in the Ministry of Education had waned by the time the text was delivered and it was never used.
我们应该在那一天退出。我们中没有人愿意在一个有40%失败可能性的项目上再投资六年的工作。虽然我们一定感觉到坚持下去是不合理的,但警告并没有提供一个立即令人信服的理由来放弃。经过几分钟的争论,我们把 ,然后若无其事地继续工作。 这本书最终在八年(!)后完成。那时我已不再住在以色列,也早已不再是团队的一员,在经历了许多不可预知的变故后,完成了任务。教育部最初对这一想法的热情在文本交付时已经减弱,而且 ,从未使用过。
This embarrassing episode remains one of the most instructive experiences of my professional life. I eventually learned three lessons from it. The first was immediately apparent: I had stumbled onto a distinction between two profoundly different approaches to forecasting, which Amos and I later labeled the inside view and the outside view . The second lesson was that our initial forecasts of about two years for the completion of the project exhibited a planning fallacy. Our estimates were closer to a best-case scenario than to a realistic assessment. I was slower to accept the third lesson, which I call irrational perseverance: the folly we displayed that day in failing to abandon the project. Facing a choice, we gave up rationality rather than give up the enterprise.
这段尴尬的经历至今仍是我职业生活中最具启发性的经历之一。 我最终从中吸取了三个教训。第一个问题是显而易见的:我偶然发现了两种截然不同的预测方法之间的区别,阿莫斯和我后来将其称为 内部观点和外部观点 。第二个教训是,我们最初的预测 ,大约两年时间完成项目,这显示了一个规划谬误。我们的估计更接近于最好的情况,而不是现实的评估。 我接受第三个教训的速度较慢,我称之为非理性的坚持:那天我们没有放弃这个项目所表现出的愚蠢。 面对一个选择,我们放弃了理性,而不是放弃了这个企业。
DRAWN TO THE INSIDE VIEW
绘制了 内视图
On that long-ago Friday, our curriculum expert made two judgments about the same problem and arrived at very different answers . The inside view is the one that all of us, including Seymour, spontaneously adopted to assess the future of our project. We focused on our specific circumstances and searched for evidence in our own experiences. We had a sketchy plan: we knew how many chapters we were going to write, and we had an idea of how long it had taken us to write the two that we had already done. The more cautious among us probably added a few months to their estimate as a margin of error.
在那个很久以前的星期五,我们的课程专家对同一个问题做出了两个判断,得出了 非常不同的答案 。 内部观点 是我们所有的人,包括西摩,自发采用的观点,以评估我们项目的未来。 我们专注于我们的具体情况,并在我们自己的经验中寻找证据。 我们有一个粗略的计划:我们知道我们要写多少 章,我们对我们花了多长时间写完已经完成的两章有一个想法。我们中更谨慎的人可能在他们的估计上加上几个月作为误差幅度。
Extrapolating was a mistake. We were forecasting based on the information in front of us—WYSIATI—but the chapters we wrote first were probably easier than others, and our commitment to the project was probably then at its peak. But the main problem was that we failed to allow for what Donald Rumsfeld famously called the “unknown unknowns.” There was no way for us to foresee, that day, the succession of events that would cause the project to drag out for so long. The divorces, the illnesses, the crises of coordination with bureaucracies that delayed the work could not be anticipated. Such events not only cause the writing of chapters to slow down, they also produce long periods during which little or no progress is made at all. The same must have been true, of course, for the other teams that Seymour knew about. The members of those teams were also unable to imagine the events that would cause them to spend seven years to finish, or ultimately fail to finish, a project that they evidently had thought was very feasible. Like us, they did not know the odds they were facing. There are many ways for any plan to fail, and although most of them are too improbable to be anticipated, the likelihood that something will go wrong in a big project is high.
推断是一个错误。我们是根据摆在我们面前的信息 ,即YYSIATI进行预测的,但我们先写的章节可能比其他章节更容易,而且我们对项目的承诺 ,当时可能处于高峰期。但主要问题是,我们没有考虑到唐纳德-拉姆斯菲尔德著名的 "未知数"。那一天,我们没有办法预见到将导致该项目拖得如此之久的一系列事件。离婚、疾病、与官僚机构协调的危机,这些都耽误了工作,无法 。这些事件不仅导致章节的写作速度减慢,还产生了很长一段时间,期间几乎没有进展。 当然,西摩知道的其他团队也一定是如此。这些团队的成员也无法想象会有什么事件导致他们花七年时间来完成,或者最终无法完成一个他们显然认为非常可行的项目 。和我们一样,他们也不知道他们所面临的几率。 任何计划都有很多失败的方式,尽管其中大多数都是不可能预料的,但在一个大项目中出错 的 可能性是很大的。
The second question I asked Seymour directed his attention away from us and toward a class of similar cases. Seymour estimated the base rate of success in that reference class: 40% failure and seven to ten years for completion. His informal survey was surely not up to scientific standards of evidence, but it provided a reasonable basis for a baseline prediction: the prediction you make about a case if you know nothing except the category to which it belongs. As we saw earlier, the baseline prediction should be the anchor for further adjustments. If you are asked to guess the height of a woman about whom you know only that she lives in New York City, your baseline prediction is your best guess of the average height of women in the city. If you are now given case-specific information, for example that the woman’s son is the starting center of his high school basketball team, you will adjust your estimate away from the mean in the appropriate direction. Seymour’s comparison of our team to others suggested that the forecast of our outcome was slightly worse than the baseline prediction, which was already grim.
我问西摩的第二个问题把他的注意力从我们身上引开,引向了一类类似的案例。西摩 ,估计了这一类参考案例的基本成功率:40%失败,7到10年完成。他的非正式调查当然达不到科学的证据标准,但它为基线预测提供了一个合理的基础:如果你对一个案例一无所知,只知道它属于哪一类,你就会做出预测。 正如我们前面所看到的, 基线预测 应该是进一步调整的锚。如果让你猜测一个女人的身高,而你只知道她住在纽约市,你的基线预测就是你对该市女性平均身高的最佳猜测。如果你现在得到了特定案例的信息,例如,这个女人的儿子是他高中篮球队的首发中锋,你将在适当的方向上调整你的估计 ,远离平均值。 西摩将我们的团队与其他人进行比较,表明我们的结果预测比基线预测略差,而基线预测已经很严峻。
The spectacular accuracy of the outside-view forecast in our problem was surely a fluke and should not count as evidence for the validity of the outside view . The argument for the outside view should be made on general grounds: if the reference class is properly chosen, the outside view will give an indication of where the ballpark is, and it may suggest, as it did in our case, that the inside-view forecasts are not even close to it.
在我们的问题中,外景预测的惊人准确性肯定是一种侥幸,不应该算作 外景 的有效性的证据。支持外部 的观点应该基于一般的理由:如果参考类选择得当,外部观点会给出一个球门在哪里的指示,它可能会像在我们的案例中那样,暗示内部观点的预测甚至不接近它。
For a psychologist, the discrepancy between Seymour’s two judgments is striking. He had in his head all the knowledge required to estimate the statistics of an appropriate reference class, but he reached his initial estimate without ever using that knowledge. Seymour’s forecast from his inside view was not an adjustment from the baseline prediction, which had not come to his mind. It was based on the particular circumstances of our efforts. Like the participants in the Tom W experiment, Seymour knew the relevant base rate but did not think of applying it.
对于一个心理学家来说,西摩的两个判断之间的差异是惊人的。他脑子里有估计 适当参考类的统计数字所需的所有知识 ,但他没有使用过这些知识就达到了最初的估计。西摩的预测从他的内部观点来看,并不是从基线预测的调整,他没有想到这一点。 它是基于我们努力的特殊情况。 像汤姆-W实验的参与者一样,西摩知道相关的基线比率,但没有想到 ,应用它。
Unlike Seymour, the rest of us did not have access to the outside view and could not have produced a reasonable baseline prediction. It is noteworthy, however, that we did not feel we needed information about other teams to make our guesses. My request for the outside view surprised all of us, including me! This is a common pattern: people who have information about an individual case rarely feel the need to know the statistics of the class to which the case belongs.
与西摩不同的是,我们其他人没有机会看到外面的景象,也不可能做出合理的基线预测。 然而,值得注意的是,我们并不觉得我们需要关于其他团队的信息来进行猜测。 我对外面景象的要求让我们所有人都感到惊讶,包括我在内!我的要求是:"我们要有足够的信心。这是一个常见的模式:掌握个人 案件信息的人很少觉得有必要了解该案件所属类别的统计数据。
When we were eventually exposed to the outside view, we collectively ignored it. We can recognize what happened to us; it is similar to the experiment that suggested the futility of teaching psychology. When they made predictions about individual cases about which they had a little information (a brief and bland interview), Nisbett and Borgida’s students completely neglected the global results they had just learned. “Pallid” statistical information is routinely discarded when it is incompatible with one’s personal impressions of a case. In the competition with the inside view, the outside view doesn’t stand a chance.
当我们最终接触到外界的观点时,我们集体忽略了它。 我们可以认识到发生在我们身上的事情;这类似于暗示心理学教学无用的实验。 当他们对他们有一点信息(简短而 平淡的采访)的个别案例进行预测时,尼斯贝特和博吉达的学生完全忽略了他们刚刚学到的全球结果。当 "苍白 "的统计信息 与个人对案件的印象不一致时,通常会被抛弃。 在与内部观点的竞争中,外部观点是没有机会的。
The preference for the inside view sometimes carries moral overtones. I once asked my cousin, a distinguished lawyer, a question about a reference class: “What is the probability of the defendant winning in cases like this one?” His sharp answer that “every case is unique” was accompanied by a look that made it clear he found my question inappropriate and superficial. A proud emphasis on the uniqueness of cases is also common in medicine, in spite of recent advances in evidence-based medicine that point the other way. Medical statistics and baseline predictions come up with increasing frequency in conversations between patients and physicians. However, the remaining ambivalence about the outside view in the medical profession is expressed in concerns about the impersonality of procedures that are guided by statistics and checklists.
对内部观点的偏爱有时带有道德色彩。我曾经问过 我的表弟,一位杰出的律师,一个关于参考类的问题:"像这样的案件,被告获胜的概率是多少?"他犀利地回答说 "每个案例都是独一无二的",同时还用眼神表明他认为我的问题不合适,而且很肤浅。尽管最近以证据为基础的 医学的进展指向另一个方向,但对病例独特性的自豪强调在医学中也很常见。在病人和医生的对话中,医学统计和基线预测出现的频率越来越高。 然而,医学界对外界观点的剩余矛盾心理表现在对以统计和检查表为指导的 程序的非个性化的 担忧。
THE PLANNING FALLACY
规划谬误
In light of both the outside-view forecast and the eventual outcome, the original estimates we made that Friday afternoon appear almost delusional. This should not come as a surprise: overly optimistic forecasts of the outcome of projects are found everywhere. Amos and I coined the term planning fallacy to describe plans and forecasts that
鉴于 外景预测和最终的结果,我们在那个星期五下午所做的最初估计似乎几乎是妄想。 这不应该令人惊讶:对项目结果过于乐观的预测随处可见。阿莫斯和我创造了 计划谬误 一词,来描述那些计划和预测
are unrealistically close to best-case scenarios
不切实际地接近于最佳情况。
could be improved by consulting the statistics of similar cases
可以通过咨询 ,改善类似案件的统计。
Examples of the planning fallacy abound in the experiences of individuals, governments, and businesses. The list of horror stories is endless.
在个人、政府和企业的经历中,规划谬误的例子比比皆是。 恐怖的故事不胜枚举。
In July 1997, the proposed new Scottish Parliament building in Edinburgh was estimated to cost up to £40 million. By June 1999, the budget for the building was £109 million. In April 2000, legislators imposed a £195 million “cap on costs.” By November 2001, they demanded an estimate of “final cost,” which was set at £241 million. That estimated final cost rose twice in 2002, ending the year at £294.6 million. It rose three times more in 2003, reaching £375.8 million by June. The building was finally completed in 2004 at an ultimate cost of roughly £431 million.
1997年7月,拟在爱丁堡建造的新 苏格兰议会大厦 估计费用高达4000万英镑。 到1999年6月,该大厦的预算为1.09亿英镑。 2000年4月,立法者对1.95亿英镑的 "费用上限"。到2001年11月,他们要求对 "最终成本 "进行估算,该估算被定为2.41亿英镑。 该估算的最终成本在2002年上升了两倍,年底达到2.946亿英镑。 2003年又上升了三倍,到6月达到3.758亿英镑。该建筑最终于2004年完工,最终成本约为4.31亿英镑。
A 2005 study examined rail projects undertaken worldwide between 1969 and 1998. In more than 90% of the cases, the number of passengers projected to use the system was overestimated. Even though these passenger shortfalls were widely publicized, forecasts did not improve over those thirty years; on average, planners overestimated how many people would use the new rail projects by 106%, and the average cost overrun was 45%. As more evidence accumulated, the experts did not become more reliant on it .
2005年的一项研究审查了1969年至1998年期间在世界各地进行的铁路项目 。在超过90%的情况下,预计使用该系统的乘客数量被高估。即使这些乘客的短缺被广泛宣传,预测在这30年里并没有改善;平均而言,规划者将使用新铁路项目的人数高估了106%,而平均成本超支45%。随着更多证据的积累, ,专家们 并没有变得更加依赖它 。
In 2002, a survey of American homeowners who had remodeled their kitchens found that, on average, they had expected the job to cost $18,658; in fact, they ended up paying an average of $38,769.
2002年, 对 改造过厨房的 美国房主进行 的一项 调查 发现,平均而言,他们预计这项工作将花费18,658美元;事实上,他们最终平均支付了38,769美元。
The optimism of planners and decision makers is not the only cause of overruns. Contractors of kitchen renovations and of weapon systems readily admit (though not to their clients) that they routinely make most of their profit on additions to the original plan. The failures of forecasting in these cases reflect the customers’ inability to imagine how much their wishes will escalate over time. They end up paying much more than they would if they had made a realistic plan and stuck to it.
规划者和决策者的乐观态度并不是造成超支的唯一原因。 厨房装修和武器系统的承包商很容易承认(虽然 ,但不向他们的客户承认),他们通常会在原计划的基础上增加利润的部分。在这些案例中,预测的失败反映了客户无法想象他们的愿望会随着时间的推移而升级。 他们最终支付的费用比他们制定一个现实的计划并坚持下去的时候要多得多。
Errors in the initial budget are not always innocent. The authors of unrealistic plans are often driven by the desire to get the plan approved—whether by their superiors or by a client—supported by the knowledge that projects are rarely abandoned unfinished merely because of overruns in costs or completion times . In such cases, the greatest responsibility for avoiding the planning fallacy lies with the decision makers who approve the plan. If they do not recognize the need for an outside view, they commit a planning fallacy.
初始预算中的错误并不总是无辜的。不切实际的计划的作者 ,他们的动机往往是希望计划得到批准--不管是 ,还是由客户批准--支持他们的知识是:项目很少仅仅因为成本或 完成时间 的超支而未完成而放弃。在这种情况下,避免规划谬误的最大责任在于批准规划的决策者。如果他们没有认识到 ,就犯了一个规划谬误。
MITIGATING THE PLANNING FALLACY
减轻规划谬误的影响
The diagnosis of and the remedy for the planning fallacy have not changed since that Friday afternoon, but the implementation of the idea has come a long way. The renowned Danish planning expert Bent Flyvbjerg, now at Oxford University, offered a forceful summary:
自那个星期五的下午以来,对规划谬误的诊断和补救措施并没有改变,但这个想法的实施却有了长足的进步。 著名的丹麦规划专家Bent Flyvbjerg,现在在牛津大学,提供了一个强有力的总结。
The prevalent tendency to underweight or ignore distributional information is perhaps the major source of error in forecasting. Planners should therefore make every effort to frame the forecasting problem so as to facilitate utilizing all the distributional information that is available.
普遍存在的低估 ,或忽视分布信息的倾向也许是预测中错误的主要来源。因此,规划者应尽一切努力构建预测问题,以便于利用所有可用的分布信息。
This may be considered the single most important piece of advice regarding how to increase accuracy in forecasting through improved methods. Using such distributional information from other ventures similar to that being forecasted is called taking an “outside view” and is the cure to the planning fallacy.
这可能被认为是关于如何通过改进方法提高预测准确性的最重要的一条建议。使用 ,这种来自与被预测的企业相似的分布信息被称为 "外部观点",是解决规划谬误的方法。
The treatment for the planning fallacy has now acquired a technical name, reference class forecasting , and Flyvbjerg has applied it to transportation projects in several countries. The outside view is implemented by using a large database, which provides information on both plans and outcomes for hundreds of projects all over the world, and can be used to provide statistical information about the likely overruns of cost and time, and about the likely underperformance of projects of different types.
对计划谬误的处理现在已经有了一个技术名称,即 参考类预测 ,Flyvbjerg已经将其应用于几个国家的交通项目。外部观点是通过使用一个大型数据库来实现的, ,该数据库提供了全世界数百个项目的计划和结果的信息,可以用来提供关于成本和时间可能超支的统计信息,以及关于不同类型的项目可能表现不佳的信息。
The forecasting method that Flyvbjerg applies is similar to the practices recommended for overcoming base-rate neglect:
Flyvbjerg应用的预测方法类似于为克服基数忽略而推荐的做法。
Identify an appropriate reference class (kitchen renovations, large railway projects, etc.).
确定 一个适当的参考类(厨房装修、大型铁路项目等)。
Obtain the statistics of the reference class (in terms of cost per mile of railway, or of the percentage by which expenditures exceeded budget). Use the statistics to generate a baseline prediction.
获得参考类的统计数据(以每英里铁路的成本或支出超过预算的百分比计算)。 使用这些统计数据来生成一个基线预测。
Use specific information about the case to adjust the baseline prediction, if there are particular reasons to expect the optimistic bias to be more or less pronounced in this project than in others of the same type.
如果有特别的理由预期 ,乐观偏差在这个项目中比在其他同类型的项目中更明显或更不明显,则使用 有关案例的具体信息来调整基线预测。
Flyvbjerg’s analyses are intended to guide the authorities that commission public projects, by providing the statistics of overruns in similar projects. Decision makers need a realistic assessment of the costs and benefits of a proposal before making the final decision to approve it. They may also wish to estimate the budget reserve that they need in anticipation of overruns, although such precautions often become self-fulfilling prophecies. As one official told Flyvbjerg, “A budget reserve is to contractors as red meat is to lions, and they will devour it.”
Flyvbjerg的分析旨在指导委托公共项目的当局,提供类似项目的超支统计数据。 决策者在做出批准提案的最终决定前,需要对提案的成本和收益进行现实的评估。他们也可能 ,估计他们需要的预算储备,以备超支,尽管这种预防措施往往成为自我实现的预言。 正如一位官员告诉Flyvbjerg,"预算储备对承包商来说就像红肉对狮子一样,他们会吞噬它。"
Organizations face the challenge of controlling the tendency of executives competing for resources to present overly optimistic plans. A well-run organization will reward planners for precise execution and penalize them for failing to anticipate difficulties, and for failing to allow for difficulties that they could not have anticipated—the unknown unknowns.
组织面临的挑战是如何控制争夺资源的高管提出过于乐观的计划的倾向。 一个运行良好的组织会奖励精确执行的计划者,并惩罚他们未能预见困难,以及未能考虑到他们无法预见的困难--未知的未知因素。
DECISIONS AND ERRORS
决定和错误
That Friday afternoon occurred more than thirty years ago. I often thought about it and mentioned it in lectures several times each year. Some of my friends got bored with the story, but I kept drawing new lessons from it. Almost fifteen years after I first reported on the planning fallacy with Amos, I returned to the topic with Dan Lovallo. Together we sketched a theory of decision making in which the optimistic bias is a significant source of risk taking. In the standard rational model of economics, people take risks because the odds are favorable—they accept some probability of a costly failure because the probability of success is sufficient. We proposed an alternative idea.
那个星期五的下午发生在三十多年前。 我经常想到它,并在每年的讲座中多次提到它。我的一些 的朋友对这个故事感到厌烦,但我不断从中吸取新的教训。在我与阿莫斯首次报道规划谬误近15年后,我又与丹-洛瓦洛回到了这个话题。我们一起勾勒了一个决策理论,其中乐观偏差是风险承担的重要来源。在经济学的标准理性模型中,人们承担风险是因为几率是有利的--他们 ,因为成功的概率是足够的,所以接受一些代价高昂的失败的概率。 我们提出了一个替代的想法。
When forecasting the outcomes of risky projects, executives too easily fall victim to the planning fallacy. In its grip, they make decisions based on delusional optimism rather than on a rational weighting of gains, losses, and probabilities. They overestimate benefits and underestimate costs. They spin scenarios of success while overlooking the potential for mistakes and miscalculations. As a result, they pursue initiatives that are unlikely to come in on budget or on time or to deliver the expected returns—or even to be completed.
在预测风险项目的结果时,高管们太容易成为计划谬误的受害者。在它的控制下,他们基于妄想的乐观主义而不是对收益、损失和概率的理性权衡来做决定。他们高估了收益 ,低估了成本。他们编造成功的情景,却忽视了错误和误判的可能性。 结果,他们追求的倡议不可能按预算或时间完成,也不可能实现预期的回报,甚至不可能完成。
In this view, people often (but not always) take on risky projects because they are overly optimistic about the odds they face. I will return to this idea several times in this book—it probably contributes to an explanation of why people litigate, why they start wars, and why they open small businesses.
在 这一观点中,人们经常(但不总是)承担风险项目,因为他们对面临的机会过于乐观。 我将在本书中多次提到这个想法--它可能有助于解释人们为什么要打官司,为什么要发动战争,以及为什么要开小企业。
FAILING A TEST
不合格的测试
For many years, I thought that the main point of the curriculum story was what I had learned about my friend Seymour: that his best guess about the future of our project was not informed by what he knew about similar projects. I came off quite well in my telling of the story, in which I had the role of clever questioner and astute psychologist. I only recently realized that I had actually played the roles of chief dunce and inept leader.
多年来,我认为课程故事的重点是我对我的朋友西摩的了解:他对我们项目的未来的最佳猜测并不是由他对类似项目的了解 。我在讲述这个故事时表现得相当好,我在其中扮演了聪明的提问者和精明的心理学家。我最近才意识到,我实际上扮演了首席傻瓜和无能领导人的角色。
The project was my initiative, and it was therefore my responsibility to ensure that it made sense and that major problems were properly discussed by the team, but I failed that test. My problem was no longer the planning fallacy. I was cured of that fallacy as soon as I heard Seymour’s statistical summary. If pressed, I would have said that our earlier estimates had been absurdly optimistic. If pressed further, I would have admitted that we had started the project on faulty premises and that we should at least consider seriously the option of declaring defeat and going home. But nobody pressed me and there was no discussion; we tacitly agreed to go on without an explicit forecast of how long the effort would last. This was easy to do because we had not made such a forecast to begin with. If we had had a reasonable baseline prediction when we started, we would not have gone into it, but we had already invested a great deal of effort—an instance of the sunk-cost fallacy , which we will look at more closely in the next part of the book. It would have been embarrassing for us—especially for me—to give up at that point, and there seemed to be no immediate reason to do so. It is easier to change directions in a crisis, but this was not a crisis, only some new facts about people we did not know. The outside view was much easier to ignore than bad news in our own effort. I can best describe our state as a form of lethargy—an unwillingness to think about what had happened. So we carried on. There was no further attempt at rational planning for the rest of the time I spent as a member of the team—a particularly troubling omission for a team dedicated to teaching rationality. I hope I am wiser today, and I have acquired a habit of looking for the outside view. But it will never be the natural thing to do.
该项目是我的倡议,因此我有责任确保它是有意义的,而且重大问题都经过团队的适当讨论, ,但我没有通过这一测试。我的问题不再是计划谬误。我一听到西摩的统计总结,就被治愈了这种谬误。如果被逼问,我会说我们早先的估计是荒谬的乐观。如果进一步追问,我会承认,我们是在错误的前提下开始这个项目的,我们至少应该认真考虑宣布 ,然后回家的选择。但是没有人催促我,也没有讨论;我们默契地同意继续下去,没有明确预测努力会持续多久。这很容易做到,因为我们一开始就没有做这样的预测。如果我们在开始时有一个合理的基线预测,我们就不会去做,但我们已经投入了大量的精力--一个实例 ,我们将在本书的下一部分更仔细地研究这个问题。对我们来说,尤其是对我来说,在那个时候放弃会很难堪,而且似乎也没有立即放弃的理由。在危机中更容易改变方向,但这不是危机,只是一些我们不了解的人的新事实。外部的观点比 我们自己的努力中的坏消息更容易被忽视。我可以把我们的状态描述为一种昏昏欲睡的状态--不愿意去思考所发生的事情。 所以我们继续前进。在我作为团队成员的其余时间里,没有进一步尝试理性规划--对于一个致力于教授理性的团队来说,这是一个特别令人不安的遗漏。 我希望我今天更明智,我已经养成了一个习惯,即寻找 ,以了解外面的情况。但这永远不会是自然的事情。
SPEAKING OF THE OUTSIDE VIEW
说到 外界的看法
“He’s taking an inside view. He should forget about his own case and look for what happened in other cases.”
"他在了解内部情况。他应该忘记自己的案子,寻找其他案件中发生的情况。"
“She is the victim of a planning fallacy. She’s assuming a best-case scenario, but there are too many different ways for the plan to fail, and she cannot foresee them all.”
"她是规划谬误的受害者。她假设的是最好的情况,但计划有太多不同的失败方式,她无法预见到所有这些情况。"
“Suppose you did not know a thing about this particular legal case, only that it involves a malpractice claim by an individual against a surgeon. What would be your baseline prediction? How many of these cases succeed in court? How many settle? What are the amounts? Is the case we are discussing stronger or weaker than similar claims?”
"假设你 ,对这个特定的法律案件一无所知,只知道它涉及一个人对一名外科医生的不当行为索赔。你的基线预测是什么? 其中有多少案件在法庭上成功?有多少人定居?数量是多少?我们所讨论的案例与类似的主张相比,是更强还是更弱?"
“We are making an additional investment because we do not want to admit failure. This is an instance of the sunk-cost fallacy.”
"我们正在进行额外的投资,因为我们不想承认 的失败。这是一个沉没成本谬误的例子"。
24
24
The Engine of Capitalism
资本主义的引擎
The planning fallacy is only one of the manifestations of a pervasive optimistic bias. Most of us view the world as more benign than it really is, our own attributes as more favorable than they truly are, and the goals we adopt as more achievable than they are likely to be. We also tend to exaggerate our ability to forecast the future, which fosters optimistic overconfidence. In terms of its consequences for decisions, the optimistic bias may well be the most significant of the cognitive biases. Because optimistic bias can be both a blessing and a risk, you should be both happy and wary if you are temperamentally optimistic.
规划谬误只是普遍存在的乐观主义偏见的表现之一。我们中的大多数人都认为世界比实际情况更良性,我们自己的属性比实际情况更有利, ,我们采取的目标比他们可能实现的更多。我们还倾向于夸大我们预测未来的能力,这助长了乐观的过度自信。就其对决策的影响而言,乐观偏见很可能是认知偏见中最重要的一种。 因为乐观偏见既是一种祝福,也是一种风险,如果你是 气质上的乐观,你应该既高兴又警惕。
OPTIMISTS
择要
Optimism is normal, but some fortunate people are more optimistic than the rest of us. If you are genetically endowed with an optimistic bias, you hardly need to be told that you are a lucky person— you already feel fortunate . An optimistic attitude is largely inherited, and it is part of a general disposition for well-being, which may also include a preference for seeing the bright side of everything . If you were allowed one wish for your child, seriously consider wishing him or her optimism. Optimists are normally cheerful and happy, and therefore popular; they are resilient in adapting to failures and hardships, their chances of clinical depression are reduced, their immune system is stronger, they take better care of their health, they feel healthier than others and are in fact likely to live longer. A study of people who exaggerate their expected life span beyond actuarial predictions showed that they work longer hours, are more optimistic about their future income, are more likely to remarry after divorce (the classic “triumph of hope over experience” ), and are more prone to bet on individual stocks. Of course, the blessings of optimism are offered only to individuals who are only mildly biased and who are able to “accentuate the positive” without losing track of reality.
乐观是正常的,但有些幸运的人比我们其他人更乐观。 如果你在遗传上有乐观的偏向,你几乎不需要被告知你是一个幸运的人-- 你已经感到幸运了 。乐观的态度在很大程度上是遗传的,它是对幸福的普遍倾向的一部分,这也可能包括对看到 一切光明面 的偏好 。如果允许你为你的孩子许一个愿望,请认真考虑祝他或她乐观。 乐观的人通常是开朗和快乐的,因此很受欢迎;他们在适应失败和困难方面有弹性,他们患临床抑郁症的机会减少,他们的免疫系统更强大,他们更好地照顾自己的健康,他们感到比其他人更健康 ,事实上很可能活得更长。一项对 ,将自己的预期寿命夸大到超出精算预测的人的研究表明,他们工作时间更长,对自己的未来收入更乐观,离婚后更有可能再婚(典型的 "希望战胜经验") ,并且更倾向于押注个人股票。当然,乐观主义的祝福 ,只提供给那些只有轻微偏见的个人,他们能够 "强调积极的一面",而不失去现实。
Optimistic individuals play a disproportionate role in shaping our lives. Their decisions make a difference; they are the inventors, the entrepreneurs, the political and military leaders—not average people. They got to where they are by seeking challenges and taking risks. They are talented and they have been lucky, almost certainly luckier than they acknowledge. They are probably optimistic by temperament; a survey of founders of small businesses concluded that entrepreneurs are more sanguine than midlevel managers about life in general. Their experiences of success have confirmed their faith in their judgment and in their ability to control events. Their self-confidence is reinforced by the admiration of others . This reasoning leads to a hypothesis: the people who have the greatest influence on the lives of others are likely to be optimistic and overconfident, and to take more risks than they realize.
乐观的人在塑造我们的生活方面发挥着不成比例的作用。 他们的决定会带来改变;他们是发明家、企业家、政治和军事领导人,而不是普通人。 他们通过寻求挑战和 承担风险而达到他们的目的。他们有才华,他们很幸运,几乎可以肯定比他们承认的更幸运。他们可能在性情上很乐观;一项对小企业创始人的调查得出的结论是,企业家 比中层管理人员 对一般的生活 更加乐观 。他们的成功经验证实了他们对自己的判断力和控制事件的能力的信心。 他们的自信心因 他人的钦佩 而得到加强。这种推理导致了一个假设: 对他人生活影响 最大的人很可能是乐观和过度自信的,并且比他们意识到的更多风险。
The evidence suggests that an optimistic bias plays a role—sometimes the dominant role—whenever individuals or institutions voluntarily take on significant risks. More often than not, risk takers underestimate the odds they face, and do not invest sufficient effort to find out what the odds are. Because they misread the risks, optimistic entrepreneurs often believe they are prudent, even when they are not. Their confidence in their future success sustains a positive mood that helps them obtain resources from others, raise the morale of their employees, and enhance their prospects of prevailing. When action is needed, optimism, even of the mildly delusional variety, may be a good thing.
证据表明,每当个人或机构自愿承担 重大风险时,乐观的偏见就会发挥作用,有时甚至是主导作用。更常见的情况是,风险承担者低估了他们面临的几率,并且没有投入足够的精力去了解几率是多少。 由于他们误读了风险,乐观的企业家往往认为他们是谨慎的,即使他们不是。他们对未来成功的信心维持了一种积极的情绪,帮助他们从别人那里获得资源,提高员工的士气 ,并增强他们的胜利前景。当需要采取行动时,乐观主义,即使是轻微的妄想,也可能是一件好事。
ENTREPRENEURIAL DELUSIONS
创业妄想
The chances that a small business will survive for five years in the United States are about 35%. But the individuals who open such businesses do not believe that the statistics apply to them. A survey found that American entrepreneurs tend to believe they are in a promising line of business: their average estimate of the chances of success for “any business like yours” was 60%—almost double the true value. The bias was more glaring when people assessed the odds of their own venture. Fully 81% of the entrepreneurs put their personal odds of success at 7 out of 10 or higher, and 33% said their chance of failing was zero .
在美国,一个小企业能够存活五年的机会约为35%。但开办这种企业的人并不相信这些统计数据适用于他们。一项调查发现, 美国企业家倾向于相信他们从事的是有前途的行业:他们对 "任何像你这样的企业 "的成功几率 ,平均估计为60%--几乎是真实值的两倍。当人们评估自己企业的几率时,这种偏见更加明显。有81%的企业家认为他们个人的成功几率为10分之7或更高,33%表示他们 失败的机会为零 。
The direction of the bias is not surprising. If you interviewed someone who recently opened an Italian restaurant, you would not expect her to have underestimated her prospects for success or to have a poor view of her ability as a restaurateur. But you must wonder: Would she still have invested money and time if she had made a reasonable effort to learn the odds—or, if she did learn the odds (60% of new restaurants are out of business after three years), paid attention to them? The idea of adopting the outside view probably didn’t occur to her.
偏差的方向并不令人惊讶。如果你采访了一个最近开了一家意大利餐馆的人,你不会想到她会低估自己的成功前景,或者对自己作为餐馆老板的能力有不好的看法。 但你一定会想:如果她做出合理的努力来了解赔率--或者,如果她确实了解了赔率 (60%的新餐馆在三年后倒闭),关注了这些赔率,她还会投入资金和时间吗?她可能没有想到采用外部观点的想法。
One of the benefits of an optimistic temperament is that it encourages persistence in the face of obstacles. But persistence can be costly. An impressive series of studies by Thomas Åstebro sheds light on what happens when optimists receive bad news. He drew his data from a Canadian organization—the Inventor’s Assistance Program—which collects a small fee to provide inventors with an objective assessment of the commercial prospects of their idea. The evaluations rely on careful ratings of each invention on 37 criteria, including need for the product, cost of production, and estimated trend of demand. The analysts summarize their ratings by a letter grade, where D and E predict failure—a prediction made for over 70% of the inventions they review. The forecasts of failure are remarkably accurate: only 5 of 411 projects that were given the lowest grade reached commercialization, and none was successful.
乐观气质的好处之一是,它鼓励人们在面对障碍时坚持不懈。 但是,坚持可能要付出代价。 Thomas Åstebro的一系列令人印象深刻的研究揭示了当乐观主义者收到 坏消息时会发生什么。他的数据来自加拿大的一个组织--发明家援助计划--该组织收取少量费用,为发明家提供对其想法的商业前景的客观评估。这些评估依赖于对每项发明的37项标准的仔细评级,包括对产品的需求、生产成本和估计的需求趋势。分析师们通过字母等级来总结他们的评级 ,其中D和E预示着失败--这是对他们审查的70%以上的发明的预测。对失败的预测非常准确:411个被评为 最低等级 的项目中只有5个达到商业化,而且没有一个成功。
Discouraging news led about half of the inventors to quit after receiving a grade that unequivocally predicted failure. However, 47% of them continued development efforts even after being told that their project was hopeless, and on average these persistent (or obstinate) individuals doubled their initial losses before giving up. Significantly, persistence after discouraging advice was relatively common among inventors who had a high score on a personality measure of optimism—on which inventors generally scored higher than the general population. Overall, the return on private invention was small, “lower than the return on private equity and on high-risk securities.” More generally, the financial benefits of self-employment are mediocre: given the same qualifications, people achieve higher average returns by selling their skills to employers than by setting out on their own. The evidence suggests that optimism is widespread, stubborn, and costly .
令人沮丧的消息导致大约一半的发明家在收到明确预示失败的成绩后退出。然而, ,其中47%的人在被告知他们的项目无望后仍继续进行开发工作,平均而言,这些坚持(或固执)的人在放弃前将其最初的损失翻倍。 值得注意的是,在那些在乐观主义的人格测量中得分较高的发明者中,坚持下去的情况相对普遍。总体而言,私人发明的回报很小,"低于私人股权和高风险证券的回报"。更为普遍的是,自营职业的经济效益平平:在相同的资格条件下,人们向雇主出售自己的技能比自己创业获得的平均回报更高。证据表明,乐观主义是 ,很 普遍,很顽固,代价很 高。
Psychologists have confirmed that most people genuinely believe that they are superior to most others on most desirable traits—they are willing to bet small amounts of money on these beliefs in the laboratory. In the market, of course, beliefs in one’s superiority have significant consequences. Leaders of large businesses sometimes make huge bets in expensive mergers and acquisitions, acting on the mistaken belief that they can manage the assets of another company better than its current owners do. The stock market commonly responds by downgrading the value of the acquiring firm, because experience has shown that efforts to integrate large firms fail more often than they succeed. The misguided acquisitions have been explained by a “hubris hypothesis” : the executives of the acquiring firm are simply less competent than they think they are.
心理学家 ,证实了大多数人真正相信自己在大多数理想的特征上优于其他人--他们愿意在实验室里为这些信念赌上 少量 的钱。 当然,在市场上,对自己优越性的信念有重大的影响。大企业的领导人有时会在昂贵的 合并和收购中下大赌注,因为他们错误地认为他们能比另一家公司的现有所有者更好地管理其资产。股票市场通常会以降低收购公司的价值作为回应,因为经验表明,整合大公司的努力往往失败多于成功。这些被误导的收购被 "傲慢假说 "所解释 :收购公司的高管只是比他们自己认为的能力差。
The economists Ulrike Malmendier and Geoffrey Tate identified optimistic CEOs by the amount of company stock that they owned personally and observed that highly optimistic leaders took excessive risks. They assumed debt rather than issue equity and were more likely than others to “overpay for target companies and undertake value-destroying mergers .” Remarkably, the stock of the acquiring company suffered substantially more in mergers if the CEO was overly optimistic by the authors’ measure. The stock market is apparently able to identify overconfident CEOs. This observation exonerates the CEOs from one accusation even as it convicts them of another: the leaders of enterprises who make unsound bets do not do so because they are betting with other people’s money. On the contrary, they take greater risks when they personally have more at stake. The damage caused by overconfident CEOs is compounded when the business press anoints them as celebrities; the evidence indicates that prestigious press awards to the CEO are costly to stockholders. The authors write, “We find that firms with award-winning CEOs subsequently underperform, in terms both of stock and of operating performance. At the same time, CEO compensation increases, CEOs spend more time on activities outside the company such as writing books and sitting on outside boards, and they are more likely to engage in earnings management .”
经济学家Ulrike Malmendier和Geoffrey Tate通过他们个人拥有的公司股票数量来确定乐观的首席执行官,并观察到高度乐观的领导人承担了过度的风险。 他们承担债务而不是发行股票,并且比其他人更有可能 "为目标公司支付过高的价格 ,并进行 破坏价值的兼并 。"值得注意的是,如果按照作者的衡量标准,CEO过于乐观,那么收购公司的股票在兼并中受到的影响要大得多。 股票市场显然能够识别出过于自信的CEO。这一观察结果免除了对首席执行官们的一项指控,但同时也给他们定下了另一项罪名:那些进行不健全赌注的企业领导人 ,因为他们是用别人的钱进行赌注。相反,当他们个人拥有更多利益时,他们会承担更大的风险。当商业媒体将首席执行官任命为名人时,过度自信的首席执行官所造成的损害就更大了;证据表明,著名的媒体对首席执行官的奖励对股票持有人来说是有代价的。作者写道:"我们发现,拥有获奖的 首席执行官的公司随后在股票和经营业绩方面都表现不佳。同时,首席执行官的报酬增加,首席执行官花更多的时间在公司以外的活动上,如写书和参加外部董事会,他们更有可能 从事收益管理 。"
Many years ago, my wife and I were on vacation on Vancouver Island, looking for a place to stay. We found an attractive but deserted motel on a little-traveled road in the middle of a forest. The owners were a charming young couple who needed little prompting to tell us their story. They had been schoolteachers in the province of Alberta; they had decided to change their life and used their life savings to buy this motel, which had been built a dozen years earlier. They told us without irony or self-consciousness that they had been able to buy it cheap, “because six or seven previous owners had failed to make a go of it.” They also told us about plans to seek a loan to make the establishment more attractive by building a restaurant next to it. They felt no need to explain why they expected to succeed where six or seven others had failed. A common thread of boldness and optimism links businesspeople, from motel owners to superstar CEOs.
许多年前,我和妻子在温哥华岛度假,寻找住宿的地方。 ,我们在森林中一条人迹罕至的道路上找到了一家有吸引力但却荒废的汽车旅馆。店主是一对迷人的年轻夫妇,他们不需要什么提示就能告诉我们他们的故事。 他们曾是阿尔伯塔省的学校教师;他们决定改变他们的 ,用他们一生的积蓄买下了这家十几年前建成的汽车旅馆。他们不无讽刺地告诉我们, ,也不自觉地告诉我们,他们能够便宜地买下它,"因为之前的六、七个主人都没能把它做起来"。他们还告诉我们,他们计划寻求贷款,通过在旁边建一个餐厅,使该机构更具吸引力。 他们觉得没有必要解释为什么他们期望在其他六七个人失败的地方取得成功。大胆和乐观的共同主线将 商人联系起来,从汽车旅馆老板到超级明星CEO。
The optimistic risk taking of entrepreneurs surely contributes to the economic dynamism of a capitalistic society, even if most risk takers end up disappointed. However, Marta Coelho of the London School of Economics has pointed out the difficult policy issues that arise when founders of small businesses ask the government to support them in decisions that are most likely to end badly. Should the government provide loans to would-be entrepreneurs who probably will bankrupt themselves in a few years? Many behavioral economists are comfortable with the “libertarian paternalistic” procedures that help people increase their savings rate beyond what they would do on their own. The question of whether and how government should support small business does not have an equally satisfying answer.
企业家乐观的风险承担肯定有助于资本主义社会的经济活力,即使大多数风险承担者最终都会失望。 然而,伦敦经济学院的Marta Coelho指出,当小企业的创始人要求政府支持他们做出 ,而这些决定很可能以失败告终时,就会出现困难的政策问题。政府是否应该向那些可能在几年内就会破产的未来企业家提供贷款?许多行为经济学家对 "自由主义家长式 "的程序感到满意,这些程序帮助人们提高储蓄率,超过他们自己会做的。政府是否应该以及如何支持小型 ,这个问题没有一个同样令人满意的答案。
COMPETITION NEGLECT
忽略了竞争
It is tempting to explain entrepreneurial optimism by wishful thinking, but emotion is only part of the story. Cognitive biases play an important role, notably the System 1 feature WYSIATI.
用一厢情愿来解释企业家的乐观情绪是很诱人的,但情绪只是故事的一部分。 认知偏差起着重要的作用,特别是系统1的特征WYSIATI。
We focus on our goal, anchor on our plan, and neglect relevant base rates, exposing ourselves to the planning fallacy.
我们专注于我们的目标,锚定在我们的计划上,而忽略了相关的基本费率,使自己暴露在计划的谬误之下。
We focus on what we want to do and can do, neglecting the plans and skills of others.
我们专注于 我们想做的和能做的,而忽视了其他人的计划和技能。
Both in explaining the past and in predicting the future, we focus on the causal role of skill and neglect the role of luck. We are therefore prone to an illusion of control .
在解释过去和预测未来时,我们都把重点放在技能的因果作用上,而忽略了运气的作用。因此,我们很容易产生一种 控制的错觉 。
We focus on what we know and neglect what we do not know, which makes us overly confident in our beliefs.
我们专注于我们知道的东西,而忽略了我们不知道的东西,这使我们对自己的信念过于自信。
The observation that “90% of drivers believe they are better than average” is a well-established psychological finding that has become part of the culture, and it often comes up as a prime example of a more general above average effect. However, the interpretation of the finding has changed in recent years, from self-aggrandizement to a cognitive bias . Consider these two questions:
观察到 "90%的司机认为 ,他们比平均水平要好",这是一个公认的心理学发现,已经成为文化的一部分,它经常作为一个更普遍的高于 平均水平效应的典型例子出现。然而,近年来对这一发现的解释已经发生了变化,从 自我吹嘘到认知偏见 。 考虑到这两个问题。
Are you a good driver?
你是个好司机吗?
Are you better than average as a driver?
作为 ,你是否比一般的司机好?
The first question is easy and the answer comes quickly: most drivers say yes. The second question is much harder and for most respondents almost impossible to answer seriously and correctly, because it requires an assessment of the average quality of drivers. At this point in the book it comes as no surprise that people respond to a difficult question by answering an easier one. They compare themselves to the average without ever thinking about the average. The evidence for the cognitive interpretation of the above-average effect is that when people are asked about a task they find difficult (for many of us this could be “Are you better than average in starting conversations with strangers?”), they readily rate themselves as below average. The upshot is that people tend to be overly optimistic about their relative standing on any activity in which they do moderately well.
第一个问题很简单,答案很快就出来了:大多数司机说是。第二个问题要难得多,对大多数受访者来说,几乎不可能认真、正确地回答,因为它需要对司机的平均素质进行评估。在本书的这一点上,人们通过回答一个较容易的问题来回应一个困难的问题,这并不奇怪。他们 ,将自己与平均水平进行比较,却从未考虑过平均水平。对高于平均水平效应的认知解释的证据是,当人们被问及他们认为困难的任务时(对我们中的许多人来说,这可能是 "你在与陌生人开始谈话时比平均水平好吗?"),他们很容易将自己评为低于平均水平。其结果是,人们倾向于 ,对自己在任何活动中的相对地位过度乐观,而他们在这些活动中做得适度好。
I have had several occasions to ask founders and participants in innovative start-ups a question: To what extent will the outcome of your effort depend on what you do in your firm? This is evidently an easy question; the answer comes quickly and in my small sample it has never been less than 80%. Even when they are not sure they will succeed, these bold people think their fate is almost entirely in their own hands. They are surely wrong: the outcome of a start-up depends as much on the achievements of its competitors and on changes in the market as on its own efforts. However, WY SIATI plays its part, and entrepreneurs naturally focus on what they know best—their plans and actions and the most immediate threats and opportunities, such as the availability of funding. They know less about their competitors and therefore find it natural to imagine a future in which the competition plays little part.
我曾多次向创新型初创企业的创始人和参与者提出一个问题:你的努力成果在多大程度上取决于你在公司的工作?这显然是一个简单的问题;答案很快就出来了,在我的小样本中,答案从未低于80%。 即使他们不确定自己会成功,这些大胆的人也认为他们的命运几乎完全掌握在自己手中。他们肯定是错的:一家新创公司的结果既取决于竞争对手的成就和市场的变化,也取决于自身的努力。 然而,WY SIATI发挥了它的作用,企业家们自然会关注他们最熟悉的东西--他们的计划和行动, 最直接的威胁和机会,如资金的可用性。他们对自己的竞争对手了解较少,因此发现很自然地想象一个竞争者几乎不发挥作用的未来。
Colin Camerer and Dan Lovallo, who coined the concept of competition neglect, illustrated it with a quote from the then chairman of Disney Studios. Asked why so many expensive big-budget movies are released on the same days (such as Memorial Day and Independence Day), he replied:
科林-卡梅勒和丹-洛瓦洛提出了 "忽视竞争 "的概念,他们用当时迪斯尼工作室主席的一句话来说明。当被问及为什么这么多昂贵的大预算 电影在同一天上映(如阵亡将士纪念日和独立日)时,他回答说。
Hubris. Hubris. If you only think about your own business, you think, “I’ve got a good story department, I’ve got a good marketing department, we’re going to go out and do this.” And you don’t think that everybody else is thinking the same way. In a given weekend in a year you’ll have five movies open, and there’s certainly not enough people to go around.
傲慢。傲慢。如果你只考虑你自己的业务,你会想,"我有一个好的故事部门,我有一个好的营销部门,我们 ,出去做这个。"而且你不认为其他人也有同样的想法。在一年中的某个周末,你会有五部电影开演, ,当然没有足够的人去看。
The candid answer refers to hubris, but it displays no arrogance, no conceit of superiority to competing studios. The competition is simply not part of the decision, in which a difficult question has again been replaced by an easier one. The question that needs an answer is this: Considering what others will do, how many people will see our film? The question the studio executives considered is simpler and refers to knowledge that is most easily available to them: Do we have a good film and a good organization to market it? The familiar System 1 processes of WY SIATI and substitution produce both competition neglect and the above-average effect. The consequence of competition neglect is excess entry: more competitors enter the market than the market can profitably sustain, so their average outcome is a loss . The outcome is disappointing for the typical entrant in the market, but the effect on the economy as a whole could well be positive. In fact, Giovanni Dosi and Dan Lovallo call entrepreneurial firms that fail but signal new markets to more qualified competitors “optimistic martyrs”—good for the economy but bad for their investors.
这个坦率的答案指的是傲慢,但它没有表现出傲慢,没有自负地认为自己比竞争对手的工作室优越。 竞争根本不是决定的一部分,在这个决定中,一个困难的问题再次被一个容易的问题所取代。需要回答的问题是:考虑到其他人会做什么,有多少人会看到我们的 影片?工作室主管们考虑的问题比较简单,指的是他们最容易获得的知识:我们是否有一部好的电影和一个好的组织来推销它?熟悉的系统1的WY SIATI和替代过程产生了竞争忽视和高于平均水平的效果。 竞争忽视的后果是过度进入:更多的竞争者进入市场 ,超过了市场可以承受的利润,所以他们的 平均结果是一个损失 。对于市场上的典型进入者来说,结果是令人失望的,但对整个经济的影响很可能是积极的。 事实上,乔瓦尼-多西和丹-洛瓦洛把那些失败但向更有资格的竞争对手发出新市场信号的创业公司称为 "乐观的殉道者"--对经济有好处,但对他们的 投资者不利。
OVERCONFIDENCE
过渡性
For a number of years, professors at Duke University conducted a survey in which the chief financial officers of large corporations estimated the returns of the Standard & Poor’s index over the following year. The Duke scholars collected 11,600 such forecasts and examined their accuracy. The conclusion was straightforward: financial officers of large corporations had no clue about the short-term future of the stock market; the correlation between their estimates and the true value was slightly less than zero! When they said the market would go down, it was slightly more likely than not that it would go up. These findings are not surprising. The truly bad news is that the CFOs did not appear to know that their forecasts were worthless.
多年来,杜克大学的教授们进行了一项调查,大公司的首席财务官对标准普尔指数下一年的回报进行了估计。 杜克大学的学者们收集了11,600份这样的预测,并检查了它们的准确性。 结论很直接:大公司的财务人员对股市的短期前景毫无头绪,他们的估计与真实价值之间的相关性略低于零!他们的估计是:"我们的股票是一个很好的例子。当他们说市场会下跌的时候,市场上涨的可能性比不上涨的可能性略大。这些发现并不令人惊讶。真正的坏消息是,首席财务官们似乎并不知道他们的预测毫无价值。
In addition to their best guess about S&P returns, the participants provided two other estimates: a value that they were 90% sure would be too high, and one that they were 90% sure would be too low. The range between the two values is called an “80% confidence interval” and outcomes that fall outside the interval are labeled “surprises.” An individual who sets confidence intervals on multiple occasions expects about 20% of the outcomes to be surprises. As frequently happens in such exercises, there were far too many surprises; their incidence was 67%, more than 3 times higher than expected. This shows that CFOs were grossly overconfident about their ability to forecast the market. Overconfidence is another manifestation of WYSIATI: when we estimate a quantity, we rely on information that comes to mind and construct a coherent story in which the estimate makes sense. Allowing for the information that does not come to mind—perhaps because one never knew it—is impossible.
除了他们对标准普尔收益的 最佳猜测外,参与者还提供了另外两个估计值:一个是他们90%确定会过高的值,另一个是他们90%确定会过低的值。这两个值之间的范围被称为 "80%的置信区间",超出区间的结果被标记为 "意外"。 一个在多个场合设置置信区间的人预计大约有 20%的结果是意外。正如在这种练习中经常发生的那样,意外太多了;其发生率为67%,比预期高3倍多。这表明,首席财务官对他们预测市场的能力严重过度自信。 过度自信 是所见即所得的另一种表现:当我们估计一个数量时,我们依靠想到的信息, ,构建一个连贯的故事,在其中估计有意义。允许那些没有出现在脑海中的信息--也许是因为人们从未了解过--是不可能的。
The authors calculated the confidence intervals that would have reduced the incidence of surprises to 20%. The results were striking. To maintain the rate of surprises at the desired level, the CFOs should have said, year after year, “There is an 80% chance that the S&P return next year will be between −10% and +30%.” The confidence interval that properly reflects the CFOs’ knowledge (more precisely, their ignorance) is more than 4 times wider than the intervals they actually stated.
作者计算了将意外的发生率降低到20%的置信区间。结果是惊人的。为了将意外率维持在理想的水平,首席财务官们应该年复一年地说, ,"明年标准普尔指数的回报率有80%的可能性在-10%和+30%之间"。正确反映首席财务官知识(更确切地说,他们的无知)的置信区间比他们实际陈述的区间宽4倍以上。
Social psychology comes into the picture here, because the answer that a truthful CFO would offer is plainly ridiculous. A CFO who informs his colleagues that “there is a good chance that the S&P returns will be between −10% and +30%” can expect to be laughed out of the room. The wide confidence interval is a confession of ignorance, which is not socially acceptable for someone who is paid to be knowledgeable in financial matters. Even if they knew how little they know, the executives would be penalized for admitting it. President Truman famously asked for a “one-armed economist” who would take a clear stand; he was sick and tired of economists who kept saying, “On the other hand …”
社会心理学在这里出现了,因为一个诚实的CFO会提供的答案显然是荒谬的。一位首席财务官告诉 他的同事,"标准普尔指数的回报率很有可能在-10%和+30%之间",他可能会被人笑掉大牙。宽大的置信区间是对无知的承认,这对一个拿着工资在金融事务方面有知识的人来说是不被社会接受的。 即使他们知道自己知道的有多少,高管们也会因为承认这一点而受到惩罚。 杜鲁门总统有一个著名的要求,那就是要有一个 "独臂的经济学家 "来明确立场;他已经厌倦了那些不断说 "从另一方面来说... "的经济学家。
Organizations that take the word of overconfident experts can expect costly consequences. The study of CFOs showed that those who were most confident and optimistic about the S&P index were also overconfident and optimistic about the prospects of their own firm, which went on to take more risk than others. As Nassim Taleb has argued, inadequate appreciation of the uncertainty of the environment inevitably leads economic agents to take risks they should avoid. However, optimism is highly valued, socially and in the market; people and firms reward the providers of dangerously misleading information more than they reward truth tellers. One of the lessons of the financial crisis that led to the Great Recession is that there are periods in which competition, among experts and among organizations, creates powerful forces that favor a collective blindness to risk and uncertainty.
听信过于自信的专家的话的组织可以预期代价高昂的后果。 对首席财务官的研究表明,那些对标准普尔指数最自信和最乐观的人也对自己公司的前景过于自信和乐观 ,从而比其他人承担更多的风险。正如纳西姆-塔勒布(Nassim Taleb)所言,对环境的不确定性认识不足,不可避免地导致经济主体承担他们应该避免的风险。然而,乐观主义在社会上和市场上都受到高度重视;人们和公司对危险的误导性信息的提供者的奖励超过对讲真话者的奖励。 导致大衰退的金融危机的教训之一是,在某些时期,专家之间和组织之间的竞争会产生强大的力量,有利于集体对风险和不确定性视而不见。
The social and economic pressures that favor overconfidence are not restricted to financial forecasting. Other professionals must deal with the fact that an expert worthy of the name is expected to display high confidence. Philip Tetlock observed that the most overconfident experts were the most likely to be invited to strut their stuff in news shows. Overconfidence also appears to be endemic in medicine. A study of patients who died in the ICU compared autopsy results with the diagnosis that physicians had provided while the patients were still alive. Physicians also reported their confidence. The result: “clinicians who were ‘completely certain’ of the diagnosis antemortem were wrong 40% of the time.” Here again, expert overconfidence is encouraged by their clients: “Generally, it is considered a weakness and a sign of vulnerability for clinicians to appear unsure. Confidence is valued over uncertainty and there is a prevailing censure against disclosing uncertainty to patients .” Experts who acknowledge the full extent of their ignorance may expect to be replaced by more confident competitors, who are better able to gain the trust of clients. An unbiased appreciation of uncertainty is a cornerstone of rationality—but it is not what people and organizations want. Extreme uncertainty is paralyzing under dangerous circumstances, and the admission that one is merely guessing is especially unacceptable when the stakes are high. Acting on pretended knowledge is often the preferred solution.
有利于过度自信的社会和经济压力并不 ,只限于金融预测。其他专业人员必须应对这样一个事实: ,一个名副其实的专家应该表现出高度的自信。菲利普-泰特洛克观察到,最自信的专家最有可能被邀请在新闻节目中大显身手。过度自信似乎也是医学界的普遍现象。一项对死于重症监护室的病人的研究将尸检结果与医生在病人还在世时提供的诊断进行了比较, 。医生们也报告了他们的信心。结果是:"'完全确定' 死前诊断 的临床医生有40%的时间是错误的"。在这里,专家的过度自信再次得到了客户的鼓励:"一般来说,临床医生表现出不自信被认为是一种弱点和脆弱的表现。 自信比不确定更受重视,而且有一种普遍的谴责 ,反对 向病人透露不确定 。"承认自己完全无知的专家可能会被更有信心的竞争者所取代,因为他们能更好地获得客户的信任。 对不确定性的无偏见理解是理性的基石,但这并不是人们和组织所希望的。在危险的情况下,极端的不确定性会使人瘫痪, ,当风险很大时,承认自己只是在猜测,尤其令人无法接受。根据假装的知识采取行动,往往是首选的解决方案。
When they come together, the emotional, cognitive, and social factors that support exaggerated optimism are a heady brew, which sometimes leads people to take risks that they would avoid if they knew the odds. There is no evidence that risk takers in the economic domain have an unusual appetite for gambles on high stakes; they are merely less aware of risks than more timid people are. Dan Lovallo and I coined the phrase “bold forecasts and timid decisions” to describe the background of risk taking .
当它们结合在一起时,支持夸张的乐观主义的情感、认知和社会因素是一种令人陶醉的混合物,这有时会导致人们去冒险,如果他们知道几率的话,他们会避免这种风险。没有证据表明 经济领域的风险承担者对高赌注的赌博有不寻常的胃口;他们只是比更胆小的人对风险的认识更少。 丹-洛瓦洛和我创造了 "大胆的预测和胆小的决定 "这个短语来描述 风险承担的背景 。
The effects of high optimism on decision making are, at best, a mixed blessing, but the contribution of optimism to good implementation is certainly positive. The main benefit of optimism is resilience in the face of setbacks. According to Martin Seligman, the founder of positive psychology, an “optimistic explanation style” contributes to resilience by defending one’s self-image. In essence, the optimistic style involves taking credit for successes but little blame for failures. This style can be taught, at least to some extent, and Seligman has documented the effects of training on various occupations that are characterized by a high rate of failures, such as cold-call sales of insurance (a common pursuit in pre-Internet days). When one has just had a door slammed in one’s face by an angry homemaker, the thought that “she was an awful woman” is clearly superior to “I am an inept salesperson.” I have always believed that scientific research is another domain where a form of optimism is essential to success: I have yet to meet a successful scientist who lacks the ability to exaggerate the importance of what he or she is doing, and I believe that someone who lacks a delusional sense of significance will wilt in the face of repeated experiences of multiple small failures and rare successes, the fate of most researchers.
高度乐观对决策的影响充其量是喜忧参半,但乐观对良好执行的贡献 ,肯定是积极的。乐观的主要好处是面对挫折时的复原力。根据积极心理学的创始人马丁-塞利格曼(Martin Seligman)的说法,"乐观的解释风格 "通过捍卫一个人的自我形象,有助于提高复原力。这种风格是可以被教授的,至少在某种程度上, ,塞利格曼记录了培训对各种以高失败率为特征的职业的影响,如保险的冷门销售(在互联网之前的时代是一种常见的追求)。当一个人刚刚 ,被一个愤怒的家庭主妇摔门而去时,"她是一个可怕的女人 "的想法显然比 "我是一个无能的销售人员 "要好。我一直认为 ,科学研究是另一个领域,在这个领域中,某种形式的乐观主义对成功至关重要:我还没有见过一个成功的科学家缺乏夸大他或她正在做的事情的重要性的能力,我相信,一个缺乏妄想的意义感的人,在面对多次小失败和罕见的成功的反复经历中会萎缩,这是大多数 研究人员的命运。
THE PREMORTEM: A PARTIAL REMEDY
预检:部分补救措施
Can overconfident optimism be overcome by training? I am not optimistic. There have been numerous attempts to train people to state confidence intervals that reflect the imprecision of their judgments, with only a few reports of modest success. An often cited example is that geologists at Royal Dutch Shell became less overconfident in their assessments of possible drilling sites after training with multiple past cases for which the outcome was known. In other situations, overconfidence was mitigated (but not eliminated) when judges were encouraged to consider competing hypotheses. However, overconfidence is a direct consequence of features of System 1 that can be tamed—but not vanquished. The main obstacle is that subjective confidence is determined by the coherence of the story one has constructed, not by the quality and amount of the information that supports it.
过于自信的乐观情绪可以通过训练来克服吗?我并不乐观。已经有很多人试图训练人们说出反映其判断不精确的置信区间,但只有少数报告取得了一定的成功。一个经常被引用的例子是, 荷兰皇家壳牌公司 的地质学家在接受了过去多个已知结果的案例训练后,对可能的钻探地点的评估 ,变得不那么自信了。在其他情况下,当鼓励法官考虑竞争性假设时,过度自信得到了缓解(但没有消除)。 然而,过度自信是系统1特征的直接后果,它可以被驯服,但不能被征服。主要障碍是,主观信心是由一个人构建的故事的一致性决定的 ,而不是由支持它的信息的质量和数量决定的。
Organizations may be better able to tame optimism and individuals than individuals are. The best idea for doing so was contributed by Gary Klein, my “adversarial collaborator” who generally defends intuitive decision making against claims of bias and is typically hostile to algorithms. He labels his proposal the premortem . The procedure is simple: when the organization has almost come to an important decision but has not formally committed itself, Klein proposes gathering for a brief session a group of individuals who are knowledgeable about the decision. The premise of the session is a short speech: “Imagine that we are a year into the future. We implemented the plan as it now exists. The outcome was a disaster. Please take 5 to 10 minutes to write a brief history of that disaster.”
组织可能比个人更能驯服乐观主义和个人。 这样做的最佳想法是由加里-克莱因贡献的,他是我的 "对抗性合作者",通常为直觉决策辩护,反对偏见的说法,并且通常对算法持敌对态度。 他给自己的建议贴上了 " 预验 "的标签。程序很简单:当组织几乎已经做出了一个重要的决定,但还没有正式承诺时,克莱恩建议召集一群对该决定有了解的人进行一次简短的会议。 会议的前提是一个简短的演讲:"想象一下,我们是在未来一年。 我们实施了现在存在的计划 。结果是一场灾难。请花5到10分钟写出那场灾难的简要历史。"
Gary Klein’s idea of the premortem usually evokes immediate enthusiasm. After I described it casually at a session in Davos, someone behind me muttered, “It was worth coming to Davos just for this!” (I later noticed that the speaker was the CEO of a major international corporation.) The premortem has two main advantages: it overcomes the groupthink that affects many teams once a decision appears to have been made, and it unleashes the imagination of knowledgeable individuals in a much-needed direction.
加里-克莱因的 "预审 "理念通常会立即唤起人们的热情。 在达沃斯的一次会议上,我随口描述了它,我身后有人嘀咕道:"就为了这个,值得来达沃斯!"(后来我注意到,演讲者是一家大型国际公司的首席执行官。) 预检有两个主要的优点:它克服了许多团队一旦出现决定就会影响 的群体思维,并且它释放了有知识的人的想象力,使其朝着一个急需的方向发展。
As a team converges on a decision—and especially when the leader tips her hand—public doubts about the wisdom of the planned move are gradually suppressed and eventually come to be treated as evidence of flawed loyalty to the team and its leaders. The suppression of doubt contributes to overconfidence in a group where only supporters of the decision have a voice. The main virtue of the premortem is that it legitimizes doubts. Furthermore, it encourages even supporters of the decision to search for possible threats that they had not considered earlier. The premortem is not a panacea and does not provide complete protection against nasty surprises, but it goes some way toward reducing the damage of plans that are subject to the biases of WY SIATI and uncritical optimism.
当一个团队在一个决定上趋于一致时--特别是当领导者提示她的时候--对计划行动的智慧的公开怀疑会逐渐被压制,最终 ,被当作对团队及其领导者的忠诚度有缺陷的证据。在一个只有决定的支持者才有发言权的群体中,对怀疑的压制助长了过度自信。 预审的主要优点是使怀疑合法化。 此外,它甚至鼓励决定的支持者去寻找他们之前没有考虑过的可能的威胁。预检 ,并不是万能的,也不能完全避免令人讨厌的意外,但它在一定程度上减少了受WY SIATI的偏见和不加批判的乐观主义影响的计划的损害。
SPEAKING OF OPTIMISM
说到乐观主义
“They have an illusion of control. They seriously underestimate the obstacles.”
"他们有一种控制的幻觉。他们严重低估了这些障碍"。
“They seem to suffer from an acute case of competitor neglect.”
"他们似乎患上了严重的竞争者忽视症"。
“This is a case of overconfidence. They seem to believe they know more than they actually do know.”
"这是一个 过度自信的案例。他们似乎认为自己知道的比实际知道的多"。
“We should conduct a premortem session. Someone may come up with a threat we have neglected.”
"我们应该进行一次预验收。 有人可能会提出我们所忽视的威胁。"
25
25
Bernoulli’s Errors
伯努利的错误
One day in the early 1970s, Amos handed me a mimeographed essay by a Swiss economist named Bruno Frey, which discussed the psychological assumptions of economic theory. I vividly remember the color of the cover: dark red. Bruno Frey barely recalls writing the piece, but I can still recite its first sentence: “The agent of economic theory is rational, selfish, and his tastes do not change.”
20世纪70年代初的一天,阿莫斯递给我一篇瑞士经济学家布鲁诺-弗雷的油印文章,其中讨论了经济理论的心理学假设。我清楚地记得封面的颜色: 暗红色。布鲁诺-弗雷几乎不记得写过这篇文章,但我仍能背出它的第一句话:"经济理论的代理人是理性的、自私的,他的品味不会改变。"
I was astonished. My economist colleagues worked in the building next door, but I had not appreciated the profound difference between our intellectual worlds. To a psychologist, it is self-evident that people are neither fully rational nor completely selfish, and that their tastes are anything but stable. Our two disciplines seemed to be studying different species, which the behavioral economist Richard Thaler later dubbed Econs and Humans.
我很惊讶,我的经济学家同事就在隔壁大楼工作,但我没有意识到我们的知识世界之间存在着深刻的差异。对心理学家来说,不言而喻的是,人们既不是 ,也不完全自私,而且他们的品味也不稳定。 我们的两个学科似乎是在研究不同的物种,行为经济学家理查德-塞勒(Richard Thaler)后来将其称为生态人和人类。
Unlike Econs, the Humans that psychologists know have a System 1. Their view of the world is limited by the information that is available at a given moment (WYSIATI), and therefore they cannot be as consistent and logical as Econs. They are sometimes generous and often willing to contribute to the group to which they are attached. And they often have little idea of what they will like next year or even tomorrow. Here was an opportunity for an interesting conversation across the boundaries of the disciplines. I did not anticipate that my career would be defined by that conversation.
与Econs不同,心理学家所认识的Humans有一个系统1。他们对世界的看法受限于某一特定时刻的信息(WYSIATI), ,因此他们不能像Econs那样具有一致性和逻辑性。他们有时很慷慨,往往愿意为他们所依附的团体做出贡献。而且他们往往对明年甚至明天会喜欢什么没什么概念。这里是一个跨越学科界限进行有趣对话的机会。我没有预料到,我的职业生涯会被那次 的谈话所定义。
Soon after he showed me Frey’s article, Amos suggested that we make the study of decision making our next project. I knew next to nothing about the topic, but Amos was an expert and a star of the field, and he said he would coach me. While still a graduate student he had coauthored a textbook, Mathematical Psychology , and he directed me to a few chapters that he thought would be a good introduction.
在他给我看了弗雷的文章后不久,阿莫斯建议我们把决策研究作为我们的下一个项目。我对 这个主题几乎一无所知,但阿莫斯是这个领域的专家和明星,他说他会指导我。当他还是一个研究生的时候,他就与人合著了一本教科书《 数学心理学 》,他指导我看了几个章节,他认为这将是 一个好的介绍。
I soon learned that our subject matter would be people’s attitudes to risky options and that we would seek to answer a specific question: What rules govern people’s choices between different simple gambles and between gambles and sure things?
我很快就知道,我们的主题是人们对风险选择的态度,我们将寻求回答一个具体的问题:在不同的简单赌博之间以及在赌博和确定的事情之间,什么规则支配着人们的选择?
Simple gambles (such as “40% chance to win $300”) are to students of decision making what the fruit fly is to geneticists. Choices between such gambles provide a simple model that shares important features with the more complex decisions that researchers actually aim to understand. Gambles represent the fact that the consequences of choices are never certain. Even ostensibly sure outcomes are uncertain: when you sign the contract to buy an apartment, you do not know the price at which you later may have to sell it, nor do you know that your neighbor’s son will soon take up the tuba. Every significant choice we make in life comes with some uncertainty—which is why students of decision making hope that some of the lessons learned in the model situation will be applicable to more interesting everyday problems. But of course the main reason that decision theorists study simple gambles is that this is what other decision theorists do.
简单的赌博(如 "40%的机会赢得300美元")对决策学生来说就像果蝇对遗传学家一样。这种赌博之间的选择 ,提供了一个简单的模型,与研究人员实际上旨在理解的更复杂的决策有重要的特征。赌博代表了这样一个事实:选择的后果从来都是不确定的。 即使是表面上确定的结果也是不确定的:当你签署购买公寓的合同时,你不知道以后可能要以什么价格出售,你也不 ,你邻居的儿子很快就会去学大号。我们在生活中做出的每一个重要选择都伴随着一些不确定性--这就是为什么研究决策的学生希望在模型情况下学到的一些经验能够适用于更有趣的日常问题。 但当然,决策理论家研究简单赌博的主要原因是,这就是其他决策 理论家所做的。
The field had a theory, expected utility theory, which was the foundation of the rational-agent model and is to this day the most important theory in the social sciences. Expected utility theory was not intended as a psychological model; it was a logic of choice, based on elementary rules (axioms) of rationality. Consider this example:
该领域有一个理论,即预期效用理论,它是理性代理模型的基础,至今仍是社会科学中最重要的理论。 预期效用理论不是作为一个心理学模型;它是一种选择逻辑,基于理性的基本规则(公理)。 考虑一下这个例子。
If you prefer an apple to a banana,
如果你喜欢苹果而不是香蕉。
then
然后
you also prefer a 10% chance to win an apple to a 10% chance to win a banana.
你也喜欢10%的机会赢得一个苹果,而不是10%的机会赢得一个香蕉。
The apple and the banana stand for any objects of choice (including gambles), and the 10% chance stands for any probability. The mathematician John von Neumann, one of the giant intellectual figures of the twentieth century, and the economist Oskar Morgenstern had derived their theory of rational choice between gambles from a few axioms. Economists adopted expected utility theory in a dual role: as a logic that prescribes how decisions should be made, and as a description of how Econs make choices. Amos and I were psychologists, however, and we set out to understand how Humans actually make risky choices, without assuming anything about their rationality.
苹果和香蕉代表任何选择对象(包括赌博),10%的机会代表任何概率。 数学家约翰-冯-诺伊曼(John von Neumann)是二十世纪巨大的知识分子之一,经济学家奥斯卡-莫根斯坦(Oskar Morgenstern)从几条公理中得出了他们的赌博理性选择理论 。经济学家以双重身份采用了预期效用理论:作为一种逻辑,规定了应该如何做出决策,以及作为对生态环境如何做出选择的描述。然而,阿莫斯和我是心理学家, ,我们着手了解人类实际上是如何做出风险选择的,而不对他们的理性做任何假设。
We maintained our routine of spending many hours each day in conversation, sometimes in our offices, sometimes at restaurants, often on long walks through the quiet streets of beautiful Jerusalem. As we had done when we studied judgment, we engaged in a careful examination of our own intuitive preferences. We spent our time inventing simple decision problems and asking ourselves how we would choose. For example:
我们保持了我们的惯例,在每个 ,有时在办公室,有时在餐馆,经常在美丽的耶路撒冷安静的街道上长时间散步,进行交谈。正如我们在研究判断力时所做的那样,我们对自己的直觉偏好进行了仔细检查。 我们花时间发明了简单的决策问题,并问自己会如何选择。比如说。
Which do you prefer?
你喜欢哪一个?
A. Toss a coin. If it comes up heads you win $100, and if it comes up tails you win nothing.
A.抛出一个 硬币。如果结果是正面,你赢了100美元,如果结果是反面,你什么都没赢。
B. Get $46 for sure.
B. 肯定能得到46美元。
We were not trying to figure out the most rational or advantageous choice; we wanted to find the intuitive choice, the one that appeared immediately tempting. We almost always selected the same option. In this example, both of us would have picked the sure thing, and you probably would do the same. When we confidently agreed on a choice, we believed—almost always correctly, as it turned out—that most people would share our preference, and we moved on as if we had solid evidence. We knew, of course, that we would need to verify our hunches later, but by playing the roles of both experimenters and subjects we were able to move quickly.
我们并不是要找出最理性或最有利的选择;我们想找到直观的选择,即看起来很诱人的选择。 我们几乎总是选择相同的选项。 在这个例子中,我们两个人都会选择肯定的东西,你可能也会这样做 。当我们自信地同意一个选择时,我们相信--几乎总是正确的,因为事实证明--大多数人都会同意我们的偏好,我们继续前进,好像我们有坚实的证据。 当然,我们知道,我们需要在以后验证我们的直觉,但通过扮演实验者和受试者的角色,我们能够快速前进。
Five years after we began our study of gambles, we finally completed an essay that we titled “Prospect Theory: An Analysis of Decision under Risk.” Our theory was closely modeled on utility theory but departed from it in fundamental ways. Most important, our model was purely descriptive, and its goal was to document and explain systematic violations of the axioms of rationality in choices between gambles. We submitted our essay to Econometrica , a journal that publishes significant theoretical articles in economics and in decision theory. The choice of venue turned out to be important; if we had published the identical paper in a psychological journal, it would likely have had little impact on economics. However, our decision was not guided by a wish to influence economics; Econometrica just happened to be where the best papers on decision making had been published in the past, and we were aspiring to be in that company. In this choice as in many others, we were lucky. Prospect theory turned out to be the most significant work we ever did, and our article is among the most often cited in the social sciences. Two years later, we published in Science an account of framing effects: the large changes of preferences that are sometimes caused by inconsequential variations in the wording of a choice problem.
在我们开始研究赌博的五年后, ,我们终于完成了一篇题为 "前景理论:风险下的决策分析 "的文章。我们的理论紧密地以效用理论为模型,但在根本上偏离了它。 最重要的是,我们的模型是纯粹的描述性的,其目标是记录和解释在赌博的选择中对理性公理的系统性违反。 我们将我们的论文提交给了《 计量经济学》 ,该杂志出版了经济学和决策理论方面的重要理论文章。地点的选择原来是很重要的;如果我们在心理学杂志上发表相同的论文,它很可能对经济学没有什么影响。然而,我们的决定并不是出于影响经济学的愿望; 《计量经济学》 恰好是过去发表过最好的决策论文的地方, ,我们渴望成为这样的公司。在这个选择中,正如在其他许多选择中一样,我们是幸运的。前景理论被证明是我们所做的最重要的工作,我们的文章是社会科学中最常被引用的文章之一。两年后,我们在 《科学》 杂志上发表了关于框架效应的说明: ,偏好的巨大变化有时是由选择问题的措辞的无关紧要的变化引起的, 。
During the first five years we spent looking at how people make decisions, we established a dozen facts about choices between risky options. Several of these facts were in flat contradiction to expected utility theory. Some had been observed before, a few were new. Then we constructed a theory that modified expected utility theory just enough to explain our collection of observations. That was prospect theory.
在我们研究人们如何做决定的头五年里,我们建立了十几个关于在风险选项之间做出选择的事实。 其中有几个事实与预期效用理论完全矛盾。一些人以前被观察过,一些人是新的。然后,我们构建了一个理论,对预期效用 理论进行了修改,刚好可以解释我们的观察结果。这就是前景理论。
Our approach to the problem was in the spirit of a field of psychology called psychophysics, which was founded and named by the German psychologist and mystic Gustav Fechner (1801–1887). Fechner was obsessed with the relation of mind and matter. On one side there is a physical quantity that can vary, such as the energy of a light, the frequency of a tone, or an amount of money. On the other side there is a subjective experience of brightness, pitch, or value. Mysteriously, variations of the physical quantity cause variations in the intensity or quality of the subjective experience. Fechner’s project was to find the psychophysical laws that relate the subjective quantity in the observer’s mind to the objective quantity in the material world. He proposed that for many dimensions, the function is logarithmic—which simply means that an increase of stimulus intensity by a given factor (say, times 1.5 or times 10) always yields the same increment on the psychological scale. If raising the energy of the sound from 10 to 100 units of physical energy increases psychological intensity by 4 units, then a further increase of stimulus intensity from 100 to 1,000 will also increase psychological intensity by 4 units.
我们处理这个问题的方法是本着心理学的一个领域,即精神物理学的精神,它是由德国心理学家和神秘主义者古斯塔夫-费希纳(1801-1887)创立和命名的。 费希纳对心与物的关系非常着迷。一边是可以变化的物理量,如 ,如光的能量,音的频率,或金钱的数量。神奇的是,物理量的变化导致主观体验的强度或质量的变化。 Fechner的项目是找到心理物理规律,将观察者头脑中的主观量与物质世界中的客观量联系起来 。他提出,对于许多维度来说,功能是对数的,这只是意味着刺激强度增加一个给定的因素(例如,乘以1.5或乘以10)总是在心理尺度上产生相同的增量。 如果将声音的能量从10个单位的物理能量提高到100个单位,心理强度增加4个单位,那么 ,刺激强度从100到1000的进一步增加,也将增加4个单位的心理强度。
BERNOULLI’S ERROR
伯努利的错误
As Fechner well knew, he was not the first to look for a function that relates psychological intensity to the physical magnitude of the stimulus. In 1738, the Swiss scientist Daniel Bernoulli anticipated Fechner’s reasoning and applied it to the relationship between the psychological value or desirability of money (now called utility ) and the actual amount of money. He argued that a gift of 10 ducats has the same utility to someone who already has 100 ducats as a gift of 20 ducats to someone whose current wealth is 200 ducats. Bernoulli was right, of course: we normally speak of changes of income in terms of percentages, as when we say “she got a 30% raise.” The idea is that a 30% raise may evoke a fairly similar psychological response for the rich and for the poor , which an increase of $100 will not do. As in Fechner’s law, the psychological response to a change of wealth is inversely proportional to the initial amount of wealth, leading to the conclusion that utility is a logarithmic function of wealth. If this function is accurate, the same psychological distance separates $100,000 from $1 million, and $10 million from $100 million .
正如费希纳所知道的,他并不是第一个寻找将心理强度与刺激的物理大小联系起来的函数。1738年,瑞士科学家丹尼尔-伯努利预见到了费希纳的推理,并将其应用于 ,即货币的心理价值或可取性(现在称为 效用 )与货币的实际数量之间的关系。他认为,10个金币的礼物对一个已经拥有100个金币的人来说,与20个金币的礼物对一个目前财富为200个金币的人来说,具有相同的效用。当然,伯努利是对的:我们通常用百分比来谈论收入的变化,如我们说 "她加薪30%"。 这个想法是,加薪30%可能会唤起 富人和穷人 相当相似的心理反应,而增加100美元则不会。正如费希纳定律, ,对财富变化的心理反应与最初的财富数量成反比,从而得出结论,效用是财富的对数函数。如果这个函数是准确的,同样的心理 距离将10万美元与100万美元分开, 1000万美元与1亿美元分开 。
Bernoulli drew on his psychological insight into the utility of wealth to propose a radically new approach to the evaluation of gambles, an important topic for the mathematicians of his day. Prior to Bernoulli, mathematicians had assumed that gambles are assessed by their expected value: a weighted average of the possible outcomes, where each outcome is weighted by its probability. For example, the expected value of:
伯努利利用他对财富效用的心理学洞察力,提出了一种评估赌博的全新方法,这对当时的数学家来说是一个重要的课题。 在伯努利之前,数学家假设赌博是由其期望值来评估的: 可能结果的加权平均数,其中每个结果都按其概率加权。 例如,期望值。
80% chance to win $100 and 20% chance to win $10
80%的机会赢得100美元,20%的机会赢得10美元
is $82 (0.8 × 100 + 0.2 × 10).
是82美元(0.8 × 100 + 0.2 × 10)。
Now ask yourself this question: Which would you prefer to receive as a gift, this gamble or $80 for sure? Almost everyone prefers the sure thing. If people valued uncertain prospects by their expected value, they would prefer the gamble, because $82 is more than $80. Bernoulli pointed out that people do not in fact evaluate gambles in this way.
现在问你自己这个问题:你更愿意收到哪个礼物,这个赌博和80美元的肯定?几乎每个人都更喜欢有把握的事情。如果人们以预期价值来评价不确定的前景, ,他们会更喜欢赌博,因为82美元比80美元多。伯努利指出,人们事实上并不以这种方式评价赌博。
Bernoulli observed that most people dislike risk (the chance of receiving the lowest possible outcome), and if they are offered a choice between a gamble and an amount equal to its expected value they will pick the sure thing. In fact a risk-averse decision maker will choose a sure thing that is less than expected value, in effect paying a premium to avoid the uncertainty. One hundred years before Fechner, Bernoulli invented psychophysics to explain this aversion to risk. His idea was straightforward: people’s choices are based not on dollar values but on the psychological values of outcomes, their utilities. The psychological value of a gamble is therefore not the weighted average of its possible dollar outcomes; it is the average of the utilities of these outcomes, each weighted by its probability.
伯努利观察到,大多数人不喜欢风险(获得最低可能结果的机会),如果让他们在赌博和等于其预期价值的金额之间做出选择,他们会选择肯定的东西。事实上,一个规避风险的决策 者会选择一个低于预期值的确定的东西,实际上是为避免不确定性而支付溢价。在费希纳之前100年,伯努利发明了心理物理学来解释这种对风险的厌恶。 他的想法很直接:人们的选择不是基于美元价值,而是基于结果的心理价值,即他们的效用。因此,一场赌博的心理价值 ,不是其可能的美元结果的加权平均值;它是这些结果的效用的平均值,每个结果都按其概率加权计算。
Table 3
表3
Table 3 shows a version of the utility function that Bernoulli calculated; it presents the utility of different levels of wealth, from 1 million to 10 million. You can see that adding 1 million to a wealth of 1 million yields an increment of 30 utility points, but adding 1 million to a wealth of 9 million increment of 20 utility points, but adding 1 million to a wealth of 9 million adds only 4 points. Bernoulli proposed that the diminishing marginal value of wealth (in the modern jargon) is what explains risk aversion—the common preference that people generally show for a sure thing over a favorable gamble of equal or slightly higher expected value. Consider this choice:
表3 显示了伯努利计算的效用函数的一个版本;它提出了从100万到1000万的不同级别财富的效用 。你可以看到,在100万的财富上增加100万,会产生30个效用点的 ,但在900万的财富上增加100万,会增加20个效用点,但在900万的财富上增加100万,只增加4点。伯努利提出,财富的边际价值递减(用现代的行话说)是解释风险的原因 ,即人们通常表现出的对确定的事情的偏好,而不是对预期价值相同或稍高的有利赌博的偏好。 考虑一下这个选择。
Equal chances to have 1 million or 7 million
拥有100万或700万的平等机会
Utility: (0 + 84)/2 = 42
实用性:(0+84)/2=42
OR
或
Have 4 million with certainty
有400万的把握
Utility: 60
实用性: 60
The expected value of the gamble and the “sure thing” are equal in ducats (4 million), but the psychological utilities of the two options are different, because of the diminishing utility of wealth: the increment of utility from 1 million to 4 million is 60 units, but an equal increment, from 4 to 7 million, increases the utility of wealth by only 24 units. The utility of the gamble is 84/2 = 42 (the utility of its two outcomes, each weighted by its probability of 1/2). The utility of 4 million is 60. Because 60 is more than 42, an individual with this utility function will prefer the sure thing. Bernoulli’s insight was that a decision maker with diminishing marginal utility for wealth will be risk averse.
赌博和 "稳赚不赔 "的预期价值相等(400万),但两个选项的心理 ,因为财富的效用递减:从100万到400万的效用递增是60个单位,但同样的递增,从400万到700万,财富的效用只增加了24个单位。 赌博的效用是84/2=42(其两个结果的效用,每个按其概率的1/2加权计算)。400万的效用是60。 因为60比42多,有这种效用函数的个人会更喜欢有把握的东西。伯努利的见解是,一个对财富的边际效用递减的决策者会厌恶风险。
Bernoulli’s essay is a marvel of concise brilliance. He applied his new concept of expected utility (which he called “moral expectation”) to compute how much a merchant in St. Petersburg would be willing to pay to insure a shipment of spice from Amsterdam if “he is well aware of the fact that at this time of year of one hundred ships which sail from Amsterdam to Petersburg, five are usually lost.” His utility function explained why poor people buy insurance and why richer people sell it to them. As you can see in the table, the loss of 1 million causes a loss of 4 points of utility (from 100 to 96) to someone who has 10 million and a much larger loss of 18 points (from 48 to 30) to someone who starts off with 3 million. The poorer man will happily pay a premium to transfer the risk to the richer one, which is what insurance is about. Bernoulli also offered a solution to the famous “St. Petersburg paradox,” in which people who are offered a gamble that has infinite expected value (in ducats) are willing to spend only a few ducats for it. Most impressive, his analysis of risk attitudes in terms of preferences for wealth has stood the test of time: it is still current in economic analysis almost 300 years later.
伯努利的文章是一个简明辉煌的奇迹。他运用他的预期效用的新概念(他称之为 "道德期望")来计算,如果 "他很清楚在每年的这个时候,从阿姆斯特丹开往彼得堡的一百艘船中,通常有五艘会丢失",那么圣彼得堡的商人 ,愿意为来自阿姆斯特丹的一批香料支付多少保险。他的效用函数解释了为什么穷人要买保险,为什么富人要把保险卖给他们。正如你在表中看到的,损失100万会使拥有1000万的人损失4个点的效用 (从100到96),而对开始拥有300万的人来说,损失更大,达18个点(从48到30)。穷人会很高兴地支付保险费,将风险转移给更富有的人,这就是保险的意义。 伯努利还为著名的 "圣彼得堡悖论 " 提供了一个解决方案,即人们在得到一个具有无限预期 价值(以杜卡为单位)的赌博时,只愿意为其花费几个杜卡。最令人印象深刻的是,他在财富偏好方面对风险态度的分析经受住了时间的考验:几乎在300年后的今天,它仍然是经济分析中的主流。
The longevity of the theory is all the more remarkable because it is seriously flawed. The errors of a theory are rarely found in what it asserts explicitly; they hide in what it ignores or tacitly assumes. For an example, take the following scenarios:
该理论的长寿更引人注目,因为它有严重的缺陷。一个理论的错误很少在它明确断言 ; ,它们隐藏在它忽略的或默示的假设中。举例来说,以下列情况为例。
Today Jack and Jill each have a wealth of 5 million.
今天,杰克和吉尔各自拥有500万的财富。
Yesterday, Jack had 1 million and Jill had 9 million.
昨天,杰克有100万,吉尔有900万。
Are they equally happy? (Do they have the same utility?)
他们是否同样快乐? (他们是否有同样的效用?)
Bernoulli’s theory assumes that the utility of their wealth is what makes people more or less happy. Jack and Jill have the same wealth, and the theory therefore asserts that they should be equally happy, but you do not need a degree in psychology to know that today Jack is elated and Jill despondent. Indeed, we know that Jack would be a great deal happier than Jill even if he had only 2 million today while she has 5. So Bernoulli’s theory must be wrong.
伯努利的理论假设,他们财富的效用是使人们更多或更少的快乐。杰克和吉尔拥有同样的财富, ,因此该理论断言他们应该同样幸福,但你不需要心理学学位就能知道,今天杰克兴高采烈,而吉尔垂头丧气。事实上,我们知道,即使杰克今天只有200万,而她有500万,他也会比吉尔快乐得多,所以伯努利的理论一定是错误的。
The happiness that Jack and Jill experience is determined by the recent change in their wealth, relative to the different states of wealth that define their reference points (1 million for Jack, 9 million for Jill). This reference dependence is ubiquitous in sensation and perception. The same sound will be experienced as very loud or quite faint, depending on whether it was preceded by a whisper or by a roar. To predict the subjective experience of loudness, it is not enough to know its absolute energy; you also need to know the reference sound to which it is automatically compared. Similarly, you need to know about the background before you can predict whether a gray patch on a page will appear dark or light. And you need to know the reference before you can predict the utility of an amount of wealth.
杰克和吉尔所体验到的幸福是由他们的财富最近的 ,相对于定义他们参考点的不同财富状态(杰克为100万,吉尔为900万)决定的。 这种参考依赖在感觉和知觉中无处不在。 同样的声音会被体验为非常响亮或相当微弱,取决于它之前是低语还是吼叫。要预测响度的主观体验,仅仅知道它的绝对能量是不够的, ;你还需要知道与之自动比较的参考声音。同样,你需要了解背景,然后才能预测页面上的一个灰色斑块会显得深还是浅。而在预测一笔财富的效用之前,你需要知道参照物。
For another example of what Bernoulli’s theory misses, consider Anthony and Betty:
关于伯努利理论所遗漏的另一个例子, 考虑安东尼和贝蒂。
Anthony’s current wealth is 1 million.
安东尼目前的财富为100万。
Betty’s current wealth is 4 million.
贝蒂目前的财富是400万。
They are both offered a choice between a gamble and a sure thing.
他们都被提供了一个在赌博和肯定之间的选择。
The gamble: equal chances to end up owning 1 million or 4 million
赌博:最终拥有100万或400万的机会相同
OR
或
The sure thing: own 2 million for sure
肯定的事:肯定拥有200万
In Bernoulli’s account, Anthony and Betty face the same choice: their expected wealth will be 2.5 million if they take the gamble and 2 million if they prefer the sure-thing option. Bernoulli would therefore expect Anthony and Betty to make the same choice, but this prediction is incorrect. Here again, the theory fails because it does not allow for the different reference points from which Anthony and Betty consider their options. If you imagine yourself in Anthony’s and Betty’s shoes, you will quickly see that current wealth matters a great deal. Here is how they may think:
在伯努利的论述中,安东尼和贝蒂面临着同样的选择:如果 ,他们的预期财富将是250万,如果 ,他们更愿意选择稳赚不赔的方案,他们的预期财富将是200万。因此,伯努利预计安东尼和贝蒂会做出同样的选择,但这种预测是错误的。在这里,该理论再次失败,因为它不允许安东尼和贝蒂从不同的 参考点 考虑他们的选择。 如果你想象自己处于安东尼和贝蒂的位置,你会很快 ,目前的财富是非常重要的。以下是他们可能的想法。
Anthony (who currently owns 1 million): “If I choose the sure thing, my wealth will double with certainty. This is very attractive. Alternatively, I can take a gamble with equal chances to quadruple my wealth or to gain nothing.”
安东尼(目前拥有100万):"如果我选择肯定的事情,我的财富将肯定地翻倍。或者,我可以用同等的机会来赌博,使我的财富翻两番,或者什么都不得到。"
Betty (who currently owns 4 million): “If I choose the sure thing, I lose half of my wealth with certainty, which is awful. Alternatively, I can take a gamble with equal chances to lose three-quarters of my wealth or to lose nothing.”
贝蒂(目前拥有400万):"如果我选择了确定的东西,我就会失去一半的财富 ,这很可怕。或者,我可以用同等的机会进行赌博,输掉我四分之三的财富或什么都不输。"
You can sense that Anthony and Betty are likely to make different choices because the sure-thing option of owning 2 million makes Anthony happy and makes Betty miserable. Note also how the sure outcome differs from the worst outcome of the gamble: for Anthony, it is the difference between doubling his wealth and gaining nothing; for Betty, it is the difference between losing half her wealth and losing three-quarters of it. Betty is much more likely to take her chances, as others do when faced with very bad options. As I have told their story, neither Anthony nor Betty thinks in terms of states of wealth: Anthony thinks of gains and Betty thinks of losses. The psychological outcomes they assess are entirely different, although the possible states of wealth they face are the same.
你可以感觉到,安东尼和贝蒂很可能做出不同的选择,因为拥有200万的肯定选择让安东尼高兴,让贝蒂痛苦。还要注意 肯定 的结果与赌博的 最坏 结果有什么不同:对 安东尼来说,这是财富翻倍和一无所获的区别;对贝蒂来说,这是失去一半财富和失去四分之三财富的区别。 贝蒂更倾向于抓住机会,就像其他人在面临非常糟糕的选择时那样。正如我所讲述的他们的故事,安东尼和贝蒂都没有从财富状态的角度进行思考:安东尼想到的是收益,而贝蒂想到的是损失 。他们评估的心理结果完全不同,尽管他们面临的可能的财富状态是相同的。
Because Bernoulli’s model lacks the idea of a reference point, expected utility theory does not represent the obvious fact that the outcome that is good for Anthony is bad for Betty. His model could explain Anthony’s risk aversion, but it cannot explain Betty’s risk-seeking preference for the gamble, a behavior that is often observed in entrepreneurs and in generals when all their options are bad.
由于伯努利的模型缺乏参考点的概念,预期效用理论并不代表这样一个明显的事实:对安东尼有利的结果对贝蒂不利。他的模型可以解释安东尼的风险规避,但不能解释贝蒂对赌博的 ,这种行为经常在企业家和将军身上观察到,当他们所有的选择都是坏的。
All this is rather obvious, isn’t it? One could easily imagine Bernoulli himself constructing similar examples and developing a more complex theory to accommodate them; for some reason, he did not. One could also imagine colleagues of his time disagreeing with him, or later scholars objecting as they read his essay; for some reason, they did not either.
所有这些都是相当明显的,不是吗?人们可以很容易地想象伯努利自己构建类似的例子,并发展出更复杂的理论来适应它们;由于某种原因,他没有这样做。我们也可以想象,他那个时代的同事不同意他的观点 ,或者后来的学者在阅读他的文章时提出反对意见;由于某些原因,他们也没有这样做。
The mystery is how a conception of the utility of outcomes that is vulnerable to such obvious counterexamples survived for so long. I can explain it only by a weakness of the scholarly mind that I have often observed in myself. I call it theory-induced blindness: once you have accepted a theory and used it as a tool in your thinking, it is extraordinarily difficult to notice its flaws. If you come upon an observation that does not seem to fit the model, you assume that there must be a perfectly good explanation that you are somehow missing. You give the theory the benefit of the doubt, trusting the community of experts who have accepted it. Many scholars have surely thought at one time or another of stories such as those of Anthony and Betty, or Jack and Jill, and casually noted that these stories did not jibe with utility theory. But they did not pursue the idea to the point of saying, “This theory is seriously wrong because it ignores the fact that utility depends on the history of one’s wealth , not only on present wealth.” As the psychologist Daniel Gilbert observed, disbelieving is hard work, and System 2 is easily tired.
神秘的是,一个容易受到如此明显的反例影响的结果效用概念是如何生存了这么久的。我只能通过我自己经常观察到的学术思想的弱点来解释 。我把它称为理论诱导的盲目性:一旦你接受了一个理论 ,并把它作为你思考的工具,就会异常难以注意到它的缺陷。如果你遇到一个似乎不符合模型的观察,你会认为一定有一个完美的解释,而你却不知不觉地错过了。 你给这个理论以怀疑的好处,相信那些已经接受它的专家群体。许多学者肯定曾在某个时间 ,比如安东尼和贝蒂的故事,或者杰克和吉尔的故事,并随口指出这些故事与效用理论不相符。但他们并没有追问到说:"这个理论是严重错误的,因为它忽略了一个事实,即效用取决于 一个人的财富的历史 ,而不仅仅是现在的财富。"正如心理学家丹尼尔-吉尔伯特(Daniel Gilbert)所观察到的,不相信 ,是一项艰苦的工作,而系统2很容易疲劳。
SPEAKING OF BERNOULLI’S ERRORS
说到伯努利的错误
“He was very happy with a $20,000 bonus three years ago, but his salary has gone up by 20% since, so he will need a higher bonus to get the same utility.”
"三年前他对2万美元的奖金非常满意,但此后他的工资上涨了20%,所以他需要更高的奖金来获得同样的效用。"
“Both candidates are willing to accept the salary we’re offering, but they won’t be equally satisfied because their reference points are different. She currently has a much higher salary.”
"两位候选人都愿意接受我们提供的薪水,但他们不会同样满意,因为他们的参考点不同。她目前有 ,工资高得多"。
“She’s suing him for alimony. She would actually like to settle, but he prefers to go to court. That’s not surprising—she can only gain, so she’s risk averse. He, on the other hand, faces options that are all bad, so he’d rather take the risk.”
"她正在起诉他,要求支付赡养费。她其实很想和解,但他更喜欢上法庭。这并不奇怪--她只能获得收益,所以她是厌恶风险的。而他,面对的选择都是坏的,所以他宁愿冒这个险。"
26
26
Prospect Theory
前景理论
Amos and I stumbled on the central flaw in Bernoulli’s theory by a lucky combination of skill and ignorance. At Amos’s suggestion, I read a chapter in his book that described experiments in which distinguished scholars had measured the utility of money by asking people to make choices about gambles in which the participant could win or lose a few pennies. The experimenters were measuring the utility of wealth, by modifying wealth within a range of less than a dollar. This raised questions. Is it plausible to assume that people evaluate the gambles by tiny differences in wealth? How could one hope to learn about the psychophysics of wealth by studying reactions to gains and losses of pennies? Recent developments in psychophysical theory suggested that if you want to study the subjective value of wealth , you should ask direct questions about wealth, not about changes of wealth. I did not know enough about utility theory to be blinded by respect for it, and I was puzzled.
阿莫斯和我通过技巧和无知的幸运结合,偶然发现了伯努利理论的核心缺陷。在阿莫斯的建议下,我读了他书中的一章,其中描述了一些实验,杰出的学者 ,通过要求人们对赌博做出选择,参与者可能赢或输几分钱来衡量金钱的效用。 实验者通过在不到一美元的范围内修改财富,来衡量财富的效用。这就提出了问题。 假设人们通过财富的微小差异来评估赌博,这是否合理?通过研究对一分钱的收益和损失的反应,人们怎么可能希望了解 有关财富的心理物理学?心理物理学理论的最新发展表明,如果你想研究 财富的主观价值 ,你应该直接问关于财富的问题,而不是关于财富的变化。 我对效用理论的了解不够,没有被对它的尊重所蒙蔽,我感到很困惑。
When Amos and I met the next day, I reported my difficulties as a vague thought, not as a discovery. I fully expected him to set me straight and to explain why the experiment that had puzzled me made sense after all, but he did nothing of the kind—the relevance of the modern psychophysics was immediately obvious to him. He remembered that the economist Harry Markowitz, who would later earn the Nobel Prize for his work on finance, had proposed a theory in which utilities were attached to changes of wealth rather than to states of wealth. Markowitz’s idea had been around for a quarter of a century and had not attracted much attention, but we quickly concluded that this was the way to go, and that the theory we were planning to develop would define outcomes as gains and losses, not as states of wealth. Knowledge of perception and ignorance about decision theory both contributed to a large step forward in our research.
当阿莫斯和我第二天见到 ,我把我的困难作为一个模糊的想法报告,而不是作为一个发现。我完全期待他能为我解惑,并解释为什么让我困惑的实验毕竟是有意义的,但他没有这样做--现代心理物理学的相关性对他来说立刻就很明显了。他记得经济学家哈里-马科维茨(Harry Markowitz)后来因其在 金融方面的工作而获得了诺贝尔奖,他提出了一个理论,其中效用与财富的变化而不是财富的状态相联系。 马科维茨的想法已经存在了四分之一个世纪,并没有引起人们的注意,但我们很快就得出结论,这就是我们要走的路,我们计划开发的理论将把结果定义为收益和损失,而不是财富的状态。知识 ,而对决策理论的无知都促使我们的研究向前迈出了一大步。
We soon knew that we had overcome a serious case of theory-induced blindness, because the idea we had rejected now seemed not only false but absurd. We were amused to realize that we were unable to assess our current wealth within tens of thousands of dollars. The idea of deriving attitudes to small changes from the utility of wealth now seemed indefensible. You know you have made a theoretical advance when you can no longer reconstruct why you failed for so long to see the obvious. Still, it took us years to explore the implications of thinking about outcomes as gains and losses.
我们很快就知道,我们已经克服了一个严重的理论致盲的情况,因为我们曾经拒绝的想法现在看来不仅是错误的,而且是荒谬的。 我们很高兴地意识到,我们无法评估我们目前的财富在几万美元以内。从财富的效用中推导出对微小变化的态度 ,这种想法现在看来是站不住脚的。当你不再能重建你为什么长期以来未能看到明显的东西时,你就知道你已经取得了理论上的进步。不过,我们还是花了很多年时间来探索将结果视为收益和损失的意义。
In utility theory, the utility of a gain is assessed by comparing the utilities of two states of wealth. For example, the utility of getting an extra $500 when your wealth is $1 million is the difference between the utility of $1,000,500 and the utility of $1 million. And if you own the larger amount, the disutility of losing $500 is again the difference between the utilities of the two states of wealth. In this theory, the utilities of gains and losses are allowed to differ only in their sign (+ or −). There is no way to represent the fact that the disutility of losing $500 could be greater than the utility of winning the same amount—though of course it is. As might be expected in a situation of theory-induced blindness, possible differences between gains and losses were neither expected nor studied. The distinction between gains and losses was assumed not to matter, so there was no point in examining it.
在效用理论中,收益的效用是通过比较两种财富状态的效用来评估的。 例如,当你的财富是100万美元时,得到额外的500美元的效用是100万美元的效用和100万美元的效用之间的差异。而如果你拥有更大的金额,失去500美元的效用又是两种财富状态的效用之间的差异。 在这个理论中,收益和损失的效用只允许在其符号 (+或-)上有所不同。没有办法表示这样一个事实,即输掉500美元的损失可能大于赢得同样数额的效用--尽管它当然是。 正如在理论诱导的盲目性的情况下可能预期的那样,收益和损失之间可能的差异既没有被预期,也没有被研究。 收益和损失之间的区别被认为不重要,所以没有必要 ,来研究它。
Amos and I did not see immediately that our focus on changes of wealth opened the way to an exploration of a new topic. We were mainly concerned with differences between gambles with high or low probability of winning. One day, Amos made the casual suggestion, “How about losses?” and we quickly found that our familiar risk aversion was replaced by risk seeking when we switched our focus. Consider these two problems:
阿莫斯和我并没有立即看到,我们对财富变化的关注为探索一个新的话题开辟了道路。我们主要关注的是获胜概率高或低的赌博之间的差异。有一天,阿莫斯随口提了个建议:"损失如何?"我们很快发现,当我们转换 ,我们熟悉的风险厌恶被寻求风险所取代。考虑一下这两个问题。
Problem 1: Which do you choose?
问题1:你选择哪个?
Get $900 for sure OR 90% chance to get $1,000
确定获得900美元或90%的机会获得1000美元
Problem 2: Which do you choose?
问题2:你选择哪个?
Lose $900 for sure OR 90% chance to lose $1,000
肯定输掉900美元或90%的机会输掉1000美元
You were probably risk averse in problem 1, as is the great majority of people. The subjective value of a gain of $900 is certainly more than 90% of the value of a gain of $1,000. The risk-averse choice in this problem would not have surprised Bernoulli.
你在问题1中可能是规避风险的,绝大多数人都是如此。 900美元收益的主观价值肯定超过1000美元收益的90%。在这个问题中,规避风险的 ,这不会让伯努利感到惊讶。
Now examine your preference in problem 2. If you are like most other people, you chose the gamble in this question. The explanation for this risk-seeking choice is the mirror image of the explanation of risk aversion in problem 1: the (negative) value of losing $900 is much more than 90% of the (negative) value of losing $1,000. The sure loss is very aversive, and this drives you to take the risk. Later, we will see that the evaluations of the probabilities (90% versus 100%) also contributes to both risk aversion in problem 1 and the preference for the gamble in problem 2.
现在考察一下你在问题2中的偏好。 如果你和其他大多数人一样,你选择了这个问题中的赌博。 对这种寻求风险的选择的解释是问题1中对风险厌恶的解释的镜像:失去900美元的(负)价值远远超过失去1000美元的(负)价值的90%。肯定的 损失是非常令人厌恶的,这促使你去承担风险。稍后,我们将看到对概率的评估(90%与100%)也有助于问题1中的风险厌恶和问题2中对赌博的偏爱。
We were not the first to notice that people become risk seeking when all their options are bad, but theory-induced blindness had prevailed. Because the dominant theory did not provide a plausible way to accommodate different attitudes to risk for gains and losses, the fact that the attitudes differed had to be ignored. In contrast, our decision to view outcomes as gains and losses led us to focus precisely on this discrepancy. The observation of contrasting attitudes to risk with favorable and unfavorable prospects soon yielded a significant advance: we found a way to demonstrate the central error in Bernoulli’s model of choice. Have a look:
我们并不是第一个注意到当人们的所有选择都是坏的时候,他们会变得寻求风险,但理论引起的盲目性已经占了上风。因为主流的 理论没有提供一个合理的方法来适应对收益和损失的不同风险态度,所以不得不忽略态度不同的事实。相比之下,我们将结果视为收益和损失的决定使我们恰恰关注了这种差异。对前景有利和不利的风险态度的对比观察很快产生了一个重大进展: ,我们找到了一种方法来证明伯努利选择模型中的核心错误。 看一看。
Problem 3: In addition to whatever you own, you have been given $1,000. You are now asked to choose one of these options:
问题3:除了你拥有的任何东西外,你还得到了1000美元。现在要求你在这些选项中选择一个。
50% chance to win $1,000 OR get $500 for sure
50%的机会赢得1,000美元或肯定获得500美元
Problem 4: In addition to whatever you own, you have been given $2,000. You are now asked to choose one of these options:
问题4:除了你拥有的任何东西外,你还得到了2000美元。现在要求你在这些选项中选择一个。
50% chance to lose $1,000 OR lose $500 for sure
50%的机会 损失1000美元或肯定损失500美元
You can easily confirm that in terms of final states of wealth—all that matters for Bernoulli’s theory—problems 3 and 4 are identical. In both cases you have a choice between the same two options: you can have the certainty of being richer than you currently are by $1,500, or accept a gamble in which you have equal chances to be richer by $1,000 or by $2,000. In Bernoulli’s theory, therefore, the two problems should elicit similar preferences. Check your intuitions, and you will probably guess what other people did.
你可以很容易地确认,在财富的最终状态方面--对于伯努利的理论来说,所有这些都是重要的--问题3和问题4是相同的。在这两种情况下,你可以在同样的两个选项中做出选择:你可以确定比你现在的财富多出1500美元,或者接受一个赌博,你有同等机会多出1000美元或2000美元的财富。 因此,在伯努利的理论中,这两个问题应该引起类似的偏好。 检查你的直觉,你可能会猜到其他人的做法。
In the first choice, a large majority of respondents preferred the sure thing.
在 第一个选择中,绝大多数的受访者都倾向于肯定的事情。
In the second choice, a large majority preferred the gamble.
在第二个选择中,绝大多数人倾向于赌博。
The finding of different preferences in problems 3 and 4 was a decisive counterexample to the key idea of Bernoulli’s theory. If the utility of wealth is all that matters, then transparently equivalent statements of the same problem should yield identical choices. The comparison of the problems highlights the all-important role of the reference point from which the options are evaluated. The reference point is higher than current wealth by $1,000 in problem 3, by $2,000 in problem 4. Being richer by $1,500 is therefore a gain of $500 in problem 3 and a loss in problem 4. Obviously, other examples of the same kind are easy to generate. The story of Anthony and Betty had a similar structure.
在问题3和4中发现不同的偏好是对伯努利理论的 关键思想的决定性反例。 如果财富的效用是最重要的,那么同一问题的透明等价陈述应该产生相同的选择。 这些问题的比较突出了评估选项的参考点的重要作用。 在问题3中,参考点比目前的财富高1,000美元,在问题4中高2,000美元。因此,在问题3中,富裕1,500美元是获得500美元,在问题4中是损失。显然,同类的其他例子也容易产生。安东尼和贝蒂的故事有一个类似的结构。
How much attention did you pay to the gift of $1,000 or $2,000 that you were “given” prior to making your choice? If you are like most people, you barely noticed it. Indeed, there was no reason for you to attend to it, because the gift is included in the reference point, and reference points are generally ignored. You know something about your preferences that utility theorists do not—that your attitudes to risk would not be different if your net worth were higher or lower by a few thousand dollars (unless you are abjectly poor). And you also know that your attitudes to gains and losses are not derived from your evaluation of your wealth. The reason you like the idea of gaining $100 and dislike the idea of losing $100 is not that these amounts change your wealth. You just like winning and dislike losing—and you almost certainly dislike losing more than you like winning.
在做出选择之前,你对 "给 "你的1000美元或2000美元的礼物有多少关注呢?如果你像大多数人一样,你几乎没有注意到它。事实上,没有理由 ,让你去关注它,因为礼物包含在参考点中,而参考点一般都被忽略了。你知道一些关于你的偏好的事情,而效用理论家却不知道--如果你的净资产高或低几千美元,你对风险的态度不会有什么不同(除非你是赤贫的)。而且你也知道,你对收益和损失的态度 ,不是来自你对财富的评价。你之所以喜欢获得100美元,不喜欢失去100美元,并不是因为这些金额改变了你的财富。 你只是喜欢赢,不喜欢输,而且你几乎肯定不喜欢输比喜欢赢更多。
The four problems highlight the weakness of Bernoulli’s model. His theory is too simple and lacks a moving part. The missing variable is the reference point , the earlier state relative to which gains and losses are evaluated. In Bernoulli’s theory you need to know only the state of wealth to determine its utility, but in prospect theory you also need to know the reference state. Prospect theory is therefore more complex than utility theory. In science complexity is considered a cost, which must be justified by a sufficiently rich set of new and (preferably) interesting predictions of facts that the existing theory cannot explain. This was the challenge we had to meet.
这四个问题突出了伯努利模型的弱点。他的理论过于简单,缺少一个移动的 部分。缺少的变量是 参考点 ,即相对于评估收益和损失的早期状态。在伯努利的理论中,你只需要知道财富的状态来确定其效用,但在前景理论中,你还需要知道参考状态。因此,前景理论比效用理论更复杂。在科学中,复杂性被认为是一种成本,必须通过足够丰富的新的和(最好是)有趣的预测来证明现有理论无法解释的事实, 。这是我们必须应对的挑战。
Although Amos and I were not working with the two-systems model of the mind, it’s clear now that there are three cognitive features at the heart of prospect theory. They play an essential role in the evaluation of financial outcomes and are common to many automatic processes of perception, judgment, and emotion. They should be seen as operating characteristics of System 1.
虽然阿莫斯和我当时并没有使用心灵的双系统模型,但现在很清楚,前景理论的核心是有三个认知特征。它们在评估财务 ,并在许多感知、判断和情感的自动过程中起着至关重要的作用。它们应该被看作是系统1的运行特征。
Evaluation is relative to a neutral reference point, which is sometimes referred to as an “adaptation level.” You can easily set up a compelling demonstration of this principle. Place three bowls of water in front of you. Put ice water into the left-hand bowl and warm water into the right-hand bowl. The water in the middle bowl should be at room temperature. Immerse your hands in the cold and warm water for about a minute, then dip both in the middle bowl. You will experience the same temperature as heat in one hand and cold in the other. For financial outcomes, the usual reference point is the status quo, but it can also be the outcome that you expect, or perhaps the outcome to which you feel entitled, for example, the raise or bonus that your colleagues receive. Outcomes that are better than the reference points are gains. Below the reference point they are losses.
评价是相对于一个中立的参考点而言的,这有时被称为 "适应水平"。你可以很容易地设置一个令人信服的这一原则的演示。 在你面前放三碗水。将冰水放入 左手边的碗中,将温水放入右手边的碗中。中间碗里的水应该是室温。将双手浸泡在冷水和温水中约一分钟,然后将双手浸泡在中间的碗中。你会体验到一手是热,一手是冷的相同温度。 对于财务结果,通常的参考点是现状,但也可以是你期望的结果 ,也可能是你觉得有权得到的结果,例如,你的同事获得的加薪或奖金。好于参考点的结果是收益。 低于参考点的则是损失。
A principle of diminishing sensitivity applies to both sensory dimensions and the evaluation of changes of wealth. Turning on a weak light has a large effect in a dark room. The same increment of light may be undetectable in a brightly illuminated room. Similarly, the subjective difference between $900 and $1,000 is much smaller than the difference between $100 and $200.
敏感性递减的原则适用于感官层面和对财富变化的评估。 打开一盏微弱的灯,对 一个黑暗的房间有很大的影响。同样的增量光在一个明亮的房间里可能是无法察觉的。同样地,900美元和1000美元之间的主观差异比100美元和200美元之间的差异小得多。
The third principle is loss aversion. When directly compared or weighted against each other, losses loom larger than gains. This asymmetry between the power of positive and negative expectations or experiences has an evolutionary history. Organisms that treat threats as more urgent than opportunities have a better chance to survive and reproduce.
第三个原则是损失厌恶。 当直接比较或相互加权时,损失比收益更大。这种积极和消极预期 或经验之间的不对称有一个进化的历史。把威胁当作比机会更紧迫的生物有更好的机会生存和繁殖。
The three principles that govern the value of outcomes are illustrated by figure 10 . If prospect theory had a flag, this image would be drawn on it. The graph shows the psychological value of gains and losses, which are the “carriers” of value in prospect theory (unlike Bernoulli’s model, in which states of wealth are the carriers of value). The graph has two distinct parts, to the right and to the left of a neutral reference point. A salient feature is that it is S-shaped, which represents diminishing sensitivity for both gains and losses. Finally, the two curves of the S are not symmetrical. The slope of the function changes abruptly at the reference point: the response to losses is stronger than the response to corresponding gains. This is loss aversion.
图10 说明了支配结果价值的 三个原则 。 如果前景理论有一面旗帜,那么这幅图就会画在上面。该图显示了收益和损失的心理价值,它们是前景理论中价值的 "载体 " (与伯努利的模型不同,财富状态是价值的载体)。该图有两个不同的部分,在一个中性参考点的右边和左边。一个突出的特点是,它是S形的,这代表了收益和损失的敏感性递减。 最后,S的两条曲线不是对称的。函数的斜率在参考点突然改变 :对损失的反应 ,比对相应收益的反应强。这就是损失厌恶。

Figure 10
图10
LOSS AVERSION
亏损率
Many of the options we face in life are “mixed”: there is a risk of loss and an opportunity for gain, and we must decide whether to accept the gamble or reject it. Investors who evaluate a start-up, lawyers who wonder whether to file a lawsuit, wartime generals who consider an offensive, and politicians who must decide whether to run for office all face the possibilities of victory or defeat. For an elementary example of a mixed prospect, examine your reaction to the next question.
我们在生活中面临的许多选择都是 "混合 "的:有损失的风险,也有收益的机会,我们必须决定是接受赌博还是拒绝赌博。 评估创业公司的投资者,不知道是否提起诉讼的律师,考虑进攻的战时将军,以及必须决定是否竞选的政治家,都面临着胜利或失败的可能性 。 对于混合前景的一个基本例子,检查你对下一个问题的反应。
Problem 5: You are offered a gamble on the toss of a coin.
问题5:有人向你提供抛硬币的赌注。
If the coin shows tails, you lose $100.
如果硬币显示为反面,你就损失100美元。
If the coin shows heads, you win $150.
如果硬币显示为正面,你将赢得150美元。
Is this gamble attractive? Would you accept it?
这场赌博有吸引力吗? 你会接受吗?
To make this choice, you must balance the psychological benefit of getting $150 against the psychological cost of losing $100. How do you feel about it? Although the expected value of the gamble is obviously positive, because you stand to gain more than you can lose, you probably dislike it—most people do. The rejection of this gamble is an act of System 2, but the critical inputs are emotional responses that are generated by System 1. For most people, the fear of losing $100 is more intense than the hope of gaining $150. We concluded from many such observations that “losses loom larger than gains” and that people are loss averse .
要做出这个选择,你必须平衡得到150美元的心理收益和失去100美元的 。你对 ,感觉如何?虽然这个赌博的预期价值显然是正的,但由于你的收益大于你的损失,你可能不喜欢它--大多数人都是这样。 拒绝这个赌博是系统2的行为,但关键的输入是由系统1产生的情绪反应。对大多数人来说,失去100美元的恐惧比获得150美元的希望更 。我们从许多这样的观察中得出结论:"损失比收益更大",人们是 厌恶损失 。
You can measure the extent of your aversion to losses by asking yourself a question: What is the smallest gain that I need to balance an equal chance to lose $100? For many people the answer is about $200, twice as much as the loss. The “loss aversion ratio” has been estimated in several experiments and is usually in the range of 1.5 to 2.5. This is an average, of course; some people are much more loss averse than others. Professional risk takers in the financial markets are more tolerant of losses, probably because they do not respond emotionally to every fluctuation. When participants in an experiment were instructed to “think like a trader,” they became less loss averse and their emotional reaction to losses (measured by a physiological index of emotional arousal) was sharply reduced.
你可以通过问自己一个问题来衡量你对损失的厌恶程度:我需要的最小收益是什么,以平衡损失100美元的同等机会?对许多人来说,答案是大约200美元,是损失的两倍。 "损失厌恶率 " 已经在一些实验中被估计出来,通常在1.5到2.5之间。当然,这只是一个平均值;有些人比其他人更厌恶损失。金融市场上的专业风险承担者对损失的容忍度更高,可能是因为他们不会对每一次波动做出情绪化的反应。当实验中的参与者被指示 "像交易员一样思考 "时, ,他们变得不那么厌恶损失,他们 对损失的情绪反应 (通过情绪唤醒的生理指数衡量)急剧减少。
In order to examine your loss aversion ratio for different stakes, consider the following questions. Ignore any social considerations, do not try to appear either bold or cautious, and focus only on the subjective impact of the possible loss and the offsetting gain.
为了考察你对不同赌注的损失厌恶率,请考虑以下问题。 忽略任何社会因素,不要试图表现得既大胆又谨慎,只关注可能损失的主观影响和 ,抵消收益。
Consider a 50–50 gamble in which you can lose $10. What is the smallest gain that makes the gamble attractive? If you say $10, then you are indifferent to risk. If you give a number less than $10, you seek risk. If your answer is above $10, you are loss averse.
考虑一个50-50的赌博,你可能会输掉10美元。使赌博具有吸引力的最小收益是什么?如果你说10美元,那么你对风险无动于衷。 如果你给出的数字低于10美元,你就会寻求风险。 如果你的答案高于10美元,你就会厌恶损失。
What about a possible loss of $500 on a coin toss? What possible gain do you require to offset it?
抛硬币可能损失500美元怎么办? 你需要什么可能的收益来抵消它?
What about a loss of $2,000?
损失 2,000美元呢?
As you carried out this exercise, you probably found that your loss aversion coefficient tends to increase when the stakes rise, but not dramatically. All bets are off, of course, if the possible loss is potentially ruinous, or if your lifestyle is threatened. The loss aversion coefficient is very large in such cases and may even be infinite—there are risks that you will not accept, regardless of how many millions you might stand to win if you are lucky.
在你进行这项练习时,你可能发现,当赌注增加时,你的损失厌恶系数往往会增加,但并不剧烈。当然,如果可能的损失是潜在的破坏性的,或者如果你的生活方式受到威胁,那么所有的赌注都将被取消。 在这种情况下,损失厌恶系数非常大,甚至可能是无限的--有一些风险是你不会接受的, ,如果你幸运的话,你可能会赢得多少百万。
Another look at figure 10 may help prevent a common confusion. In this chapter I have made two claims, which some readers may view as contradictory:
再看一下 图10 可能有助于防止常见的混淆。在 本章中,我提出了两个主张,一些读者可能认为它们是矛盾的。
In mixed gambles, where both a gain and a loss are possible, loss aversion causes extremely risk-averse choices.
在混合型赌博中,收益和损失都是可能的,损失厌恶导致极度规避风险的选择。
In bad choices, where a sure loss is compared to a larger loss that is merely probable, diminishing sensitivity causes risk seeking.
在不好的选择中,一个肯定的损失与一个更大的 ,只是可能的损失相比,敏感性的降低会导致风险的寻求。
There is no contradiction. In the mixed case, the possible loss looms twice as large as the possible gain, as you can see by comparing the slopes of the value function for losses and gains. In the bad case, the bending of the value curve (diminishing sensitivity) causes risk seeking. The pain of losing $900 is more than 90% of the pain of losing $1,000. These two insights are the essence of prospect theory.
在混合情况下,可能的损失是可能的收益的两倍,你可以通过比较损失和收益的价值函数的斜率看到这一点。 在坏情况下,价值曲线的弯曲(敏感性递减)导致风险的寻求。失去900美元的痛苦比 失去1000美元的痛苦的90%。这两个见解是前景理论的精髓。
Figure 10 shows an abrupt change in the slope of the value function where gains turn into losses, because there is considerable loss aversion even when the amount at risk is minuscule relative to your wealth. Is it plausible that attitudes to states of wealth could explain the extreme aversion to small risks? It is a striking example of theory-induced blindness that this obvious flaw in Bernoulli’s theory failed to attract scholarly notice for more than 250 years. In 2000, the behavioral economist Matthew Rabin finally proved mathematically that attempts to explain loss aversion by the utility of wealth are absurd and doomed to fail, and his proof attracted attention. Rabin’s theorem shows that anyone who rejects a favorable gamble with small stakes is mathematically committed to a foolish level of risk aversion for some larger gamble. For example, he notes that most Humans reject the following gamble:
图10 显示了价值函数斜率的突然变化,其中收益变成了损失,因为即使风险金额相对于你的财富来说微不足道,也有相当大的损失厌恶。 对财富状态的态度可以解释对小风险的极端厌恶,这是否合理? 250多年来,伯努利理论中的这一明显缺陷未能引起学术界的注意,这是理论诱导盲目性的一个突出例子。 2000年,行为经济学家马修-拉宾终于从数学上证明,试图用财富效用解释损失厌恶是荒谬的,注定要失败,他的证明引起了人们的注意。 拉宾定理 表明,任何 ,如果拒绝小赌注的有利赌博,在数学上就会对一些较大的赌博承诺愚蠢的风险厌恶程度。例如,他指出,大多数人类拒绝以下的赌博。
50% chance to lose $100 and 50% chance to win $200
50%的机会失去100美元,50%的机会赢得200美元
He then shows that according to utility theory, an individual who rejects that gamble will also turn down the following gamble:
然后他表明,根据效用理论,一个 拒绝该赌博 的人也会拒绝下面的赌博。
50% chance to lose $200 and 50% chance to win $20,000
50%的机会 ,失去200美元,50%的机会赢得20,000美元。
But of course no one in his or her right mind will reject this gamble! In an exuberant article they wrote about the proof, Matthew Rabin and Richard Thaler commented that the larger gamble “has an expected return of $9,900—with exactly zero chance of losing more than $200. Even a lousy lawyer could have you declared legally insane for turning down this gamble.”
但当然,没有一个头脑正常的人会拒绝这场赌博的。马修-拉宾和理查德- 塞勒在他们写的一篇关于证明的旺盛文章中评论说,更大的赌博 "有9900美元的预期回报--损失超过200美元的可能性完全为零。 即使是一个糟糕的律师 ,也会因为你拒绝这个 赌博而宣布你在法律上是疯了。"
Perhaps carried away by their enthusiasm, they concluded their article by recalling the famous Monty Python sketch in which a frustrated customer attempts to return a dead parrot to a pet store. The customer uses a long series of phrases to describe the state of the bird, culminating in “this is an ex-parrot.” Rabin and Thaler went on to say that “it is time for economists to recognize that expected utility is an ex-hypothesis.” Many economists saw this flippant statement as little short of blasphemy. However, the theory-induced blindness of accepting the utility of wealth as an explanation of attitudes to small losses is a legitimate target for humorous comment.
也许是被他们的热情冲昏了头脑,他们在文章的最后回顾了著名的Monty Python小品,其中一个沮丧的顾客试图把一只死鹦鹉送回宠物店。顾客用一长串短语来描述鸟的状态,最后说 "这是一只前鹦鹉"。拉宾和塞勒接着说,"现在是经济学家承认 ,预期效用是一种前假设的时候了"。许多经济学家认为这种轻描淡写的说法不啻为亵渎神明。然而,接受财富的效用作为对小损失的态度的解释,这种理论引起的盲目性是幽默评论的合法目标。
BLIND SPOTS OF PROSPECT THEORY
前景理论的盲点
So far in this part of the book I have extolled the virtues of prospect theory and criticized the rational model and expected utility theory. It is time for some balance.
到目前为止,在本书的这一部分,我已经颂扬了前景理论的优点, 批评了理性模型和预期效用理论。现在是取得一些平衡的时候了。
Most graduate students in economics have heard about prospect theory and loss aversion, but you are unlikely to find these terms in the index of an introductory text in economics. I am sometimes pained by this omission, but in fact it is quite reasonable, because of the central role of rationality in basic economic theory. The standard concepts and results that undergraduates are taught are most easily explained by assuming that Econs do not make foolish mistakes. This assumption is truly necessary, and it would be undermined by introducing the Humans of prospect theory, whose evaluations of outcomes are unreasonably short-sighted.
大多数经济学研究生都听说过前景理论和损失规避,但你不太可能在经济学入门教材的索引中找到这些术语。我有时为这一遗漏感到痛苦,但事实上这是很合理的,因为理性在基本 经济理论中的核心作用。假设Econs不犯愚蠢的错误,本科生所学的标准概念和结果最容易解释。这个假设确实是必要的,如果引入前景理论的悍将,它将被破坏,因为前景理论对结果的评价是不合理的短视。
There are good reasons for keeping prospect theory out of introductory texts. The basic concepts of economics are essential intellectual tools, which are not easy to grasp even with simplified and unrealistic assumptions about the nature of the economic agents who interact in markets. Raising questions about these assumptions even as they are introduced would be confusing, and perhaps demoralizing. It is reasonable to put priority on helping students acquire the basic tools of the discipline. Furthermore, the failure of rationality that is built into prospect theory is often irrelevant to the predictions of economic theory, which work out with great precision in some situations and provide good approximations in many others. In some contexts, however, the difference becomes significant: the Humans described by prospect theory are guided by the immediate emotional impact of gains and losses, not by long-term prospects of wealth and global utility.
将前景理论排除在入门级 ,有很好的理由。经济学的基本概念是重要的智力工具,即使对在市场中互动的经济主体的性质进行简化和不现实的假设,也不容易掌握。即使在介绍这些假设时提出问题,也会使人感到困惑,也许还会使人丧失信心。合理的做法是优先帮助学生获得 ,掌握该学科的基本工具。此外,前景理论中的理性失败往往与经济理论的预测无关,经济理论在某些情况下非常精确,而在其他许多情况下则提供了良好的近似值。 然而,在某些情况下,这种差异变得很重要:前景理论所描述的人类是以收益和损失的直接 ,而不是以财富和全球效用的长期前景为指导。
I emphasized theory-induced blindness in my discussion of flaws in Bernoulli’s model that remained unquestioned for more than two centuries. But of course theory-induced blindness is not restricted to expected utility theory. Prospect theory has flaws of its own, and theory-induced blindness to these flaws has contributed to its acceptance as the main alternative to utility theory.
我在讨论伯努利模型的缺陷时强调了理论上的盲目性,这些缺陷两个多世纪以来一直没有受到质疑。 但当然,理论上的盲目性并不限于预期效用理论。前景理论有其自身的缺陷,而理论引起的对这些 的缺陷的盲目性,促使其被接受为效用理论的主要替代方案。
Consider the assumption of prospect theory, that the reference point, usually the status quo, has a value of zero. This assumption seems reasonable, but it leads to some absurd consequences. Have a good look at the following prospects. What would it be like to own them?
考虑到前景理论的假设,即参考点,通常是现状,其价值为零。这个假设似乎是合理的,但它导致了一些荒谬的后果。 好好看看以下的前景。 拥有它们会是什么样子?
A. one chance in a million to win $1 million
A. 一百万分之一的机会赢得100万美元
B. 10% chance to win $12 and 90% chance to win nothing
B. 10%的机会赢得12美元,90%的机会一无所获
C. 90% chance to win $1 million and 10% chance to win nothing
C. 90%的机会赢得100万美元,10%的机会一无所获
Winning nothing is a possible outcome in all three gambles, and prospect theory assigns the same value to that outcome in the three cases. Winning nothing is the reference point and its value is zero. Do these statements correspond to your experience? Of course not. Winning nothing is a nonevent in the first two cases, and assigning it a value of zero makes good sense. In contrast, failing to win in the third scenario is intensely disappointing. Like a salary increase that has been promised informally, the high probability of winning the large sum sets up a tentative new reference point. Relative to your expectations, winning nothing will be experienced as a large loss. Prospect theory cannot cope with this fact, because it does not allow the value of an outcome (in this case, winning nothing) to change when it is highly unlikely, or when the alternative is very valuable. In simple words, prospect theory cannot deal with disappointment. Disappointment and the anticipation of disappointment are real, however, and the failure to acknowledge them is as obvious a flaw as the counterexamples that I invoked to criticize Bernoulli’s theory.
在这三种赌博中,一无所获是一种可能的结果,前景理论对这三种情况下的结果赋予了相同的价值。 无所获是参考点,其价值为零。这些说法是否与你的经验相符?当然不是。在前两种情况下,一无所获 ,是一种非事件,将其赋值为零是很有意义的。相比之下,在第三种情况下未能获胜是令人强烈失望的。就像已经非正式承诺的加薪一样,赢得大笔款项的高概率为我们建立了一个暂定的新参考点。相对于你的期望,一无所获将被视为一种巨大的损失。 前景理论无法应对这一事实,因为它不允许一个结果(在这种情况下,一无所获)的价值在它极不可能发生或替代方案非常有价值时发生变化。 简单地说,前景理论无法应对失望。 然而,失望和对失望的预期是真实存在的,不承认它们与我为批评伯努利的理论所援引的反例一样,是明显的 。
Prospect theory and utility theory also fail to allow for regret. The two theories share the assumption that available options in a choice are evaluated separately and independently, and that the option with the highest value is selected. This assumption is certainly wrong, as the following example shows.
前景理论和效用理论也没有考虑到后悔的问题。这两种理论的共同假设是:选择中的可用选项是单独和独立评估的,并且选择价值最高的选项。 这种假设当然是错误的,正如下面的例子所示。
Problem 6: Choose between 90% chance to win $1 million OR $50 with certainty.
问题6: 在90%的机会赢得100万美元或50美元的确定性中选择。
Problem 7: Choose between 90% chance to win $1 million OR $150,000 with certainty.
问题7:在90%的机会赢得100万美元或150,000美元的确定性中选择。
Compare the anticipated pain of choosing the gamble and not winning in the two cases. Failing to win is a disappointment in both, but the potential pain is compounded in problem 7 by knowing that if you choose the gamble and lose you will regret the “greedy” decision you made by spurning a sure gift of $150,000. In regret, the experience of an outcome depends on an option you could have adopted but did not.
比较一下在这两种情况下,选择赌博和 没有 赢的预期痛苦。 在这两种情况下,没有赢是一种失望,但在问题7中,潜在的痛苦更加复杂,因为你知道如果你选择赌博并输了,你会后悔 ,你做出了 "贪婪 "的决定,放弃了15万美元的肯定礼物。在遗憾中,一个结果的经验取决于你本可以采取但没有采取的选择。
Several economists and psychologists have proposed models of decision making that are based on the emotions of regret and disappointment. It is fair to say that these models have had less influence than prospect theory, and the reason is instructive. The emotions of regret and disappointment are real, and decision makers surely anticipate these emotions when making their choices. The problem is that regret theories make few striking predictions that would distinguish them from prospect theory, which has the advantage of being simpler. The complexity of prospect theory was more acceptable in the competition with expected utility theory because it did predict observations that expected utility theory could not explain.
一些 经济学家和心理学家 提出了基于后悔和失望情绪的决策模型。可以说,这些模型的影响力不如前景理论,其原因 ,具有指导意义。后悔和失望的情绪是真实存在的,决策者在做出选择时肯定会预见到这些情绪。问题是,后悔理论很少做出引人注目的预测,使其区别于前景理论,而前景理论的优势是比较简单。在与预期效用 理论的竞争中,前景理论的复杂性更容易被接受,因为它确实预测了预期效用理论无法解释的观察。
Richer and more realistic assumptions do not suffice to make a theory successful. Scientists use theories as a bag of working tools, and they will not take on the burden of a heavier bag unless the new tools are very useful. Prospect theory was accepted by many scholars not because it is “true” but because the concepts that it added to utility theory, notably the reference point and loss aversion, were worth the trouble; they yielded new predictions that turned out to be true. We were lucky.
更丰富和更现实的假设并不足以使一个理论成功。科学家把理论当作工作工具的袋子,除非新工具非常有用,否则他们不会承担更重的袋子的负担。前景理论被许多学者接受,并不是因为它是 "真实的",而是因为 ,它为效用理论增加的概念,特别是参考点和损失厌恶,是值得一试的;它们产生了新的预测,结果是真实的。 我们很幸运。
SPEAKING OF PROSPECT THEORY
说到前景理论
“He suffers from extreme loss aversion, which makes him turn down very favorable opportunities.”
"他患有极端的损失厌恶症,这使他拒绝了非常有利的机会。"
“Considering her vast wealth, her emotional response to trivial gains and losses makes no sense.”
"考虑到她的巨额财富,她对琐碎的收益和损失的情绪反应 ,毫无意义。"
“He weighs losses about twice as much as gains, which is normal.”
"他的体重损失大约是收益的两倍,这很正常。"
27
27
The Endowment Effect
捐赠效应
You have probably seen figure 11 or a close cousin of it even if you never had a class in economics. The graph displays an individual’s “indifference map” for two goods.
即使你从未上过经济学课,你可能也见过 图11 或它的近亲。该图显示了一个人对两种商品的 "冷漠图"。

Figure 11
图11
Students learn in introductory economics classes that each point on the map specifies a particular combination of income and vacation days. Each “indifference curve” connects the combinations of the two goods that are equally desirable—they have the same utility. The curves would turn into parallel straight lines if people were willing to “sell” vacation days for extra income at the same price regardless of how much income and how much vacation time they have. The convex shape indicates diminishing marginal utility: the more leisure you have, the less you care for an extra day of it, and each added day is worth less than the one before. Similarly, the more income you have, the less you care for an extra dollar, and the amount you are willing to give up for an extra day of leisure increases.
学生 ,在经济学入门课程中了解到,地图上的每一个点都指定了收入和假期的特定组合。每条 "冷漠曲线 "都连接着两种商品的组合,它们具有相同的效用。 如果人们愿意以相同的价格 "出售 "假期来换取额外的收入,无论他们有多少收入和多少假期,曲线都会变成平行的直线。凸形表示边际效用递减:你拥有的闲暇越多,你就越不关心多一天的闲暇,每增加一天都比之前的价值低。同样,你的收入越多,你就越不在乎多出的一美元,你愿意为多一天的休闲而放弃的金额也会增加 。
All locations on an indifference curve are equally attractive. This is literally what indifference means: you don’t care where you are on an indifference curve. So if A and B are on the same indifference curve for you, you are indifferent between them and will need no incentive to move from one to the other, or back. Some version of this figure has appeared in every economics textbook written in the last hundred years, and many millions of students have stared at it. Few have noticed what is missing. Here again, the power and elegance of a theoretical model have blinded students and scholars to a serious deficiency.
冷漠曲线上的所有位置都具有同样的吸引力。 这就是冷漠的字面意思:你并不关心你在冷漠曲线上的位置。因此,如果A和B对你来说是在同一条冷漠曲线上,你对它们是无动于衷的,不需要任何激励来从一个转向另一个,或者返回。在过去的一百年里,这个数字的某个版本出现在每一本经济学教科书 ,数百万学生盯着它看。 很少有人注意到缺少了什么。在这里,一个理论模型的力量和优雅再次使学生和学者看不到一个严重的缺陷。
What is missing from the figure is an indication of the individual’s current income and leisure. If you are a salaried employee, the terms of your employment specify a salary and a number of vacation days, which is a point on the map. This is your reference point, your status quo, but the figure does not show it. By failing to display it, the theorists who draw this figure invite you to believe that the reference point does not matter, but by now you know that of course it does. This is Bernoulli’s error all over again. The representation of indifference curves implicitly assumes that your utility at any given moment is determined entirely by your present situation, that the past is irrelevant, and that your evaluation of a possible job does not depend on the terms of your current job. These assumptions are completely unrealistic in this case and in many others.
图中缺少的 是对个人目前收入和闲暇的说明。如果你是一名受薪雇员,你的雇佣条件规定 ,工资和休假天数,这就是图中的一个点。这是你的参考点,你的现状,但图中并没有显示。通过不显示它,画这个图的理论家邀请你相信参考点并不重要,但现在你知道它当然重要。这又是伯努利的错误。冷漠 曲线的表述隐含着这样的假设:你在任何特定时刻的效用完全由你现在的情况决定,过去是不相关的,你对一份可能的工作的评价不取决于你目前工作的条件。 这些假设在这个案例和其他许多案例中都是完全不现实的。
The omission of the reference point from the indifference map is a surprising case of theory-induced blindness, because we so often encounter cases in which the reference point obviously matters. In labor negotiations, it is well understood by both sides that the reference point is the existing contract and that the negotiations will focus on mutual demands for concessions relative to that reference point. The role of loss aversion in bargaining is also well understood: making concessions hurts. You have much personal experience of the role of reference point. If you changed jobs or locations, or even considered such a change, you surely remember that the features of the new place were coded as pluses or minuses relative to where you were. You may also have noticed that disadvantages loomed larger than advantages in this evaluation—loss aversion was at work. It is difficult to accept changes for the worse. For example, the minimal wage that unemployed workers would accept for new employment averages 90% of their previous wage, and it drops by less than 10% over a period of one year .
漠视地图中参考点的遗漏是一个令人惊讶的案例 ,因为我们经常遇到参考点明显重要的情况。在劳资谈判中,双方都很清楚,参考点是现有的合同,谈判的重点是相对于该参考点的相互要求的让步。损失厌恶在讨价还价中的作用也很好理解:做出 让步会造成伤害。你对参考点的作用有很多 个人经验。如果你换了工作或地点,或者甚至考虑过这样的改变,你肯定记得新地方的特征被编码为相对于你所在的地方的优点或缺点。你可能也注意到,在这个评价中,缺点比优点更重要,损失厌恶在起作用。很难 ,接受更坏的变化。例如,失业工人为新工作接受的最低工资平均为他们以前工资的90%,而且在 一年 的时间内下降不到10%。
To appreciate the power that the reference point exerts on choices, consider Albert and Ben, “hedonic twins” who have identical tastes and currently hold identical starting jobs, with little income and little leisure time. Their current circumstances correspond to the point marked 1 in figure 11 . The firm offers them two improved positions, A and B, and lets them decide who will get a raise of $10,000 (position A) and who will get an extra day of paid vacation each month (position B). As they are both indifferent, they toss a coin. Albert gets the raise, Ben gets the extra leisure. Some time passes as the twins get accustomed to their positions. Now the company suggests they may switch jobs if they wish.
为了理解参考点对选择的影响,请考虑阿尔伯特和本,这对 "享乐型双胞胎 "有相同的品味,目前拥有相同的起始工作, ,收入很少,休闲时间也很少。他们目前的情况对应 于图11 中标有1的点。公司为他们提供了两个改进的职位,A和B,并让他们决定谁将获得10,000美元的加薪(职位A),谁将获得每月多一天的带薪假期(职位B)。由于他们都无所谓,他们抛出一枚硬币。 一段时间过去了,这对双胞胎适应了他们的岗位。 现在,公司建议他们如果愿意,可以换工作。
The standard theory represented in the figure assumes that preferences are stable over time. Positions A and B are equally attractive for both twins and they will need little or no incentive to switch. In sharp contrast, prospect theory asserts that both twins will definitely prefer to remain as they are. This preference for the status quo is a consequence of loss aversion.
图中所代表的标准理论假设,偏好随着时间的推移是稳定的。 职位A和B对双胞胎都有同样的吸引力,他们几乎不需要任何激励来转换。与此形成鲜明对比的是,前景理论断言,这对双胞胎 ,肯定会选择保持现状。这种对现状的偏好是损失厌恶的结果。
Let us focus on Albert. He was initially in position 1 on the graph, and from that reference point he found these two alternatives equally attractive:
让我们专注于阿尔伯特。他最初是在图上的位置1,从这个参考点出发,他发现这两个选择同样具有吸引力。
Go to A: a raise of $10,000
转到A:增加10,000美元
OR
或
Go to B: 12 extra days of vacation
转到B:多出12天的假期
Taking position A changes Albert’s reference point, and when he considers switching to B, his choice has a new structure:
采取A位置改变了阿尔伯特的参考点, ,当他考虑转到B位置时,他的选择有了新的结构。
Stay at A: no gain and no loss
留在A区:没有收获也没有损失
OR
或
Move to B: 12 extra days of vacation and a $10,000 salary cut
转到B:多出12天假期,减薪10,000美元
You just had the subjective experience of loss aversion. You could feel it: a salary cut of $10,000 is very bad news. Even if a gain of 12 vacation days was as impressive as a gain of $10,000, the same improvement of leisure is not sufficient to compensate for a loss of $10,000. Albert will stay at A because the disadvantage of moving outweighs the advantage. The same reasoning applies to Ben, who will also want to keep his present job because the loss of now-precious leisure outweighs the benefit of the extra income.
你刚刚有了损失厌恶的主观体验。 你能感觉到:减薪一万美元是非常糟糕的消息。即使12天假期的收益与1万美元的收益一样令人印象深刻,同样的休闲改善也 ,不足以弥补1万美元的损失。阿尔伯特会留在A市,因为搬家的弊大于利。 同样的推理也适用于本,他也想保留现在的工作,因为现在宝贵的休闲时间的损失超过了额外收入的好处。
This example highlights two aspects of choice that the standard model of indifference curves does not predict. First, tastes are not fixed; they vary with the reference point. Second, the disadvantages of a change loom larger than its advantages, inducing a bias that favors the status quo. Of course, loss aversion does not imply that you never prefer to change your situation; the benefits of an opportunity may exceed even overweighted losses. Loss aversion implies only that choices are strongly biased in favor of the reference situation (and generally biased to favor small rather than large changes).
这个例子强调了选择的两个方面,而冷漠曲线的标准模型并没有 。 首先,品味不是固定的;它们随着参考点的变化而变化。第二,变革的弊端比其优势更大,诱发了偏向于现状的偏见。当然,损失厌恶并不意味着你永远不愿意改变你的情况;机会的好处甚至可能超过超重的损失。损失厌恶只意味着选择强烈地 ,偏向于参考情况(而且一般偏向于小的而不是大的变化)。
Conventional indifference maps and Bernoulli’s representation of outcomes as states of wealth share a mistaken assumption: that your utility for a state of affairs depends only on that state and is not affected by your history. Correcting that mistake has been one of the achievements of behavioral economics.
传统的冷漠图和伯努利将结果表示为财富状态的做法有一个错误的假设:你对某一状态的效用只取决于该状态,而不受你的历史影响。纠正这一错误是 行为经济学的成就之一。
THE ENDOWMENT EFFECT
禀赋效应
The question of when an approach or a movement got its start is often difficult to answer, but the origin of what is now known as behavioral economics can be specified precisely. In the early 1970s, Richard Thaler, then a graduate student in the very conservative economics department of the University of Rochester, began having heretical thoughts. Thaler always had a sharp wit and an ironic bent, and as a student he amused himself by collecting observations of behavior that the model of rational economic behavior could not explain. He took special pleasure in evidence of economic irrationality among his professors, and he found one that was particularly striking.
一个方法或运动何时开始的问题往往很难回答,但现在被称为行为经济学的起源却可以准确说明。 20世纪70年代初,当时在罗切斯特大学非常保守的经济系读研究生的理查德-塞勒开始有异端思想。20世纪70年代初,当时在罗切斯特大学非常保守的经济系读研究生的理查德-塞勒开始有异端思想。塞勒 ,他总是有敏锐的智慧和讽刺的倾向。作为学生,他通过收集理性经济行为模型无法解释的行为观察来消遣自己。他对他的教授中的经济非理性的证据感到特别高兴,他发现了一个特别引人注目的证据。
Professor R (now revealed to be Richard Rosett, who went on to become the dean of the University of Chicago Graduate School of Business) was a firm believer in standard economic theory as well as a sophisticated wine lover. Thaler observed that Professor R was very reluctant to sell a bottle from his collection—even at the high price of $100 (in 1975 dollars!). Professor R bought wine at auctions, but would never pay more than $35 for a bottle of that quality. At prices between $35 and $100, he would neither buy nor sell. The large gap is inconsistent with economic theory, in which the professor is expected to have a single value for the bottle. If a particular bottle is worth $50 to him, then he should be willing to sell it for any amount in excess of $50. If he did not own the bottle , he should be willing to pay any amount up to $50 for it. The just-acceptable selling price and the just-acceptable buying price should have been identical, but in fact the minimum price to sell ($100) was much higher than the maximum buying price of $35. Owning the good appeared to increase its value.
R教授(现在发现是理查德-罗塞特,他后来成为芝加哥大学商学院 的院长)是标准经济理论的坚定信仰者,同时也是一个成熟的葡萄酒爱好者。泰勒观察到,R教授非常不愿意出售他收藏的瓶子--即使是以100美元的高价(以1975年的美元计算!)。 R教授在拍卖会上购买葡萄酒,但绝不会为该质量的葡萄酒支付超过35美元的价格。在 35美元到100美元之间的价格,他既不买也不卖。巨大的差距与经济理论不一致,在经济理论中,教授被期望对瓶子有一个单一的价值。如果某个瓶子对他来说值50美元,那么他应该愿意以超过50美元的任何金额出售。如果他 不拥有这瓶酒 ,他应该愿意为它支付任何金额,最多50美元。公正接受的出售 价格和公正接受的购买价格应该是相同的,但事实上最低出售价格(100美元)远远高于最高购买价格35美元。拥有该物品似乎增加了它的价值。
Richard Thaler found many examples of what he called the endowment effect , especially for goods that are not regularly traded. You can easily imagine yourself in a similar situation. Suppose you hold a ticket to a sold-out concert by a popular band, which you bought at the regular price of $200. You are an avid fan and would have been willing to pay up to $500 for the ticket. Now you have your ticket and you learn on the Internet that richer or more desperate fans are offering $3,000. Would you sell? If you resemble most of the audience at sold-out events you do not sell. Your lowest selling price is above $3,000 and your maximum buying price is $500. This is an example of an endowment effect, and a believer in standard economic theory would be puzzled by it . Thaler was looking for an account that could explain puzzles of this kind.
理查德-塞勒(Richard Thaler)发现了许多他所谓的 禀赋效应 的例子,特别是对于那些不定期交易的商品。你可以很容易地想象自己处于类似的情况。 假设你持有一张某流行乐队的音乐会的门票,你以200美元的正常价格购买了这张门票。你是一个狂热的粉丝,并且愿意为这张门票支付高达500美元。现在你有了票,你在互联网上了解到,更富有或更绝望的粉丝在提供3000美元。你会卖吗?如果你像大多数观众在售卖活动中的样子,你就不卖。 你的最低售价在3000美元以上,你的最高买价是500美元。这是一个禀赋效应的例子,一个标准经济理论的信徒 会对此感到困惑 。泰勒一直在寻找一个可以解释这类谜题的账户。
Chance intervened when Thaler met one of our former students at a conference and obtained an early draft of prospect theory. He reports that he read the manuscript with considerable excitement, because he quickly realized that the loss-averse value function of prospect theory could explain the endowment effect and some other puzzles in his collection. The solution was to abandon the standard idea that Professor R had a unique utility for the state of having a particular bottle. Prospect theory suggested that the willingness to buy or sell the bottle depends on the reference point—whether or not the professor owns the bottle now. If he owns it, he considers the pain of giving up the bottle. If he does not own it, he considers the pleasure of getting the bottle. The values were unequal because of loss aversion: giving up a bottle of nice wine is more painful than getting an equally good bottle is pleasurable. Remember the graph of losses and gains in the previous chapter. The slope of the function is steeper in the negative domain; the response to a loss is stronger than the response to a corresponding gain. This was the explanation of the endowment effect that Thaler had been searching for. And the first application of prospect theory to an economic puzzle now appears to have been a significant milestone in the development of behavioral economics.
机会来了,塞勒在一次会议上遇到了我们以前的一个学生,并获得了前景理论的早期草案。他报告说, ,他怀着相当激动的心情阅读了这份手稿,因为他很快意识到,前景理论的损失规避价值函数可以解释禀赋效应和他的收藏中的一些其他困惑。解决方案是放弃R教授对 拥有 特定瓶子的状态有独特效用的标准想法。前景理论认为, 买或卖瓶子的意愿取决于参考点--教授现在是否拥有这个瓶子。 如果他拥有这个瓶子,他就会考虑 放弃 瓶子的痛苦。如果他不拥有它,他考虑的是 得到 这瓶酒的乐趣。价值不平等是因为损失厌恶: 放弃一瓶好酒 比得到一瓶同样好的酒更痛苦,是愉悦的。记得 ,上一章的损失和收益图。函数的斜率在负值领域更陡峭;对损失的反应比对相应收益的反应更强烈。 这就是塞勒一直在寻找的对禀赋效应的解释。而前景理论在经济 ,现在看来是 行为经济学发展中的一个重要里程碑。
Thaler arranged to spend a year at Stanford when he knew that Amos and I would be there. During this productive period, we learned much from each other and became friends. Seven years later, he and I had another opportunity to spend a year together and to continue the conversation between psychology and economics. The Russell Sage Foundation, which was for a long time the main sponsor of behavioral economics, gave one of its first grants to Thaler for the purpose of spending a year with me in Vancouver. During that year, we worked closely with a local economist, Jack Knetsch, with whom we shared intense interest in the endowment effect, the rules of economic fairness, and spicy Chinese food.
当Thaler知道Amos和我将在斯坦福大学学习时,他安排了一年的时间。 在这个富有成效的时期,我们从对方那里学到了很多东西,并成为了朋友。七年后,他和我又有机会在一起呆了一年,继续心理学和经济学之间的对话。 罗素-塞奇基金会(Russell Sage Foundation)在很长一段时间内都是行为经济学的主要赞助者,它把第一批资助金之一给了塞勒,目的是让他和我在温哥华呆一年。在那一年里,我们与当地的一位经济学家Jack Knetsch密切合作,我们对禀赋效应、经济公平规则和辛辣的中国食物有着强烈的兴趣。
The starting point for our investigation was that the endowment effect is not universal. If someone asks you to change a $5 bill for five singles, you hand over the five ones without any sense of loss. Nor is there much loss aversion when you shop for shoes. The merchant who gives up the shoes in exchange for money certainly feels no loss. Indeed, the shoes that he hands over have always been, from his point of view, a cumbersome proxy for money that he was hoping to collect from some consumer. Furthermore, you probably do not experience paying the merchant as a loss, because you were effectively holding money as a proxy for the shoes you intended to buy. These cases of routine trading are not essentially different from the exchange of a $5 bill for five singles. There is no loss aversion on either side of routine commercial exchanges.
我们调查的出发点是, 禀赋效应并不普遍。如果有人让你把一张5美元的钞票换成5张单数的,你就把5张单数递过去,没有任何损失感。买鞋的时候也没有什么损失厌恶感。放弃鞋子以换取金钱的商人当然不会感到损失。事实上,从他的角度来看,他交出的鞋子一直是一个繁琐的代理,即他希望从某个消费者那里收取的钱 。此外,你可能并不觉得向商家付款是一种损失,因为你实际上是持有金钱作为你打算购买的鞋子的代理。这些常规交易的案例与用5美元的钞票换取5张单据没有本质上的区别。在常规的商业交流中,双方都不存在损失厌恶。
What distinguishes these market transactions from Professor R’s reluctance to sell his wine, or the reluctance of Super Bowl ticket holders to sell even at a very high price? The distinctive feature is that both the shoes the merchant sells you and the money you spend from your budget for shoes are held “for exchange.” They are intended to be traded for other goods. Other goods, such as wine and Super Bowl tickets, are held “for use,” to be consumed or otherwise enjoyed. Your leisure time and the standard of living that your income supports are also not intended for sale or exchange.
这些 市场交易 与R教授不愿意出售他的酒,或超级碗门票持有者不愿意出售,甚至以很高的价格出售,有什么区别?其显著特点是,商家卖给你的鞋和你从预算中拿出的买鞋的钱都是用来 "交换 "的。它们的目的是用来换取其他商品。其他商品,如葡萄酒和 "超级碗 "门票,是 "为使用而持有",用于消费或以其他方式享受。你的休闲时间和你的收入所支持的生活标准也不是为了出售或交换。
Knetsch, Thaler, and I set out to design an experiment that would highlight the contrast between goods that are held for use and for exchange. We borrowed one aspect of the design of our experiment from Vernon Smith, the founder of experimental economics, with whom I would share a Nobel Prize many years later. In this method, a limited number of tokens are distributed to the participants in a “market.” Any participants who own a token at the end of the experiment can redeem it for cash. The redemption values differ for different individuals, to represent the fact that the goods traded in markets are more valuable to some people than to others. The same token may be worth $10 to you and $20 to me, and an exchange at any price between these values will be advantageous to both of us.
Knetsch、Thaler和我开始设计一个实验,以突出为使用和交换而持有的商品之间的对比。我们从实验经济学的创始人 Vernon Smith那里借用了我们实验设计的一个方面,多年后我将与他分享诺贝尔奖。在这种方法中,有限数量的代币被分配给 "市场 "的参与者。任何在实验结束时拥有代币的参与者都可以将其兑换成现金。赎回价值 ,对不同的人来说是不同的,以代表市场上交易的商品 ,对一些人来说比对另一些人更有价值。同样的代币对你来说可能值10美元,对我来说可能值20美元,在这些价值之间以任何价格进行交换都会对我们双方有利。
Smith created vivid demonstrations of how well the basic mechanisms of supply and demand work. Individuals would make successive public offers to buy or sell a token, and others would respond publicly to the offer. Everyone watches these exchanges and sees the price at which the tokens change hands. The results are as regular as those of a demonstration in physics. As inevitably as water flows downhill, those who own a token that is of little value to them (because their redemption values are low) end up selling their token at a profit to someone who values it more. When trading ends, the tokens are in the hands of those who can get the most money for them from the experimenter. The magic of the markets has worked! Furthermore, economic theory correctly predicts both the final price at which the market will settle and the number of tokens that will change hands. If half the participants in the market were randomly assigned tokens, the theory predicts that half of the tokens will change hands .
史密斯创造了生动的示范,展示了基本的供求机制是如何运作的。 个人会连续公开提出购买或出售代币,其他人会公开回应 。每个人都在观察这些交换,看到代币易手的价格。其结果就像物理学中的演示一样有规律。就像水往低处流一样,那些拥有对他们来说没有什么价值的代币的人(因为他们的赎回价值很低)最终会把他们的代币卖给更重视它的人,这是很不可避免的。当交易结束时,代币被 在那些能从实验者那里得到最多钱的人手中。市场的魔力发挥了作用!此外,经济理论正确地预测了市场最终结算的价格和将转手的代币数量。如果市场上一半的参与者被随机分配到代币,理论预测 一半的代币会换手 。
We used a variation on Smith’s method for our experiment. Each session began with several rounds of trades for tokens, which perfectly replicated Smith’s finding. The estimated number of trades was typically very close or identical to the amount predicted by the standard theory. The tokens, of course, had value only because they could be exchanged for the experimenter’s cash; they had no value for use. Then we conducted a similar market for an object that we expected people to value for use: an attractive coffee mug, decorated with the university insignia of wherever we were conducting the experiments. The mug was then worth about $6 (and would be worth about double that amount today). Mugs were distributed randomly to half the participants. The Sellers had their mug in front of them, and the Buyers were invited to look at their neighbor’s mug; all indicated the price at which they would trade. The Buyers had to use their own money to acquire a mug. The results were dramatic: the average selling price was about double the average buying price, and the estimated number of trades was less than half of the number predicted by standard theory. The magic of the market did not work for a good that the owners expected to use.
我们在实验中使用了史密斯方法的一个变种。 每一节课都是以几轮交易换取代币开始的,这完全复制了史密斯的发现。估计的交易数量通常与标准理论预测的数量非常接近或相同。当然,代币有价值只是因为它们可以换取实验者的现金;它们没有使用价值 。然后,我们对一个我们期望人们重视使用的物品进行了类似的市场:一个有吸引力的咖啡杯,装饰着我们进行实验的地方的大学标志。杯子当时的价值约为6美元(今天的价值约为这个数字的两倍)。杯子被随机分配给一半的参与者。卖家将他们的杯子放在他们面前, ,买家被邀请看他们邻居的杯子;所有人都表示他们将以什么价格进行交易。买方不得不用自己的钱来购买一个杯子。结果是戏剧性的:平均卖价是平均买价的两倍,而估计的交易数量还不到标准理论预测的一半。市场的魔力没有发挥作用 ,以换取业主期望使用的货物。
We conducted a series of experiments using variants of the same procedure, always with the same results. My favorite is one in which we added to the Sellers and Buyers a third group—Choosers. Unlike the Buyers, who had to spend their own money to acquire the good, the Choosers could receive either a mug or a sum of money, and they indicated the amount of money that was as desirable as receiving the good. These were the results:
我们使用相同程序的变体进行了一系列的实验,结果总是一样的。 我最喜欢的是我们在卖家和买家中加入了第三组--选择者。与买方不同的是,买方必须花自己的钱来获得货物,而选择者可以 ,要么收到一个杯子,要么收到一笔钱,他们表示钱的数量 ,与收到货物一样令人满意。结果是这样的。
Sellers
卖家
$7.12
$7.12
Choosers
选择者
$3.12
$3.12
Buyers
购买者
$2.87
$2.87
The gap between Sellers and Choosers is remarkable, because they actually face the same choice! If you are a Seller you can go home with either a mug or money, and if you are a Chooser you have exactly the same two options. The long-term effects of the decision are identical for the two groups. The only difference is in the emotion of the moment. The high price that Sellers set reflects the reluctance to give up an object that they already own, a reluctance that can be seen in babies who hold on fiercely to a toy and show great agitation when it is taken away. Loss aversion is built into the automatic evaluations of System 1.
卖家和选择者之间的差距是显著的,因为他们实际上面临着同样的选择。如果你是卖家,你可以带着杯子或钱回家,如果你是选择者,你有完全相同的两个选择。决定的长期影响对两组人来说是相同的 。唯一的区别在于当时的情绪。 卖家设定的高价反映了他们不愿意放弃已经拥有的物品,这种不情愿可以在婴儿身上看到,他们猛地抱住一个玩具,当它被拿走的时候表现得非常焦急。损失厌恶是建立在系统1的自动评估中的。
Buyers and Choosers set similar cash values, although the Buyers have to pay for the mug, which is free for the Choosers. This is what we would expect if Buyers do not experience spending money on the mug as a loss. Evidence from brain imaging confirms the difference. Selling goods that one would normally use activates regions of the brain that are associated with disgust and pain. Buying also activates these areas, but only when the prices are perceived as too high—when you feel that a seller is taking money that exceeds the exchange value. Brain recordings also indicate that buying at especially low prices is a pleasurable event.
买方和选择方设定了类似的现金 ,不过买方必须支付杯子的费用,而选择方则是免费。如果买家不把花钱买杯子当作一种损失,这就是我们的预期。来自大脑成像的证据证实了这种差异。出售通常会使用的商品会激活大脑中与厌恶和痛苦有关的区域。购买也会激活这些区域,但只有当 价格被认为太高时--当你觉得卖家拿的钱超过了交换价值。 大脑记录 还表明,以特别低的价格购买是一件令人愉快的事情。
The cash value that the Sellers set on the mug is a bit more than twice as high as the value set by Choosers and Buyers. The ratio is very close to the loss aversion coefficient in risky choice, as we might expect if the same value function for gains and losses of money is applied to both riskless and risky decisions . A ratio of about 2:1 has appeared in studies of diverse economic domains, including the response of households to price changes. As economists would predict, customers tend to increase their purchases of eggs, orange juice, or fish when prices drop and to reduce their purchases when prices rise; however, in contrast to the predictions of economic theory, the effect of price increases (losses relative to the reference price) is about twice as large as the effect of gains.
卖家在杯子上设定的现金价值比选择者和买家设定的价值高了一倍多一点。该比率非常接近风险选择中的损失规避系数, ,因为如果对 无风险决策和风险决策 适用相同的货币收益和损失的价值函数,我们可能会预期。大约2:1的比率出现在不同经济领域的研究中,包括家庭对价格变化的反应。正如经济学家所预测的那样,当价格下降时,顾客倾向于增加鸡蛋、橙汁或鱼的购买量,而当价格上涨时,则减少购买 ;然而,与经济理论的预测相反, 价格上涨的效果 (相对于参考价格的损失)大约是收益效果的两倍。
The mugs experiment has remained the standard demonstration of the endowment effect, along with an even simpler experiment that Jack Knetsch reported at about the same time. Knetsch asked two classes to fill out a questionnaire and rewarded them with a gift that remained in front of them for the duration of the experiment. In one session, the prize was an expensive pen; in another, a bar of Swiss chocolate. At the end of the class, the experimenter showed the alternative gift and allowed everyone to trade his or her gift for another. Only about 10% of the participants opted to exchange their gift. Most of those who had received the pen stayed with the pen, and those who had received the chocolate did not budge either.
杯子实验一直是禀赋效应的标准证明,同时还有杰克-克内奇在同一时期报告的一个更简单的实验。克内奇要求两个班级 ,填写一份 ,并奖励他们一份礼物,在实验过程中一直放在他们面前。在一次实验中,奖品是一支昂贵的笔;在另一次实验中,是一块瑞士巧克力。在课堂结束时,实验者展示了替代的礼物,并允许每个人用他或她的礼物换取另一个。只有约10%的参与者选择交换他们的 礼物。大多数收到笔的人都留在了笔上,而那些收到巧克力的人也没有动摇。
THINKING LIKE A TRADER
像交易员一样思考
The fundamental ideas of prospect theory are that reference points exist, and that losses loom larger than corresponding gains. Observations in real markets collected over the years illustrate the power of these concepts . A study of the market for condo apartments in Boston during a downturn yielded particularly clear results. The authors of that study compared the behavior of owners of similar units who had bought their dwellings at different prices. For a rational agent, the buying price is irrelevant history—the current market value is all that matters. Not so for Humans in a down market for housing. Owners who have a high reference point and thus face higher losses set a higher price on their dwelling, spend a longer time trying to sell their home, and eventually receive more money.
前景理论的基本观点是,参考点是存在的,而且损失比相应的收益更大。多年来在真实市场上的观察, 说明了这些概念的力量 。在 波士顿的公寓 市场 ,在经济低迷时期,产生了特别明显的结果。该研究的作者比较了以不同价格购买其住宅的类似单位的业主的行为。 对于一个理性的代理人来说,购买价格是无关紧要的历史,当前的市场价值是所有重要的。在住房市场下滑的情况下,人类就不一样了。具有较高参考 ,从而面临较高损失的业主为他们的住宅设定了较高的价格,花费较长的时间试图出售他们的房屋,并最终获得更多的资金。
The original demonstration of an asymmetry between selling prices and buying prices (or, more convincingly, between selling and choosing) was very important in the initial acceptance of the ideas of reference point and loss aversion. However, it is well understood that reference points are labile, especially in unusual laboratory situations, and that the endowment effect can be eliminated by changing the reference point.
卖价和买价(或者更有说服力的是,卖价和选价)之间不对称的最初证明,对于最初接受参考点和损失厌恶的观点非常重要。然而, ,人们都明白,参考点是不稳定的,特别是在通常的实验室情况下,而且禀赋效应可以通过改变参考点来消除。
No endowment effect is expected when owners view their goods as carriers of value for future exchanges, a widespread attitude in routine commerce and in financial markets. The experimental economist John List, who has studied trading at baseball card conventions, found that novice traders were reluctant to part with the cards they owned, but that this reluctance eventually disappeared with trading experience. More surprisingly, List found a large effect of trading experience on the endowment effect for new goods.
当所有者把他们的商品看作是未来交换的价值载体时,就不会有禀赋效应,这是常规商业和金融市场中普遍存在的态度。实验经济学家John List, ,他研究了棒球卡大会上的交易,发现新手交易者不愿意放弃他们拥有的卡片,但这种不情愿最终随着交易经验的增加而消失。更令人惊讶的是,List发现 交易经验 对新商品的禀赋效应有很大 影响 。
At a convention, List displayed a notice that invited people to take part in a short survey, for which they would be compensated with a small gift: a coffee mug or a chocolate bar of equal value. The gifts were assigned at random. As the volunteers were about to leave, List said to each of them, “We gave you a mug [or chocolate bar], but you can trade for a chocolate bar [or mug] instead, if you wish.” In an exact replication of Jack Knetsch’s earlier experiment, List found that only 18% of the inexperienced traders were willing to exchange their gift for the other. In sharp contrast, experienced traders showed no trace of an endowment effect: 48% of them traded! At least in a market environment in which trading was the norm, they showed no reluctance to trade.
在一个会议上,List展示了一个通知,邀请人们参加一个简短的调查, ,为此他们将得到一个小礼物的补偿:一个咖啡杯或一个同等价值的巧克力棒。礼物是随机分配的。当志愿者们准备离开时,李斯特对他们每个人说, "我们给了你一个杯子[或巧克力棒],但如果你愿意,你可以用巧克力棒[或杯子]代替。"在对杰克-克内奇早期实验的精确复制中,利斯特发现,只有 18%的没有经验的交易者愿意用他们的礼物换取对方。与此形成鲜明对比的是,有经验的交易员没有显示出禀赋效应的痕迹:他们中有48%的人进行了交易!至少在一个以交易为常态的市场环境中,他们没有表现出不愿意交易。
Jack Knetsch also conducted experiments in which subtle manipulations made the endowment effect disappear. Participants displayed an endowment effect only if they had physical possession of the good for a while before the possibility of trading it was mentioned. Economists of the standard persuasion might be tempted to say that Knetsch had spent too much time with psychologists, because his experimental manipulation showed concern for the variables that social psychologists expect to be important. Indeed, the different methodological concerns of experimental economists and psychologists have been much in evidence in the ongoing debate about the endowment effect .
Jack Knetsch也 进行了一些实验,在这些实验中,微妙的操作使得禀赋效应消失。参与者 ,只有在提到交易的可能性之前,他们对物品的实际占有有一段时间,才会显示出禀赋效应。 标准观点的经济学家可能会说,克奈茨花了太多的时间在心理学家身上,因为他的实验操作显示出对社会心理学家所期望的重要变量的关注。事实上,实验经济学家和心理学家在方法论上的不同关注,在 目前关于禀赋效应的辩论 中已经有很多证据。
Veteran traders have apparently learned to ask the correct question, which is “How much do I want to have that mug, compared with other things I could have instead?” This is the question that Econs ask, and with this question there is no endowment effect, because the asymmetry between the pleasure of getting and the pain of giving up is irrelevant.
资深交易员显然已经学会了提出正确的问题,即 "与我可以拥有的其他东西相比,我想 拥有 这个杯子的程度如何?"这就是Econs提出的问题,有了这个问题,就没有禀赋 ,因为得到的快乐和放弃的痛苦之间的不对称是不相关的。
Recent studies of the psychology of “decision making under poverty” suggest that the poor are another group in which we do not expect to find the endowment effect. Being poor, in prospect theory, is living below one’s reference point. There are goods that the poor need and cannot afford, so they are always “in the losses.” Small amounts of money that they receive are therefore perceived as a reduced loss, not as a gain. The money helps one climb a little toward the reference point, but the poor always remain on the steep limb of the value function.
最近对 "贫困下的决策 "的心理学研究表明,穷人是另一个我们不期望发现禀赋效应的群体。 在前景理论中,贫困就是生活在一个人的参考点以下。有些商品是穷人需要而又买不起的,所以 ,他们总是在 "吃亏"。因此,他们收到的少量金钱被认为是减少的损失,而不是收益。 这些钱帮助人们向参考点爬升一点,但穷人总是停留在价值函数的陡峭边缘。
People who are poor think like traders, but the dynamics are quite different. Unlike traders, the poor are not indifferent to the differences between gaining and giving up. Their problem is that all their choices are between losses. Money that is spent on one good is the loss of another good that could have been purchased instead. For the poor, costs are losses.
穷人 的想法和商人一样,但动力却完全不同。与商人不同,穷人对获得和放弃之间的差异 ,并不是无动于衷。他们的问题是,他们所有的选择都是在损失之间进行的。 花在一种商品上的钱是对另一种本可以购买的商品的损失。对于穷人来说,成本就是损失。
We all know people for whom spending is painful, although they are objectively quite well-off. There may also be cultural differences in the attitude toward money, and especially toward the spending of money on whims and minor luxuries, such as the purchase of a decorated mug. Such a difference may explain the large discrepancy between the results of the “mugs study” in the United States and in the UK . Buying and selling prices diverge substantially in experiments conducted in samples of students of the United States, but the differences are much smaller among English students. Much remains to be learned about the endowment effect.
我们都知道有些人的消费是痛苦的,尽管他们客观上是相当富裕的。在对待金钱的态度上也可能存在文化差异,特别是 ,把钱花在心血来潮和小奢侈品上,比如购买一个装饰的杯子。这种差异 ,可能解释了 美国和英国 的 "杯子研究 "结果之间的巨大差异。在美国的学生样本中进行的实验中,购买和销售价格有很大的差异,但在英国 。关于禀赋效应,还有很多东西有待了解。
SPEAKING OF THE ENDOWMENT EFFECT
说到禀赋效应
“She didn’t care which of the two offices she would get, but a day after the announcement was made, she was no longer willing to trade. Endowment effect!”
"她并不关心她会得到这两个办公室中的哪一个,但在宣布后的一天,她不再愿意交易。禀赋效应!"
“These negotiations are going nowhere because both sides find it difficult to make concessions, even when they can get something in return. Losses loom larger than gains.”
"这些谈判毫无进展,因为双方都觉得很难做出让步,即使他们可以得到一些回报。 损失比收益大"。
“When they raised their prices, demand dried up.”
"当他们提高价格时,需求就干涸了。"
“He just hates the idea of selling his house for less money than he paid for it. Loss aversion is at work.”
"他只是讨厌以低于他所付的钱出售他的房子。 损失厌恶在起作用。"
“He is a miser, and treats any dollar he spends as a loss.”
"他是个守财奴,把花掉的任何一块钱都当成损失。"
28
28
Bad Events
不良事件
The concept of loss aversion is certainly the most significant contribution of psychology to behavioral economics. This is odd, because the idea that people evaluate many outcomes as gains and losses, and that losses loom larger than gains, surprises no one. Amos and I often joked that we were engaged in studying a subject about which our grandmothers knew a great deal. In fact, however, we know more than our grandmothers did and can now embed loss aversion in the context of a broader two-systems model of the mind, and specifically a biological and psychological view in which negativity and escape dominate positivity and approach. We can also trace the consequences of loss aversion in surprisingly diverse observations: only out-of-pocket losses are compensated when goods are lost in transport; attempts at large-scale reforms very often fail; and professional golfers putt more accurately for par than for a birdie. Clever as she was, my grandmother would have been surprised by the specific predictions from a general idea she considered obvious.
损失厌恶的概念无疑是心理学对行为经济学最重要的贡献。这很奇怪,因为人们将许多结果评价为收益和损失,而且损失 ,比收益大,这种想法没有人感到惊讶。阿莫斯和我经常开玩笑说,我们从事的是一个我们的祖母知道很多东西的研究。然而,事实上,我们比我们的祖母知道得更多,而且现在可以将损失厌恶嵌入更广泛的心灵双系统模型的背景中,特别是生物学和心理学观点,其中消极性和逃避支配着积极性 和方法。我们还可以在令人惊讶的各种观察中追踪损失厌恶的后果:当货物在运输过程中丢失时,只有自费损失才能得到补偿;大规模改革的尝试常常失败;职业高尔夫球手推杆比小鸟更准确。尽管我的祖母很聪明,但她会对来自 ,一个她认为很明显的一般想法的具体预测感到惊讶。
NEGATIVITY DOMINANCE
消极性支配

Figure 12
图 12
Your heartbeat accelerated when you looked at the left-hand figure. It accelerated even before you could label what is so eerie about that picture. After some time you may have recognized the eyes of a terrified person. The eyes on the right, narrowed by the raised cheeks of a smile, express happiness—and they are not nearly as exciting. The two pictures were presented to people lying in a brain scanner. Each picture was shown for less than 2 ⁄ 100 of a second and immediately masked by “visual noise,” a random display of dark and bright squares. None of the observers ever consciously knew that he had seen pictures of eyes, but one part of their brain evidently knew: the amygdala, which has a primary role as the “threat center” of the brain, although it is also activated in other emotional states. Images of the brain showed an intense response of the amygdala to a threatening picture that the viewer did not recognize. The information about the threat probably traveled via a superfast neural channel that feeds directly into a part of the brain that processes emotions, bypassing the visual cortex that supports the conscious experience of “seeing.” The same circuit also causes schematic angry faces (a potential threat) to be processed faster and more efficiently than schematic happy faces. Some experimenters have reported that an angry face “pops out” of a crowd of happy faces, but a single happy face does not stand out in an angry crowd. The brains of humans and other animals contain a mechanism that is designed to give priority to bad news. By shaving a few hundredths of a second from the time needed to detect a predator, this circuit improves the animal’s odds of living long enough to reproduce. The automatic operations of System 1 reflect this evolutionary history. No comparably rapid mechanism for recognizing good news has been detected. Of course, we and our animal cousins are quickly alerted to signs of opportunities to mate or to feed, and advertisers design billboards accordingly. Still, threats are privileged above opportunities, as they should be.
当你看着左边的图时,你的 心跳加速了 。它甚至在你能够标明那张照片的阴森之处之前就加速了。一段时间后,你可能已经认出了一个惊恐的人的眼睛。右边的眼睛,因微笑的脸颊上扬而缩小,表达了幸福--它们几乎没有那么激动人心。这两张图片被呈现给躺在 大脑扫描仪中的人。每张图片的显示时间不到 2/100 秒,并立即被 "视觉噪音 "所掩盖,这是一种随机显示的黑暗和明亮的方块。没有一个观察者有意识地知道他看到了眼睛的图片,但他们大脑的一个部分显然知道:杏仁核,它的主要作用是作为大脑的 "威胁中心",尽管它也在其他情绪 状态下被激活。大脑的图像显示,杏仁核对观看者不认识的威胁性图片有强烈的反应。有关威胁的信息可能通过一个超快的神经通道直接进入大脑处理情绪的部分, 绕过了 支持 "看 "的意识体验的 视觉皮层 。同样的回路也导致图示 愤怒的脸(一种潜在的威胁)比图示快乐的脸 更快 、更有效地被 处理 。一些实验者报告说,一张愤怒的脸从一群快乐的脸中 "跳出来" ,但一张快乐的脸在愤怒的人群中并不显眼。人类和其他动物的大脑含有一种机制,旨在优先考虑坏消息。通过从检测捕食者所需的时间中减去几百分之一 ,这个电路提高了动物活得足够长以进行繁殖的几率。系统1的自动操作反映了这一进化史。目前还没有检测到类似的快速识别好消息的机制。当然,我们和我们的动物表亲很快就会被提醒有机会交配或进食的迹象,广告商也会相应地设计广告牌 。尽管如此,威胁还是被置于机会之上,因为它们应该如此。
The brain responds quickly even to purely symbolic threats. Emotionally loaded words quickly attract attention, and bad words ( war , crime ) attract attention faster than do happy words ( peace , love ). There is no real threat, but the mere reminder of a bad event is treated in System 1 as threatening. As we saw earlier with the word vomit , the symbolic representation associatively evokes in attenuated form many of the reactions to the real thing, including physiological indices of emotion and even fractional tendencies to avoid or approach, recoil or lean forward. The sensitivity to threats extends to the processing of statements of opinions with which we strongly disagree. For example, depending on your attitude to euthanasia, it would take your brain less than one-quarter of a second to register the “threat” in a sentence that starts with “I think euthanasia is an acceptable/unacceptable …”
大脑甚至对纯粹的象征性威胁也会做出快速反应。带有情感色彩的词语会迅速吸引注意力,坏的词语 (战争 、 犯罪 )比快乐的词语 (和平 、 爱 )更能吸引注意力。虽然没有真正的威胁,但在系统1中,仅仅是对一个坏事件的提醒就被视为威胁。正如我们在前面看到的 呕吐 这个词,象征性的表述以减弱的形式唤起了对真实事物的许多反应,包括情绪的生理指数,甚至是回避或接近、退缩或前倾的零星倾向。对威胁的敏感性延伸到对我们强烈反对的意见陈述的处理。 例如,根据你对安乐死的态度 ,你的大脑只需花不到四分之一秒的时间就能在一个以 "我认为安乐死是一种 可接受/不可接受的 ...... "为开头的句子中识别出 "威胁"。
The psychologist Paul Rozin, an expert on disgust, observed that a single cockroach will completely wreck the appeal of a bowl of cherries, but a cherry will do nothing at all for a bowl of cockroaches. As he points out, the negative trumps the positive in many ways, and loss aversion is one of many manifestations of a broad negativity dominance . Other scholars, in a paper titled “Bad Is Stronger Than Good,” summarized the evidence as follows: “Bad emotions, bad parents, and bad feedback have more impact than good ones, and bad information is processed more thoroughly than good. The self is more motivated to avoid bad self-definitions than to pursue good ones. Bad impressions and bad stereotypes are quicker to form and more resistant to disconfirmation than good ones.” They cite John Gottman, the well-known expert in marital relations, who observed that the long-term success of a relationship depends far more on avoiding the negative than on seeking the positive. Gottman estimated that a stable relationship requires that good interactions outnumber bad interactions by at least 5 to 1. Other asymmetries in the social domain are even more striking. We all know that a friendship that may take years to develop can be ruined by a single action.
心理学家保罗-罗津(Paul Rozin)是一位研究厌恶的专家,他观察到一只蟑螂会完全破坏一碗樱桃的吸引力,但一颗樱桃对一碗蟑螂完全没有作用。正如他所指出的 ,负面的东西在很多方面都胜过正面的东西,而损失厌恶是广泛的 负面性支配 的众多表现之一。其他学者在一篇题为 "坏的比好的更强大 "的论文中,将证据总结如下。"坏的情绪、坏的父母和坏的反馈比好的影响更大,而且坏的信息比好的信息处理得更彻底。自我更有动力 ,避免坏的自我定义,而不是追求好的定义。坏的印象和坏的定型观念比好的印象和定型观念形成得更快,更容易被 否定 "。他们引用了著名的婚姻关系专家约翰-戈特曼(John Gottman)的观点,他观察到,一段关系的长期成功远远取决于避免负面的东西,而不是寻求正面的东西。戈特曼估计,一个稳定的 关系需要好的互动比坏的互动至少多出5比1。社会领域中的其他不对称现象甚至更加引人注目。我们都知道,一个可能需要多年发展的友谊可以被一个单一的行动所毁掉。
Some distinctions between good and bad are hardwired into our biology. Infants enter the world ready to respond to pain as bad and to sweet (up to a point) as good. In many situations, however, the boundary between good and bad is a reference point that changes over time and depends on the immediate circumstances. Imagine that you are out in the country on a cold night, inadequately dressed for the torrential rain, your clothes soaked. A stinging cold wind completes your misery. As you wander around, you find a large rock that provides some shelter from the fury of the elements. The biologist Michel Cabanac would call the experience of that moment intensely pleasurable because it functions, as pleasure normally does, to indicate the direction of a biologically significant improvement of circumstances. The pleasant relief will not last very long, of course, and you will soon be shivering behind the rock again, driven by your renewed suffering to seek better shelter.
好与坏之间的一些区别是硬性规定在我们的生物学中的。婴儿来到这个世界,准备好对疼痛作出坏的反应,对甜的反应(在 )则是好的。然而,在许多情况下,好与坏的界限是一个参考点,随着时间的推移而变化,并取决于眼前的情况。想象一下,在一个寒冷的夜晚,你在乡下,在暴雨中衣衫不整,衣服湿透。一阵刺骨的寒风让你更加痛苦。当你四处游荡时,你发现了一块大石头,它为你提供了一些 ,以抵御元素的狂暴。生物学家米歇尔-卡巴纳克(Michel Cabanac)将那一刻的经历称为强烈的愉悦,因为它的功能,就像愉悦通常所做的那样,表明了 生物学上的重大 环境 改善 的方向。当然,这种愉快的缓解不会持续很久,你很快就会再次在岩石后面发抖,在你重新开始的 痛苦的驱使下,寻找更好的住所。
GOALS ARE REFERENCE POINTS
目标是参考点
Loss aversion refers to the relative strength of two motives: we are driven more strongly to avoid losses than to achieve gains. A reference point is sometimes the status quo, but it can also be a goal in the future: not achieving a goal is a loss, exceeding the goal is a gain. As we might expect from negativity dominance, the two motives are not equally powerful . The aversion to the failure of not reaching the goal is much stronger than the desire to exceed it.
损失厌恶指的是两种动机的相对强度:我们避免损失的动机比实现收益的动机更强烈。一个参考点是 ,有时是现状,但也可以是未来的一个目标:没有实现目标是损失,超过目标是收益。正如我们从消极性支配中所预期的那样,这两种动机 ,其 力量并不相等 。对达不到目标的失败的厌恶要比超过目标的愿望强烈得多。
People often adopt short-term goals that they strive to achieve but not necessarily to exceed. They are likely to reduce their efforts when they have reached an immediate goal, with results that sometimes violate economic logic. New York cabdrivers, for example, may have a target income for the month or the year, but the goal that controls their effort is typically a daily target of earnings. Of course, the daily goal is much easier to achieve (and exceed) on some days than on others. On rainy days, a New York cab never remains free for long, and the driver quickly achieves his target; not so in pleasant weather, when cabs often waste time cruising the streets looking for fares. Economic logic implies that cabdrivers should work many hours on rainy days and treat themselves to some leisure on mild days, when they can “buy” leisure at a lower price. The logic of loss aversion suggests the opposite: drivers who have a fixed daily target will work many more hours when the pickings are slim and go home early when rain-drenched customers are begging to be taken somewhere.
人们经常采用短期目标,努力实现但不一定超过。当他们达到眼前的目标时,他们可能会减少努力,其结果有时会违反经济逻辑。例如,纽约的出租车司机,可能有 本月或全年的目标收入,但控制他们努力的目标通常是每天的收入目标。当然,每天的目标在某些日子比其他日子更容易实现(和超过)。在雨天,纽约的出租车从来不会长时间保持空闲,司机很快就能实现他的目标;而在天气好的时候就不一样了,出租车经常浪费时间在街上巡视, ,寻找车费。经济逻辑意味着,出租车司机应该在雨天工作很多时间,而在温和的日子里,他们可以用较低的价格 "购买 "休闲。 损失规避的逻辑表明,情况恰恰相反:司机如果有固定的每日目标,就会在收入微薄的时候多工作几小时,而在 被雨淋湿的顾客 乞求被带到某个地方的时候提前回家。
The economists Devin Pope and Maurice Schweitzer, at the University of Pennsylvania, reasoned that golf provides a perfect example of a reference point: par. Every hole on the golf course has a number of strokes associated with it; the par number provides the baseline for good—but not outstanding—performance. For a professional golfer, a birdie (one stroke under par) is a gain, and a bogey (one stroke over par) is a loss. The economists compared two situations a player might face when near the hole:
宾夕法尼亚大学的经济学家Devin Pope和Maurice Schweitzer认为,高尔夫提供了一个完美的参考点的例子:标准杆。高尔夫球场上的每一个球洞都有一个与之相关的杆数;标准杆数提供了良好--但不是卓越--的基准。对于一个职业高尔夫球手来说,一只小鸟(低于标准杆一杆)是一种收获, ,而一个柏忌(高于标准杆一杆)是一种损失。经济学家们比较了球员在接近球洞时可能面临的两种情况。
putt to avoid a bogey
推杆以避免柏忌
putt to achieve a birdie
推杆达到小鸟的目的
Every stroke counts in golf, and in professional golf every stroke counts a lot. According to prospect theory, however, some strokes count more than others. Failing to make par is a loss, but missing a birdie putt is a foregone gain, not a loss. Pope and Schweitzer reasoned from loss aversion that players would try a little harder when putting for par (to avoid a bogey) than when putting for a birdie. They analyzed more than 2.5 million putts in exquisite detail to test that prediction.
在高尔夫运动中,每一杆都很重要,而在职业高尔夫中,每一杆都很重要。然而,根据前景理论,有些杆数比其他杆数更重要。未能达到标准杆是一种损失,但错过小鸟推杆则是一种预知的 ,而不是一种损失。Pope和Schweitzer从损失厌恶的角度推断,球员在推入标准杆时(避免柏忌)会比推入小鸟球时更努力一些。他们分析了超过250万个推杆,以测试这一预测的细节。
They were right. Whether the putt was easy or hard, at every distance from the hole, the players were more successful when putting for par than for a birdie. The difference in their rate of success when going for par (to avoid a bogey) or for a birdie was 3.6%. This difference is not trivial. Tiger Woods was one of the “participants” in their study. If in his best years Tiger Woods had managed to putt as well for birdies as he did for par, his average tournament score would have improved by one stroke and his earnings by almost $1 million per season. These fierce competitors certainly do not make a conscious decision to slack off on birdie putts, but their intense aversion to a bogey apparently contributes to extra concentration on the task at hand.
他们是对的。无论推杆是容易还是困难,在距离球洞的每个距离 ,选手们在推杆时的成功率 ,比推小鸟球的成功率高。他们在争取保帕(避免柏忌)和争取小鸟时的成功率相差3.6%。这个差异不是小事。泰格-伍兹是他们研究中的 "参与者 "之一。如果泰格-伍兹在他最好的年华里,能够像抓小鸟一样成功推杆,那么他的平均比赛成绩就会提高一杆,他每赛季的收入几乎 ,100万美元。这些激烈的竞争者当然不会有意识地决定在小鸟推杆上偷懒,但他们对柏忌的强烈厌恶显然有助于更加专注于手头的工作。
The study of putts illustrates the power of a theoretical concept as an aid to thinking. Who would have thought it worthwhile to spend months analyzing putts for par and birdie? The idea of loss aversion, which surprises no one except perhaps some economists, generated a precise and nonintuitive hypothesis and led researchers to a finding that surprised everyone—including professional golfers.
对推杆的研究说明了一个理论概念作为辅助思考的力量。谁会想到值得花几个月的时间来分析推杆打平和 小鸟?损失厌恶的概念,除了一些经济学家之外,没有人感到惊讶,它产生了一个精确而非直观的假设,并使研究人员得出了一个让所有人都感到惊讶的发现--包括职业高尔夫球手。
DEFENDING THE STATUS QUO
捍卫现状
If you are set to look for it, the asymmetric intensity of the motives to avoid losses and to achieve gains shows up almost everywhere. It is an ever-present feature of negotiations, especially of renegotiations of an existing contract, the typical situation in labor negotiations and in international discussions of trade or arms limitations. The existing terms define reference points, and a proposed change in any aspect of the agreement is inevitably viewed as a concession that one side makes to the other. Loss aversion creates an asymmetry that makes agreements difficult to reach. The concessions you make to me are my gains, but they are your losses; they cause you much more pain than they give me pleasure. Inevitably, you will place a higher value on them than I do. The same is true, of course, of the very painful concessions you demand from me, which you do not appear to value sufficiently! Negotiations over a shrinking pie are especially difficult, because they require an allocation of losses. People tend to be much more easygoing when they bargain over an expanding pie.
如果你有心去寻找,避免损失和实现收益的动机的不对称强度几乎到处都能看到。这是谈判的一个 ,特别是现有合同的重新谈判,这是劳工谈判和国际贸易或武器限制讨论中的典型情况。现有的条款定义了参考点,对协议的任何方面提出的改变都不可避免地被视为一方对另一方的让步。损失厌恶产生了一种不对称性 ,使协议难以达成。你对我的让步是我的收获,但却是你的损失;它们给你带来的痛苦比给我带来的快乐多得多。不可避免地,你会比我更看重它们。当然,你要求我作出的非常痛苦的让步也是如此,你似乎并不充分重视这些让步!这就是我们的谈判。关于缩小的蛋糕的谈判 ,特别困难,因为它们需要分配损失。当人们为一个不断扩大的蛋糕讨价还价时,他们往往会更随和。
Many of the messages that negotiators exchange in the course of bargaining are attempts to communicate a reference point and provide an anchor to the other side. The messages are not always sincere. Negotiators often pretend intense attachment to some good (perhaps missiles of a particular type in bargaining over arms reductions), although they actually view that good as a bargaining chip and intend ultimately to give it away in an exchange. Because negotiators are influenced by a norm of reciprocity, a concession that is presented as painful calls for an equally painful (and perhaps equally inauthentic) concession from the other side.
谈判者在讨价还价过程中交流的许多信息都是试图 传达一个参考点 ,为对方提供一个锚。这些信息并不总是真诚的。谈判者经常假装对某些 ,也许是在军备削减谈判中的 特定类型的导弹,尽管他们实际上将该货物视为讨价还价的筹码,并打算最终在交换中放弃它。由于谈判者受到互惠准则的影响,一个表现为痛苦的让步会要求另一方做出同样痛苦的(也许同样不真实的)让步。
Animals, including people, fight harder to prevent losses than to achieve gains. In the world of territorial animals, this principle explains the success of defenders. A biologist observed that “when a territory holder is challenged by a rival, the owner almost always wins the contest —usually within a matter of seconds.” In human affairs, the same simple rule explains much of what happens when institutions attempt to reform themselves, in “reorganizations” and “restructuring” of companies, and in efforts to rationalize a bureaucracy, simplify the tax code, or reduce medical costs. As initially conceived, plans for reform almost always produce many winners and some losers while achieving an overall improvement. If the affected parties have any political influence, however, potential losers will be more active and determined than potential winners; the outcome will be biased in their favor and inevitably more expensive and less effective than initially planned. Reforms commonly include grandfather clauses that protect current stakeholders—for example, when the existing workforce is reduced by attrition rather than by dismissals, or when cuts in salaries and benefits apply only to future workers. Loss aversion is a powerful conservative force that favors minimal changes from the status quo in the lives of both institutions and individuals. This conservatism helps keep us stable in our neighborhood, our marriage, and our job; it is the gravitational force that holds our life together near the reference point.
动物, ,包括人,为防止损失而战斗比为实现收益而战斗更难。在领地动物的世界里,这一原则解释了保卫者的成功。一位生物学家观察到,"当领地拥有者受到对手的挑战时,领地拥有者几乎总是 赢得比赛--通常是 在几秒钟之内。"在人类事务中,同样的简单规则解释了当机构 ,在公司的 "重组 "和 "改组 "中,以及在努力使官僚机构合理化、简化税法或减少医疗费用时发生的大部分情况。按照最初的设想,改革计划几乎总是产生许多赢家和一些输家,同时实现整体改善。 然而,如果受影响的各方有任何政治影响力,潜在的输家会比潜在的赢家更加积极和坚定;结果会偏向于他们,并且不可避免地比最初的计划更加昂贵和低效。改革通常包括保护当前利益相关者的祖父条款--例如,当现有劳动力通过减员而不是解雇来减少时,或者当削减工资和福利只适用于未来的工人时。 损失厌恶是一种强大的保守力量,它倾向于在机构和个人的生活中对现状进行最小的改变。这种保守主义有助于使我们在邻里、婚姻和工作中保持稳定;它是使我们的生活在参考点附近保持一致的引力。
LOSS AVERSION IN THE LAW
法律中的损失厌恶
During the year that we spent working together in Vancouver, Richard Thaler, Jack Knetsch, and I were drawn into a study of fairness in economic transactions, partly because we were interested in the topic but also because we had an opportunity as well as an obligation to make up a new questionnaire every week. The Canadian government’s Department of Fisheries and Oceans had a program for unemployed professionals in Toronto, who were paid to administer telephone surveys. The large team of interviewers worked every night and new questions were constantly needed to keep the operation going. Through Jack Knetsch, we agreed to generate a questionnaire every week, in four color-labeled versions. We could ask about anything; the only constraint was that the questionnaire should include at least one mention of fish, to make it pertinent to the mission of the department. This went on for many months, and we treated ourselves to an orgy of data collection.
在我们在温哥华一起工作的一年里, Richard Thaler、Jack Knetsch和我被吸引到一项关于经济交易公平性的研究中,部分原因是我们对这个话题感兴趣,但也因为我们有机会以及有义务每周编造一份新的问卷调查。加拿大政府的渔业和海洋部有一个针对多伦多的失业专业人员的项目,他们有报酬地管理 电话调查。这支庞大的调查员队伍每天晚上都在工作,不断需要新的问题来维持运作。通过Jack Knetsch,我们同意每周生成一份问卷 ,有四个彩色标签的版本。我们可以问任何问题;唯一的限制是,问卷中至少要有一处提到鱼,以使其与该部门的任务相关 。这种情况持续了好几个月,我们对自己进行了一次数据收集的狂欢。
We studied public perceptions of what constitutes unfair behavior on the part of merchants, employers, and landlords . Our overarching question was whether the opprobrium attached to unfairness imposes constraints on profit seeking. We found that it does. We also found that the moral rules by which the public evaluates what firms may or may not do draw a crucial distinction between losses and gains. The basic principle is that the existing wage, price, or rent sets a reference point, which has the nature of an entitlement that must not be infringed. It is considered unfair for the firm to impose losses on its customers or workers relative to the reference transaction, unless it must do so to protect its own entitlement. Consider this example:
我们研究了公众对什么是 商人、雇主和房东 的不公平行为的看法。我们的首要问题是,对不公平行为的厌恶是否会对追求利润造成限制。我们发现,它确实如此。我们还发现,公众据以评估企业可以或不可以做什么的道德规则 ,在损失和收益之间做出了关键的区分。基本原则是,现有的工资、价格或租金设定了一个参考点,它具有不得侵犯的权利的性质。公司将相对于参考交易的损失强加给其客户或工人被认为是不公平的,除非它必须 ,以保护自己的权利。请看这个例子。
A hardware store has been selling snow shovels for $15. The morning after a large snowstorm, the store raises the price to $20.
一家五金店一直以15美元的价格出售雪铲。一场大雪过后的早晨,该店将价格提高到20美元。
Please rate this action as:
请将此行动评价为。
Completely Fair Acceptable Unfair Very Unfair
完全公平 可接受 不公平 非常不公平
The hardware store behaves appropriately according to the standard economic model: it responds to increased demand by raising its price. The participants in the survey did not agree: 82% rated the action Unfair or Very Unfair. They evidently viewed the pre-blizzard price as a reference point and the raised price as a loss that the store imposes on its customers, not because it must but simply because it can. A basic rule of fairness, we found, is that the exploitation of market power to impose losses on others is unacceptable. The following example illustrates this rule in another context (the dollar values should be adjusted for about 100% inflation since these data were collected in 1984):
根据标准的经济模式,五金店的行为是适当的:它通过提高其 价格来应对需求的增加。参与调查的人并不同意:82%的人认为这种行为不公平或非常不公平。他们显然把暴雪前的价格看作是一个参考点,而把提高的价格看作是商店强加给顾客的损失,不是因为它必须这样做,而只是因为它可以这样做。我们发现,公平的一个基本规则是,利用市场力量将损失强加于人是不可接受的。 下面的例子在另一个背景下说明了这一规则(由于这些数据是在1984年收集的,所以美元价值应该根据大约100%的通货膨胀率进行调整)。
A small photocopying shop has one employee who has worked there for six months and earns $9 per hour. Business continues to be satisfactory, but a factory in the area has closed and unemployment has increased. Other small shops have now hired reliable workers at $7 an hour to perform jobs similar to those done by the photocopy shop employee. The owner of the shop reduces the employee’s wage to $7.
一家小型复印店有一名员工,他已经在那里工作了六个月,每小时收入9美元。生意继续令人满意,但该地区的一家工厂已经关闭,失业率上升。其他小商店 ,现在已经以每小时7美元的价格雇佣了可靠的工人,从事与该复印店员工类似的工作。店主把员工的工资降到7美元。
The respondents did not approve: 83% considered the behavior Unfair or Very Unfair. However, a slight variation on the question clarifies the nature of the employer’s obligation. The background scenario of a profitable store in an area of high unemployment is the same, but now
的受访者不赞成。83%的人认为这种行为不公平或非常不公平。然而,对这个问题稍作改动,就可以澄清雇主义务的性质。在高失业率地区的一家盈利的商店 ,其背景情况是一样的,但现在
the current employee leaves, and the owner decides to pay a replacement $7 an hour.
现在的雇员离开了,而老板决定付给替代者每小时7美元。
A large majority (73%) considered this action Acceptable. It appears that the employer does not have a moral obligation to pay $9 an hour. The entitlement is personal: the current worker has a right to retain his wage even if market conditions would allow the employer to impose a wage cut. The replacement worker has no entitlement to the previous worker’s reference wage, and the employer is therefore allowed to reduce pay without the risk of being branded unfair.
绝大多数人(73%)认为这一行动可以接受。看来,雇主并没有支付每小时9美元的道德义务。权利是个人的:目前的工人有权利保留他的工资,即使市场条件允许 ,雇主也可以强行削减工资。替代工人没有权利获得以前工人的参考工资,因此,雇主被允许减少工资而没有被打成不公平的风险。
The firm has its own entitlement, which is to retain its current profit. If it faces a threat of a loss, it is allowed to transfer the loss to others. A substantial majority of respondents believed that it is not unfair for a firm to reduce its workers’ wages when its profitability is falling. We described the rules as defining dual entitlements to the firm and to individuals with whom it interacts. When threatened, it is not unfair for the firm to be selfish. It is not even expected to take on part of the losses; it can pass them on.
公司有自己的权利,那就是保留其当前的利润。如果它面临亏损的威胁,它可以将损失转嫁给其他人。相当多的受访者 ,认为企业在盈利能力下降时降低工人工资并不公平。我们将这些规则描述为对企业和与之互动的个人的双重权利的界定。当受到威胁时,企业自私自利并非不公平。它甚至不需要承担部分损失;它可以将这些损失转嫁出去。
Different rules governed what the firm could do to improve its profits or to avoid reduced profits. When a firm faced lower production costs, the rules of fairness did not require it to share the bonanza with either its customers or its workers. Of course, our respondents liked a firm better and described it as more fair if it was generous when its profits increased, but they did not brand as unfair a firm that did not share. They showed indignation only when a firm exploited its power to break informal contracts with workers or customers, and to impose a loss on others in order to increase its profit. The important task for students of economic fairness is not to identify ideal behavior but to find the line that separates acceptable conduct from actions that invite opprobrium and punishment.
不同的规则规定了公司可以 ,以提高其利润或避免利润减少。当一个公司面临较低的生产成本时,公平规则并不要求它与客户或工人分享红利。当然,我们的受访者更喜欢一家公司,如果它在利润增加时慷慨解囊,他们会认为它更公平,但他们并没有把不分享的公司打成不公平。只有当一个公司利用其权力破坏与工人或客户的非正式合同,并将损失强加给他人以增加其利润时,他们才会表现出愤慨 。研究经济公平的学生的重要任务不是确定理想的行为,而是找到将可接受的行为与招致责难和惩罚的行为分开的界限。
We were not optimistic when we submitted our report of this research to the American Economic Review . Our article challenged what was then accepted wisdom among many economists that economic behavior is ruled by self-interest and that concerns for fairness are generally irrelevant. We also relied on the evidence of survey responses, for which economists generally have little respect. However, the editor of the journal sent our article for evaluation to two economists who were not bound by those conventions (we later learned their identity; they were the most friendly the editor could have found). The editor made the correct call. The article is often cited, and its conclusions have stood the test of time. More recent research has supported the observations of reference-dependent fairness and has also shown that fairness concerns are economically significant , a fact we had suspected but did not prove. Employers who violate rules of fairness are punished by reduced productivity, and merchants who follow unfair pricing policies can expect to lose sales. People who learned from a new catalog that the merchant was now charging less for a product that they had recently bought at a higher price reduced their future purchases from that supplier by 15%, an average loss of $90 per customer. The customers evidently perceived the lower price as the reference point and thought of themselves as having sustained a loss by paying more than appropriate. Moreover, the customers who reacted the most strongly were those who bought more items and at higher prices. The losses far exceeded the gains from the increased purchases produced by the lower prices in the new catalog.
当我们向《 美国经济评论》 提交关于这项研究的 报告时,我们并不乐观。我们的文章挑战了当时许多经济学家公认的智慧,即经济行为是由自我利益支配的,对公平的关注通常是不相关的。我们还依靠了调查问卷的证据,而经济学家通常对这些证据并不尊重。 然而,该杂志的编辑将我们的文章 ,让两位不受这些惯例约束的经济学家进行评估(我们后来知道了他们的身份;他们是编辑能找到的最友好的人)。编辑做出了正确的决定。这篇文章经常被引用,其结论也经受住了时间的考验。最近的研究支持了对参考性公平的观察,也表明 公平问题在经济上具有重要意义 ,我们曾怀疑但没有证明这一事实。违反公平规则的雇主会受到生产力下降的惩罚,而遵循不公平定价政策的商人可以预期会失去销售额。人们从一个新的目录中得知,商家现在对他们最近以较高价格购买的产品收费较低,他们今后从 该供应商那里购买的产品减少了15%,每个客户平均损失90美元。显然,顾客认为较低的价格是参考点,并认为自己因支付了超过适当的价格而遭受了损失。此外,反应最强烈的顾客是那些以更高价格购买更多物品的顾客。这些损失远远超过了新目录中较低的价格所带来的购买量的增加 。
Unfairly imposing losses on people can be risky if the victims are in a position to retaliate. Furthermore, experiments have shown that strangers who observe unfair behavior often join in the punishment. Neuroeconomists (scientists who combine economics with brain research) have used MRI machines to examine the brains of people who are engaged in punishing one stranger for behaving unfairly to another stranger. Remarkably, altruistic punishment is accompanied by increased activity in the “pleasure centers” of the brain. It appears that maintaining the social order and the rules of fairness in this fashion is its own reward. Altruistic punishment could well be the glue that holds societies together. However, our brains are not designed to reward generosity as reliably as they punish meanness. Here again, we find a marked asymmetry between losses and gains.
如果受害者有能力进行报复,不公平地将损失强加于人可能会有风险。此外,实验表明,观察到不公平行为的陌生人往往会加入惩罚行列。神经经济学家(将经济学与大脑研究相结合的科学家)使用核磁共振成像机器检查了参与惩罚的人的大脑, ,一个陌生人对另一个陌生人的不公平行为。值得注意的是, 利他主义的惩罚伴随着 大脑中 "快乐中心 "活动的增加。看来,以这种方式维护社会秩序和公平规则本身就是一种奖励。利他主义的惩罚很可能是维系社会的胶水。然而,我们的大脑并没有被设计成像惩罚吝啬那样可靠地奖励 慷慨的行为。在这里,我们再次发现损失和收益之间存在明显的不对称性。
The influence of loss aversion and entitlements extends far beyond the realm of financial transactions. Jurists were quick to recognize their impact on the law and in the administration of justice. In one study, David Cohen and Jack Knetsch found many examples of a sharp distinction between actual losses and foregone gains in legal decisions. For example, a merchant whose goods were lost in transit may be compensated for costs he actually incurred, but is unlikely to be compensated for lost profits. The familiar rule that possession is nine-tenths of the law confirms the moral status of the reference point. In a more recent discussion, Eyal Zamir makes the provocative point that the distinction drawn in the law between restoring losses and compensating for foregone gains may be justified by their asymmetrical effects on individual well-being . If people who lose suffer more than people who merely fail to gain, they may also deserve more protection from the law.
损失厌恶和权利的影响远远超出了金融交易的范畴。法学家们很快就认识到它们对法律和司法的影响。在一项研究中,David Cohen和Jack Knetsch发现了许多例子,在法律决定中, 实际损失和放弃的收益 之间有明显的区别 。例如,货物在运输过程中丢失的商人可能会得到他实际发生的费用的赔偿,但不太可能得到利润损失的赔偿。熟悉的规则,即占有是法律的十分之九,证实了参考点的道德地位。在最近的一次讨论中,Eyal Zamir提出了一个挑衅性的观点: ,法律中在恢复损失 和补偿放弃的收益之间所做的区分,可能是由于它们 对个人福祉的不对称影响 而得到证明。如果损失的人比仅仅未能获得收益的人遭受更多的痛苦,他们可能也应该得到法律的更多保护。
SPEAKING OF LOSSES
说到损失
“This reform will not pass. Those who stand to lose will fight harder than those who stand to gain.”
"这项改革将不会通过。那些会失去的人将比那些 ,会得到的人更努力地争取。"
“Each of them thinks the other’s concessions are less painful. They are both wrong, of course. It’s just the asymmetry of losses.”
"他们中的每一个人都认为对方的让步是不那么痛苦的。当然,他们都错了。这只是损失的不对称性。"
“They would find it easier to renegotiate the agreement if they realized the pie was actually expanding. They’re not allocating losses; they are allocating gains.”
"如果他们意识到蛋糕实际上在扩大,他们会发现更容易重新谈判协议。他们不是在分配损失;他们是在分配收益"。
“Rental prices around here have gone up recently, but our tenants don’t think it’s fair that we should raise their rent, too. They feel entitled to their current terms.”
"最近这里的租金价格上涨了,但我们的租户认为我们也应该提高他们的租金,这并不 ,这很公平。他们觉得有权享受目前的条件。"
“My clients don’t resent the price hike because they know my costs have gone up, too. They accept my right to stay profitable.”
"我的客户并不反感涨价,因为他们知道我的成本也上升了。他们接受我保持盈利的权利。"
29
29
The Fourfold Pattern
四个方面的模式
Whenever you form a global evaluation of a complex object—a car you may buy, your son-in-law, or an uncertain situation—you assign weights to its characteristics. This is simply a cumbersome way of saying that some characteristics influence your assessment more than others do. The weighting occurs whether or not you are aware of it; it is an operation of System 1. Your overall evaluation of a car may put more or less weight on gas economy, comfort, or appearance. Your judgment of your son-in-law may depend more or less on how rich or handsome or reliable he is. Similarly, your assessment of an uncertain prospect assigns weights to the possible outcomes. The weights are certainly correlated with the probabilities of these outcomes: a 50% chance to win a million is much more attractive than a 1% chance to win the same amount. The assignment of weights is sometimes conscious and deliberate. Most often, however, you are just an observer to a global evaluation that your System 1 delivers.
每当你对一个复杂的对象--你可能购买的汽车、你的女婿或一个不确定的情况--形成总体评价时,你都会给它的特征分配权重。这只是一种繁琐的说法: ,一些特征比其他特征更能影响你的评估。无论你是否意识到,这种加权都会发生;它是系统1的一种操作。你对一辆车的总体评价可能会对汽油经济性、舒适性或外观给予或多或少的重视。你对女婿的判断可能或多或少取决于他有多富有、多英俊或多可靠。同样,你对一个不确定的 前景的评估也会对可能的结果分配权重。这些权重肯定与这些结果的概率相关:50%的机会赢得一百万,比1%的机会赢得同样的金额更有吸引力。权重的分配有时是有意识的、刻意的。然而,大多数时候,你只是一个观察者,观察你的系统1所提供的全局评价。
CHANGING CHANCES
改变 机会
One reason for the popularity of the gambling metaphor in the study of decision making is that it provides a natural rule for the assignment of weights to the outcomes of a prospect: the more probable an outcome, the more weight it should have. The expected value of a gamble is the average of its outcomes, each weighted by its probability. For example, the expected value of “20% chance to win $1,000 and 75% chance to win $100” is $275. In the pre-Bernoulli days, gambles were assessed by their expected value. Bernoulli retained this method for assigning weights to the outcomes, which is known as the expectation principle, but applied it to the psychological value of the outcomes. The utility of a gamble, in his theory, is the average of the utilities of its outcomes, each weighted by its probability.
赌博的隐喻在决策研究中很受欢迎,原因之一是它为前景结果的权重分配提供了一个自然规则:结果的可能性越大,它的权重就越大。赌博的期望值是其结果的平均值,每个结果都按其概率加权。例如,"20%的机会 ,赢得1000美元,75%的机会赢得100美元 "的预期值是275美元。 在伯努利之前的时代,赌博是以其预期值来评估的。伯努利保留了这种为结果分配权重的方法,也就是所谓的期望值原则,但将其应用于结果的心理价值。在他的理论中,一场赌博的效用是其结果效用的平均值,每个结果都按其概率加权 。
The expectation principle does not correctly describe how you think about the probabilities related to risky prospects. In the four examples below, your chances of receiving $1 million improve by 5%. Is the news equally good in each case?
期望原则没有正确描述你如何思考与风险前景有关的概率。在下面的四个例子中,你收到100万美元的机会提高了5%。每种情况下的消息都一样好吗?
A. From 0 to 5%
A.从0到5%
B. From 5% to 10%
B.从5%到10%
C. From 60% to 65%
C.从60%到65%
D. From 95% to 100%
D.从95%到100%
The expectation principle asserts that your utility increases in each case by exactly 5% of the utility of receiving $1 million. Does this prediction describe your experiences? Of course not.
期望原则断言,在每种情况下,你的效用都会增加 ,正好是收到100万美元的效用的5%。这个预测描述了你的经历吗?当然不是。
Everyone agrees that 0 ➞ 5% and 95% ➞ 100% are more impressive than either 5% ➞ 10% or 60% ➞ 65%. Increasing the chances from 0 to 5% transforms the situation, creating a possibility that did not exist earlier, a hope of winning the prize. It is a qualitative change, where 5 ➞ 10% is only a quantitative improvement. The change from 5% to 10% doubles the probability of winning, but there is general agreement that the psychological value of the prospect does not double. The large impact of 0 ➞ 5% illustrates the possibility effect , which causes highly unlikely outcomes to be weighted disproportionately more than they “deserve.” People who buy lottery tickets in vast amounts show themselves willing to pay much more than expected value for very small chances to win a large prize.
大家都同意,0➞5%和95%➞100%比5%➞10%或60%➞65%更令人印象深刻。将机会从0增加到5%改变了情况,创造了一种先前不存在的可能性,即赢得奖金的希望。这是一个质的变化, ,而5➞10%只是一个量的提高。从5%到10%的变化使中奖的概率增加了一倍,但人们普遍认为前景的心理价值并没有增加。0➞5%的巨大影响说明了 可能性效应 ,它导致极不可能的结果被不成比例地加权,而不是它们 "应得 "的。大量购买彩票的人 ,表明他们愿意为非常小的中大奖的机会付出比预期值高得多的钱。
The improvement from 95% to 100% is another qualitative change that has a large impact, the certainty effect . Outcomes that are almost certain are given less weight than their probability justifies. To appreciate the certainty effect, imagine that you inherited $1 million, but your greedy stepsister has contested the will in court. The decision is expected tomorrow. Your lawyer assures you that you have a strong case and that you have a 95% chance to win, but he takes pains to remind you that judicial decisions are never perfectly predictable. Now you are approached by a risk-adjustment company, which offers to buy your case for $910,000 outright—take it or leave it. The offer is lower (by $40,000!) than the expected value of waiting for the judgment (which is $950,000), but are you quite sure you would want to reject it? If such an event actually happens in your life, you should know that a large industry of “structured settlements” exists to provide certainty at a hefty price, by taking advantage of the certainty effect.
从95%到100%的改进是另一个有很大影响的质变,即 确定性效应 。几乎确定的结果被赋予的权重低于其概率所证明的。为了理解确定性效应,想象一下你继承了100万美元, ,但你贪婪的继妹在法庭上对遗嘱提出异议。预计明天就会有判决。你的律师向你保证,你有一个强有力的案子,你有95%的机会获胜,但他不厌其烦地提醒你,司法判决从来都是完全可以预测的。现在,一家风险调整公司找到你,提出以91万美元的价格买下你的案子--要么接受要么放弃。 这个报价比等待判决的预期价值 (95万美元)要低(4万美元!),但你确定你会拒绝它吗?如果这样的事件真的发生在你的生活中,你应该知道有一个庞大的 "结构性和解 "行业存在,通过利用确定性效应,以高昂的价格提供确定性。
Possibility and certainty have similarly powerful effects in the domain of losses. When a loved one is wheeled into surgery, a 5% risk that an amputation will be necessary is very bad—much more than half as bad as a 10% risk. Because of the possibility effect, we tend to overweight small risks and are willing to pay far more than expected value to eliminate them altogether. The psychological difference between a 95% risk of disaster and the certainty of disaster appears to be even greater; the sliver of hope that everything could still be okay looms very large. Overweighting of small probabilities increases the attractiveness of both gambles and insurance policies.
在损失领域,可能性和确定性也有类似的 ,具有强大的影响。当所爱的人被推进手术室时,5%的截肢风险是非常糟糕的,比10%的风险要糟糕一半以上。由于可能性效应,我们倾向于过度重视小风险,并愿意付出远高于预期价值的代价来完全消除它们。95%的灾难风险 ,与确定的灾难之间的心理差异似乎更大;一切都可能好起来的一丝希望显得非常大。对小概率的过度重视增加了赌博和保险政策的吸引力。
The conclusion is straightforward: the decision weights that people assign to outcomes are not identical to the probabilities of these outcomes, contrary to the expectation principle. Improbable outcomes are overweighted—this is the possibility effect. Outcomes that are almost certain are underweighted relative to actual certainty. The expectation principle , by which values are weighted by their probability, is poor psychology.
结论是直截了当的:人们赋予结果的决策权重与这些结果的概率不一致,这与 期望原则相反。不可能的结果会被加权--这就是可能性效应。相对于实际的确定性而言,几乎确定的结果被降低了权重。 期望原则 ,即根据其概率对价值进行加权,是糟糕的心理学。
The plot thickens, however, because there is a powerful argument that a decision maker who wishes to be rational must conform to the expectation principle. This was the main point of the axiomatic version of utility theory that von Neumann and Morgenstern introduced in 1944. They proved that any weighting of uncertain outcomes that is not strictly proportional to probability leads to inconsistencies and other disasters . Their derivation of the expectation principle from axioms of rational choice was immediately recognized as a monumental achievement, which placed expected utility theory at the core of the rational agent model in economics and other social sciences. Thirty years later, when Amos introduced me to their work, he presented it as an object of awe. He also introduced me to a famous challenge to that theory.
然而,情节变得更加复杂,因为有一个强有力的论据表明,一个希望成为理性的决策者 必须 符合 ,以符合期望原则。这是冯-诺伊曼和莫根斯坦在1944年提出的效用理论公理化版本的主要观点。他们证明,任何对不确定结果的加权,如果不与概率严格成正比,就会导致不一致 和其他灾难 。 他们从理性选择的公理中推导出的期望原理立即被公认为是一项不朽的成就,它将期望效用理论置于经济学和其他社会科学中的理性代理人模型的核心。30年后,当阿莫斯向我介绍他们的工作时,他把它作为一个敬畏的对象。他还向我介绍了对该理论的一个著名挑战。
ALLAIS’S PARADOX
阿莱氏悖论
In 1952, a few years after the publication of von Neumann and Morgenstern’s theory, a meeting was convened in Paris to discuss the economics of risk. Many of the most renowned economists of the time were in attendance. The American guests included the future Nobel laureates Paul Samuelson, Kenneth Arrow, and Milton Friedman, as well as the leading statistician Jimmie Savage.
1952年,在冯-诺伊曼和摩根斯坦的 理论发表几年后,在巴黎召开了一次会议,讨论风险的经济学问题。当时许多最知名的经济学家都参加了会议。美国客人包括未来的诺贝尔奖获得者保罗-萨缪尔森、 肯尼斯-阿罗和米尔顿-弗里德曼,以及著名的统计学家吉米-萨维奇。
One of the organizers of the Paris meeting was Maurice Allais, who would also receive a Nobel Prize some years later. Allais had something up his sleeve, a couple of questions on choice that he presented to his distinguished audience. In the terms of this chapter, Allais intended to show that his guests were susceptible to a certainty effect and therefore violated expected utility theory and the axioms of rational choice on which that theory rests. The following set of choices is a simplified version of the puzzle that Allais constructed . In problems A and B, which would you choose?
巴黎会议的组织者之一是莫里斯-阿莱(Maurice Allais),他也将在几年后获得诺贝尔奖 。阿莱的袖子里有一些东西,他向他的杰出的听众提出了几个关于选择的问题。在本章中,Allais打算证明他的客人容易受到确定性效应的影响,因此违反了预期效用理论和该理论所依据的理性选择公理。下面这组选择是 Allais所构建的谜题 的简化版 。在问题A和B中,你会选择哪个?
A. 61% chance to win $520,000 OR 63% chance to win $500,000
A.61%的机会赢得520,000美元或63%的机会赢得500,000美元
B. 98% chance to win $520,000 OR 100% chance to win $500,000
B.98%的机会赢得520,000美元或100%的机会赢得500,000美元
If you are like most other people, you preferred the left-hand option in problem A and you preferred the right-hand option in problem B. If these were your preferences, you have just committed a logical sin and violated the rules of rational choice. The illustrious economists assembled in Paris committed similar sins in a more involved version of the “Allais paradox.”
如果你和其他大多数人一样,在问题A中你更喜欢左边的选项,在问题B中你更喜欢右边的选项。如果这些是你的偏好,那么你 ,只是犯了一个逻辑上的罪,违反了理性选择的规则。聚集在巴黎的杰出经济学家们在 "阿莱悖论 "的一个更复杂的版本中犯了类似的罪。
To see why these choices are problematic, imagine that the outcome will be determined by a blind draw from an urn that contains 100 marbles—you win if you draw a red marble, you lose if you draw white. In problem A, almost everybody prefers the left-hand urn, although it has fewer winning red marbles, because the difference in the size of the prize is more impressive than the difference in the chances of winning. In problem B, a large majority chooses the urn that guarantees a gain of $500,000. Furthermore, people are comfortable with both choices—until they are led through the logic of the problem.
要知道为什么这些选择是有问题的,想象一下,结果将由从一个装有100颗弹珠的瓮中盲抽决定--如果你抽到红色弹珠,你就赢了,如果你抽到白色弹珠,你就输了。 在问题A中,几乎所有人都倾向于左边的瓮,尽管它的红色弹珠中奖率较低,因为奖金大小的差异比中奖几率的差异更令人印象深刻。在问题B中,绝大多数人都选择了保证获得50万美元的瓮。此外,人们对这两种选择都很满意--直到有人引导他们了解问题的逻辑。
Compare the two problems, and you will see that the two urns of problem B are more favorable versions of the urns of problem A, with 37 white marbles replaced by red winning marbles in each urn. The improvement on the left is clearly superior to the improvement on the right, since each red marble gives you a chance to win $520,000 on the left and only $500,000 on the right. So you started in the first problem with a preference for the left-hand urn, which was then improved more than the right-hand urn—but now you like the one on the right! This pattern of choices does not make logical sense, but a psychological explanation is readily available: the certainty effect is at work. The 2% difference between a 100% and a 98% chance to win in problem B is vastly more impressive than the same difference between 63% and 61% in problem A.
比较一下这两个问题,你会发现问题B的两个瓮是问题A的瓮的更有利的版本,每个瓮中的37颗白色弹珠被红色的获胜弹珠取代。左边的改进明显优于右边的改进,因为每颗红色弹珠在左边给你赢得52万美元的机会,在右边只有50万美元。所以你在 第一个问题的时候,就开始倾向于左边的瓮,当时的改进比右边的瓮更多--但现在你喜欢右边的那个!这就是你的选择。这种选择模式在逻辑上说不通,但心理学上的解释很容易 :确定性效应在起作用。在问题B中,100%和98%的获胜机会之间2%的差异比问题A中63%和61%之间的相同 。
As Allais had anticipated, the sophisticated participants at the meeting did not notice that their preferences violated utility theory until he drew their attention to that fact as the meeting was about to end. Allais had intended this announcement to be a bombshell: the leading decision theorists in the world had preferences that were inconsistent with their own view of rationality! He apparently believed that his audience would be persuaded to give up the approach that he rather contemptuously labeled “the American school” and adopt an alternative logic of choice that he had developed. He was to be sorely disappointed .
正如阿莱所预料的那样,参加会议的老练的与会者并没有注意到他们的偏好违反了效用理论,直到他在会议即将结束时提请他们注意这一事实。阿莱本想把这一宣布作为一个重磅炸弹:世界上领先的决策理论家们的偏好与他们自己的理性观点不一致, !他显然认为,他的听众会被说服,放弃他相当轻蔑地称之为 "美国学派 "的方法,而采用他所开发的另一种选择逻辑。他感到 非常失望 。
Economists who were not aficionados of decision theory mostly ignored the Allais problem. As often happens when a theory that has been widely adopted and found useful is challenged, they noted the problem as an anomaly and continued using expected utility theory as if nothing had happened. In contrast, decision theorists—a mixed collection of statisticians, economists, philosophers, and psychologists—took Allais’s challenge very seriously. When Amos and I began our work, one of our initial goals was to develop a satisfactory psychological account of Allais’s paradox.
那些不是决策理论爱好者的经济学家们大多忽略了阿莱问题。就像经常发生的那样,当一个已经被广泛采用并被认为有用的理论 ,他们注意到这个问题是一个反常现象,并继续使用预期效用理论,好像什么都没有发生过。相反,决策理论家--统计学家、经济学家、哲学家和心理学家的混合集合--非常认真地对待阿莱的挑战。当阿莫斯和我开始我们的工作时,我们最初的目标之一是为阿莱的悖论制定一个 令人满意的心理学解释。
Most decision theorists, notably including Allais, maintained their belief in human rationality and tried to bend the rules of rational choice to make the Allais pattern permissible. Over the years there have been multiple attempts to find a plausible justification for the certainty effect, none very convincing. Amos had little patience for these efforts; he called the theorists who tried to rationalize violations of utility theory “lawyers for the misguided.” We went in another direction. We retained utility theory as a logic of rational choice but abandoned the idea that people are perfectly rational choosers. We took on the task of developing a psychological theory that would describe the choices people make, regardless of whether they are rational. In prospect theory, decision weights would not be identical to probabilities.
大多数决策理论家,特别是包括Allais,都坚持相信人类的理性,并试图弯曲理性选择的规则,使Allais模式被允许。多年来,人们曾多次试图为确定性效应找到一个合理的理由,但都不太令人信服。阿莫斯对这些努力没有什么耐心 ;他称那些试图将违反效用理论的行为合理化的理论家为 "被误导者的律师"。我们走的是另一个方向。我们保留了效用理论作为理性选择的逻辑,但放弃了人们是完全理性选择者的想法。我们承担了发展一种心理学理论的任务,这种理论将描述人们做出的选择,无论 ,他们是否理性。在前景理论中,决策权重将不会与概率相同。
DECISION WEIGHTS
决策权重
Many years after we published prospect theory, Amos and I carried out a study in which we measured the decision weights that explained people’s preferences for gambles with modest monetary stakes. The estimates for gains are shown in table 4 .
在我们发表前景理论多年后,阿莫斯和我进行了一项研究,我们测量了解释人们对赌注不大的赌博的偏好的决策权重。 表4 中显示了 对收益的估计 。
Table 4
表 4
You can see that the decision weights are identical to the corresponding probabilities at the extremes: both equal to 0 when the outcome is impossible, and both equal to 100 when the outcome is a sure thing. However, decision weights depart sharply from probabilities near these points. At the low end, we find the possibility effect: unlikely events are considerably overweighted. For example, the decision weight that corresponds to a 2% chance is 8.1. If people conformed to the axioms of rational choice, the decision weight would be 2—so the rare event is overweighted by a factor of 4. The certainty effect at the other end of the probability scale is even more striking. A 2% risk of not winning the prize reduces the utility of the gamble by 13%, from 100 to 87.1.
你可以看到,决策权重与极端情况下的相应概率是相同的:当结果不可能发生时,两者都等于0,而当结果肯定发生时,两者都等于100。然而,在这些点附近,决策权重与概率有很大的出入。在低端,我们发现了可能性效应:不太可能发生的事件被大大加权了。例如, ,与2%的可能性相对应的决策权重是8.1。如果人们符合理性选择的公理,决策权重将是2--因此罕见事件的权重被放大了4倍。在概率表的另一端,确定性效应甚至更加引人注目。2%的 不 中奖风险使赌博的效用降低了13%,从100降至87.1。
To appreciate the asymmetry between the possibility effect and the certainty effect, imagine first that you have a 1% chance to win $1 million. You will know the outcome tomorrow. Now, imagine that you are almost certain to win $1 million, but there is a 1% chance that you will not. Again, you will learn the outcome tomorrow. The anxiety of the second situation appears to be more salient than the hope in the first. The certainty effect is also more striking than the possibility effect if the outcome is a surgical disaster rather than a financial gain. Compare the intensity with which you focus on the faint sliver of hope in an operation that is almost certain to be fatal, compared to the fear of a 1% risk.
为了理解 可能性效应和确定性效应之间的不对称性,首先想象一下,你有1%的机会赢得100万美元。你明天就会知道结果。现在,想象一下,你几乎肯定会赢得100万美元,但有1%的机会不会。同样,你明天就会知道结果。第二种情况下的焦虑似乎比第一种情况下的希望更突出, 。如果结果是一场手术灾难而不是经济收益,那么确定性效应也比可能性效应更为突出。比较一下,与对1%风险的恐惧相比,你对几乎肯定是致命的手术中那一丝微弱的希望所关注的强度。
The combination of the certainty effect and possibility effects at the two ends of the probability scale is inevitably accompanied by inadequate sensitivity to intermediate probabilities. You can see that the range of probabilities between 5% and 95% is associated with a much smaller range of decision weights (from 13.2 to 79.3), about two-thirds as much as rationally expected. Neuroscientists have confirmed these observations, finding regions of the brain that respond to changes in the probability of winning a prize. The brain’s response to variations of probabilities is strikingly similar to the decision weights estimated from choices .
概率表两端的确定性效应和可能性效应的结合,不可避免地伴随着对中间概率的不充分敏感性。你可以看到,5%和95%之间的概率范围与决策权重的范围小得多(从13.2到79.3),大约是理性预期的三分之二。神经科学家已经证实了这些观察结果,发现大脑的一些区域对 中奖概率的变化做出了反应。大脑对概率变化的反应与 从选择中估计 出的决策权重惊人地相似。
Probabilities that are extremely low or high (below 1% or above 99%) are a special case. It is difficult to assign a unique decision weight to very rare events, because they are sometimes ignored altogether, effectively assigned a decision weight of zero. On the other hand, when you do not ignore the very rare events, you will certainly overweight them. Most of us spend very little time worrying about nuclear meltdowns or fantasizing about large inheritances from unknown relatives. However, when an unlikely event becomes the focus of attention, we will assign it much more weight than its probability deserves. Furthermore, people are almost completely insensitive to variations of risk among small probabilities. A cancer risk of 0.001% is not easily distinguished from a risk of 0.00001%, although the former would translate to 3,000 cancers for the population of the United States, and the latter to 30.
概率极低或极高(低于1%或高于99%)是一种特殊情况。很难给非常罕见的事件分配一个独特的决策权重,因为它们有时会被完全忽略,有效地分配一个决策 ,权重为零。另一方面,当你不忽视 非常罕见的事件时,你肯定会给它们加权。我们大多数人很少花时间担心核泄漏或幻想从未知的亲戚那里得到大量的遗产。然而,当一个不太可能发生的事件成为关注的焦点时,我们会赋予它比其概率所应得的更大的权重。此外,人们 ,对小概率中的风险变化几乎完全不敏感。0.001%的癌症风险与0.00001%的风险不容易区分,尽管前者会转化为美国人口中的3000个癌症,而后者则是30个。
When you pay attention to a threat, you worry—and the decision weights reflect how much you worry. Because of the possibility effect, the worry is not proportional to the probability of the threat. Reducing or mitigating the risk is not adequate; to eliminate the worry the probability must be brought down to zero.
当你注意到一个威胁时,你就会担心--决策权重反映了你的担心程度。由于可能性 ,担心与威胁的概率不成正比。减少或减轻风险是不够的;要消除担忧,必须将概率降至零。
The question below is adapted from a study of the rationality of consumer valuations of health risks, which was published by a team of economists in the 1980s. The survey was addressed to parents of small children .
下面的问题改编自一个经济学家小组在20世纪80年代发表的关于消费者对健康风险估值的合理性的研究。该调查的对象是 小孩子的父母 。
Suppose that you currently use an insect spray that costs you $10 per bottle and it results in 15 inhalation poisonings and 15 child poisonings for every 10,000 bottles of insect spray that are used.
假设你目前使用的杀虫剂每瓶价格为10美元,每使用1万瓶杀虫剂就会造成15起吸入性中毒和15起儿童中毒事件。
You learn of a more expensive insecticide that reduces each of the risks to 5 for every 10,000 bottles. How much would you be willing to pay for it?
你了解到有一种更昂贵的杀虫剂,可以将每种风险降低到每10,000瓶中的5种。你愿意为它支付多少钱?
The parents were willing to pay an additional $2.38, on average, to reduce the risks by two-thirds from 15 per 10,000 bottles to 5. They were willing to pay $8.09, more than three times as much, to eliminate it completely. Other questions showed that the parents treated the two risks (inhalation and child poisoning) as separate worries and were willing to pay a certainty premium for the complete elimination of either one. This premium is compatible with the psychology of worry but not with the rational model.
家长们愿意额外支付 ,平均为2.38美元,以将风险降低三分之二,从每10,000瓶15个减少到5个。他们愿意支付8.09美元,是完全消除风险的三倍多。其他问题显示,父母将两种风险(吸入和儿童中毒)视为单独的担忧,并愿意为完全消除其中一种风险支付确定性的溢价。 这种溢价与 忧虑心理学 相符,但与理性模型不符。
THE FOURFOLD PATTERN
四维模式
When Amos and I began our work on prospect theory, we quickly reached two conclusions: people attach values to gains and losses rather than to wealth, and the decision weights that they assign to outcomes are different from probabilities. Neither idea was completely new, but in combination they explained a distinctive pattern of preferences that we called the fourfold pattern. The name has stuck. The scenarios are illustrated below.
当阿莫斯和我开始研究前景理论时,我们很快就得出了两个结论:人们对收益和损失的重视,而不是对 财富的重视,而且他们赋予结果的决策权重与概率不同。这两个观点都不是全新的,但结合起来,它们解释了 一种独特的偏好模式,我们称之为四重模式。这个名字已经坚持了下来。以下是这些情景的图示。

Figure 13
图13
The top row in each cell shows an illustrative prospect.
每个单元格的顶行显示了一个说明性的前景。
The second row characterizes the focal emotion that the prospect evokes.
第二行描述了前景所唤起的焦点情感。
The third row indicates how most people behave when offered a choice between a gamble and a sure gain (or loss) that corresponds to its expected value (for example, between “95% chance to win $10,000” and “$9,500 with certainty”). Choices are said to be risk averse if the sure thing is preferred, risk seeking if the gamble is preferred.
第三行表示,当提供给人们一个在赌博和与预期值相对应的肯定收益(或损失)之间的选择时,大多数人的行为(例如,在 "95%的机会赢得10,000美元 "和 "9500美元的肯定收益 "之间)。如果肯定的东西是首选,那么选择被称为 风险厌恶,如果赌博是首选,那么风险寻求。
The fourth row describes the expected attitudes of a defendant and a plaintiff as they discuss a settlement of a civil suit.
第四行描述了被告和原告在讨论民事诉讼的解决方案时的预期态度。
The fourfold pattern of preferences is considered one of the core achievements of prospect theory. Three of the four cells are familiar; the fourth (top right) was new and unexpected.
喜好的 四重模式 被认为是前景理论的核心成就之一。四个单元中的三个是人们所熟悉的;第四个(右上角)是新的和意想不到的。
The top left is the one that Bernoulli discussed: people are averse to risk when they consider prospects with a substantial chance to achieve a large gain. They are willing to accept less than the expected value of a gamble to lock in a sure gain.
左上角是伯努利所讨论的:当人们考虑到有很大机会获得巨大收益的前景时,他们会厌恶风险。他们愿意接受低于预期价值的赌博,以锁定一个肯定的收益。
The possibility effect in the bottom left cell explains why lotteries are popular. When the top prize is very large, ticket buyers appear indifferent to the fact that their chance of winning is minuscule. A lottery ticket is the ultimate example of the possibility effect. Without a ticket you cannot win, with a ticket you have a chance, and whether the chance is tiny or merely small matters little. Of course, what people acquire with a ticket is more than a chance to win; it is the right to dream pleasantly of winning.
左下角单元格中的可能性效应解释了为什么彩票会受欢迎。当最高奖金非常大时,买彩票的人显得无动于衷, ,因为他们中奖的机会微乎其微。一张彩票 ,是可能性效应的最终例子。没有彩票就不能中奖,有了彩票就有了机会,而这个机会是否微小或仅仅是小事一桩。当然,人们通过彩票获得的不仅仅是一个获胜的机会;它是愉快地梦想获胜的权利。
The bottom right cell is where insurance is bought. People are willing to pay much more for insurance than expected value—which is how insurance companies cover their costs and make their profits. Here again, people buy more than protection against an unlikely disaster; they eliminate a worry and purchase peace of mind.
右下方的单元格 ,是购买保险的地方。人们愿意为保险支付比预期价值高得多的费用--这也是保险公司支付其成本并获得利润的方式。在这里,人们购买的不仅仅是对不可能发生的灾难的保护;他们消除了担忧,购买了心灵的平静。
The results for the top right cell initially surprised us. We were accustomed to think in terms of risk aversion except for the bottom left cell, where lotteries are preferred. When we looked at our choices for bad options, we quickly realized that we were just as risk seeking in the domain of losses as we were risk averse in the domain of gains. We were not the first to observe risk seeking with negative prospects—at least two authors had reported that fact, but they had not made much of it . However, we were fortunate to have a framework that made the finding of risk seeking easy to interpret, and that was a milestone in our thinking. Indeed, we identified two reasons for this effect.
右上方单元格的结果最初让我们感到惊讶。我们习惯于用 ,除了左下角的单元格外,我们都是用风险厌恶的方式来思考的,因为左下角的单元格是首选。当我们审视自己对坏选项的选择时,我们很快意识到,我们在损失领域的风险寻求和在收益领域的风险厌恶一样。我们并不是第一个观察到在负面前景下寻求风险的人--至少有两位作者曾报道过这一事实,但他们 并没有对其做过多的介绍 。然而,我们很幸运地拥有一个框架,使寻求风险的发现易于解释,这是我们思维中的一个里程碑。事实上,我们确定了这种效应的两个原因。
First, there is diminishing sensitivity. The sure loss is very aversive because the reaction to a loss of $900 is more than 90% as intense as the reaction to a loss of $1,000. The second factor may be even more powerful: the decision weight that corresponds to a probability of 90% is only about 71, much lower than the probability. The result is that when you consider a choice between a sure loss and a gamble with a high probability of a larger loss, diminishing sensitivity makes the sure loss more aversive, and the certainty effect reduces the aversiveness of the gamble. The same two factors enhance the attractiveness of the sure thing and reduce the attractiveness of the gamble when the outcomes are positive.
首先,敏感性不断降低。肯定的损失是非常令人厌恶的,因为对900美元损失的反应比对1000美元损失的反应要强烈90%以上。第二个因素可能 ,甚至更为强大:与90%的概率相对应的决策权重只有71左右,比概率低得多。其结果是,当你考虑在肯定的损失和具有较大损失概率的赌博之间做出选择时,敏感性的降低使得肯定的损失更加令人厌恶,而确定性效应降低了赌博的厌恶性。同样的两个因素 ,当结果是积极的时候,肯定的事情会增强吸引力,而赌博的吸引力会降低。
The shape of the value function and the decision weights both contribute to the pattern observed in the top row of table 13 . In the bottom row, however, the two factors operate in opposite directions: diminishing sensitivity continues to favor risk aversion for gains and risk seeking for losses, but the overweighting of low probabilities overcomes this effect and produces the observed pattern of gambling for gains and caution for losses.
价值函数的形状和决策权重都有助于在 表13 的顶行观察到的模式。然而,在底部一行,这两个因素以相反的方向运作:敏感性的降低继续有利于对收益的风险规避和对损失的风险 ,但对低概率的加权克服了这种影响,产生了观察到的对收益的赌博和对损失的谨慎模式。
Many unfortunate human situations unfold in the top right cell. This is where people who face very bad options take desperate gambles, accepting a high probability of making things worse in exchange for a small hope of avoiding a large loss. Risk taking of this kind often turns manageable failures into disasters. The thought of accepting the large sure loss is too painful, and the hope of complete relief too enticing, to make the sensible decision that it is time to cut one’s losses. This is where businesses that are losing ground to a superior technology waste their remaining assets in futile attempts to catch up. Because defeat is so difficult to accept, the losing side in wars often fights long past the point at which the victory of the other side is certain, and only a matter of time.
许多不幸的人类情况都是在右上方的单元格中展开的。这就是面临非常糟糕的选择的人铤而走险的地方,接受 ,以使事情变得更糟的高概率来换取避免 大损失的小希望。这种冒险行为往往会把可控的失败变成灾难。想到接受巨大的肯定损失太痛苦了,而完全解脱的希望又太诱人了,以至于无法做出明智的决定,即是时候削减自己的损失了。这就是那些在优势技术面前败下阵来的企业在徒劳地试图追赶的过程中浪费他们的剩余资产。由于 失败是如此难以接受,战争中失败的一方往往在另一方的胜利是确定的、只是一个时间问题的时候,还在战斗很长时间。
GAMBLING IN THE SHADOW OF THE LAW
法律阴影下的赌博
The legal scholar Chris Guthrie has offered a compelling application of the fourfold pattern to two situations in which the plaintiff and the defendant in a civil suit consider a possible settlement. The situations differ in the strength of the plaintiff’s case.
法律学者克里斯-格思里(Chris Guthrie)将四重模式令人信服地应用于民事诉讼中原告和被告考虑可能和解的两种情况。 这两种情况在原告案件的力度上有所不同。
As in a scenario we saw earlier, you are the plaintiff in a civil suit in which you have made a claim for a large sum in damages. The trial is going very well and your lawyer cites expert opinion that you have a 95% chance to win outright, but adds the caution, “You never really know the outcome until the jury comes in.” Your lawyer urges you to accept a settlement in which you might get only 90% of your claim. You are in the top left cell of the fourfold pattern, and the question on your mind is, “Am I willing to take even a small chance of getting nothing at all? Even 90% of the claim is a great deal of money, and I can walk away with it now.” Two emotions are evoked, both driving in the same direction: the attraction of a sure (and substantial) gain and the fear of intense disappointment and regret if you reject a settlement and lose in court. You can feel the pressure that typically leads to cautious behavior in this situation. The plaintiff with a strong case is likely to be risk averse.
就像我们之前看到的一个场景,你是一个民事诉讼的原告,你提出了巨额赔偿的要求。审判进行得非常顺利,你的律师引用了专家的意见,认为你有95%的机会完全获胜,但又提醒说:"在陪审团来之前,你永远不会真正知道结果。"你的律师 ,敦促你接受和解,你可能只得到90%的索赔。你处于四重模式的左上角,你心中的问题是:"我是否愿意接受哪怕是一点都得不到的小机会?即使是90%的索赔额也是一笔大数目,而且我现在就可以拿着它离开。"两种情绪被唤起,都朝着同一个方向发展: ,一个肯定的(和可观的)收益的吸引力,以及如果你拒绝和解并在法庭上败诉,对强烈的失望和遗憾的恐惧。你可以感受到在这种情况下通常会导致谨慎行为的压力。拥有强大案件的原告很可能会规避风险。
Now step into the shoes of the defendant in the same case. Although you have not completely given up hope of a decision in your favor, you realize that the trial is going poorly. The plaintiff’s lawyers have proposed a settlement in which you would have to pay 90% of their original claim, and it is clear they will not accept less. Will you settle, or will you pursue the case? Because you face a high probability of a loss, your situation belongs in the top right cell. The temptation to fight on is strong: the settlement that the plaintiff has offered is almost as painful as the worst outcome you face, and there is still hope of prevailing in court. Here again, two emotions are involved: the sure loss is repugnant and the possibility of winning in court is highly attractive. A defendant with a weak case is likely to be risk seeking, prepared to gamble rather than accept a very unfavorable settlement. In the face-off between a risk-averse plaintiff and a risk-seeking defendant, the defendant holds the stronger hand. The superior bargaining position of the defendant should be reflected in negotiated settlements, with the plaintiff settling for less than the statistically expected outcome of the trial. This prediction from the fourfold pattern was confirmed by experiments conducted with law students and practicing judges, and also by analyses of actual negotiations in the shadow of civil trials .
现在站在同一案件的被告的立场上。虽然你还没有完全放弃对判决的希望, ,但你意识到,审判的进展很不顺利。原告的律师提出了一个和解方案,你必须支付他们原始索赔的90%,显然他们不会接受更少的金额。你是要和解,还是要继续打官司?因为你面临的损失概率很高,你的情况属于右上角的单元格。继续打下去的诱惑力很强:原告提出的和解 ,几乎和你面临的最坏结果一样痛苦,而且还有希望在法庭上获胜。 这里又涉及到两种情绪:肯定的损失令人厌恶,而在法庭上获胜的可能性则极具吸引力。一个案件薄弱的被告可能会寻求风险,准备赌博而不是接受一个非常不利的和解。在规避风险的原告和寻求风险的被告之间的对峙中, ,被告持有更强的手。被告的优势讨价还价地位应该反映在谈判的和解中,原告的和解金额低于统计学上的预期审判结果。对法律专业学生和执业 法官进行的实验,以及对 民事审判阴影下 的实际谈判的分析,都证实了这种来自四重模式的预测。
Now consider “frivolous litigation,” when a plaintiff with a flimsy case files a large claim that is most likely to fail in court. Both sides are aware of the probabilities, and both know that in a negotiated settlement the plaintiff will get only a small fraction of the amount of the claim. The negotiation is conducted in the bottom row of the fourfold pattern. The plaintiff is in the left-hand cell, with a small chance to win a very large amount; the frivolous claim is a lottery ticket for a large prize. Overweighting the small chance of success is natural in this situation, leading the plaintiff to be bold and aggressive in the negotiation. For the defendant, the suit is a nuisance with a small risk of a very bad outcome. Overweighting the small chance of a large loss favors risk aversion, and settling for a modest amount is equivalent to purchasing insurance against the unlikely event of a bad verdict. The shoe is now on the other foot: the plaintiff is willing to gamble and the defendant wants to be safe. Plaintiffs with frivolous claims are likely to obtain a more generous settlement than the statistics of the situation justify.
现在考虑一下 "无意义的诉讼",当原告用一个不可靠的案件提出一个大的索赔,而这个索赔很可能在法庭上失败。双方都知道这种可能性,而且都知道在谈判解决中,原告将只得到索赔金额的一小部分。谈判是在四重模式的底排进行的 。原告在左边的单元格中,有很小的机会赢得很大的金额; 轻率的索赔 是一张获得巨额奖金的彩票。在这种情况下,过度重视小的成功机会是很自然的,导致原告在谈判中大胆而积极。对被告来说,诉讼是一种滋扰,风险小 ,结果非常糟糕。对大额损失的小概率的过度权衡有利于规避风险,以适度的金额和解相当于为不可能出现的糟糕判决购买保险。现在的情况是:原告愿意赌博,被告希望安全。提出轻率索赔的原告有可能获得比 ,情况的统计数字所证明的更慷慨的和解。
The decisions described by the fourfold pattern are not obviously unreasonable. You can empathize in each case with the feelings of the plaintiff and the defendant that lead them to adopt a combative or an accommodating posture. In the long run, however, deviations from expected value are likely to be costly. Consider a large organization, the City of New York, and suppose it faces 200 “frivolous” suits each year, each with a 5% chance to cost the city $1 million. Suppose further that in each case the city could settle the lawsuit for a payment of $100,000. The city considers two alternative policies that it will apply to all such cases: settle or go to trial. (For simplicity, I ignore legal costs.)
四重模式所描述的决定并不是明显不合理的。在每个案例中,你都可以理解原告和被告的感受,这些感受导致他们采取争斗或包容的姿态。然而,从长远来看,偏离预期价值很可能是代价高昂的。考虑一个大型组织,即纽约市 ,并假设它每年面临200起 "无意义 "的诉讼,每起诉讼有5%的机会使该市损失100万美元。假设在每个案件中,该市都能以10万美元的价格解决诉讼。市政府考虑对所有这类案件采取两种替代政策:和解或进行审判。(为了简单起见,我忽略了法律费用)。
If the city litigates all 200 cases, it will lose 10, for a total loss of $10 million.
如果该市对所有200个案件进行诉讼,它 ,将失去10个,总损失为1000万美元。
If the city settles every case for $100,000, its total loss will be $20 million.
如果 ,该市以10万美元的价格解决每个案件,其总损失将是2000万美元。
When you take the long view of many similar decisions, you can see that paying a premium to avoid a small risk of a large loss is costly. A similar analysis applies to each of the cells of the fourfold pattern: systematic deviations from expected value are costly in the long run—and this rule applies to both risk aversion and risk seeking. Consistent overweighting of improbable outcomes—a feature of intuitive decision making—eventually leads to inferior outcomes.
当你从长计议许多类似的决定时,你可以看到,为避免大的损失的小风险而支付溢价是昂贵的。类似的分析适用于四重模式的每一个单元:从长远来看,系统性地偏离预期价值是有代价的, ,而且这一规则同时适用于风险厌恶和风险寻求。对不可能发生的结果的持续超重--这是直觉决策的特征--最终会导致较差的结果。
SPEAKING OF THE FOURFOLD PATTERN
说到四重模式
“He is tempted to settle this frivolous claim to avoid a freak loss, however unlikely. That’s overweighting of small probabilities. Since he is likely to face many similar problems, he would be better off not yielding.”
"他很想解决这个无意义的索赔,以避免畸形的损失,无论多么不可能。这是对小概率事件的过度重视。由于他可能会面临许多 类似的问题,他最好不要屈服。"
“We never let our vacations hang on a last-minute deal. We’re willing to pay a lot for certainty.”
"我们从来不让我们的假期挂在最后一刻的交易上。我们愿意为确定性付出很多。"
“They will not cut their losses so long as there is a chance of breaking even. This is risk-seeking in the losses.”
"只要有机会实现收支平衡,他们就不会减少损失。这就是在损失中寻求风险"。
“They know the risk of a gas explosion is minuscule, but they want it mitigated. It’s a possibility effect, and they want peace of mind.”
"他们知道煤气爆炸的风险微乎其微,但他们希望它得到缓解。这是一种可能性的影响,他们希望 ,让人放心。"
30
30
Rare Events
罕见的事件
I visited Israel several times during a period in which suicide bombings in buses were relatively common—though of course quite rare in absolute terms. There were altogether 23 bombings between December 2001 and September 2004, which had caused a total of 236 fatalities. The number of daily bus riders in Israel was approximately 1.3 million at that time. For any traveler, the risks were tiny, but that was not how the public felt about it. People avoided buses as much as they could, and many travelers spent their time on the bus anxiously scanning their neighbors for packages or bulky clothes that might hide a bomb.
我曾数次访问以色列,当时公共汽车上的自杀性爆炸事件相对较多--当然,从绝对数量上看是相当罕见的。从2001年12月到2004年9月 ,总共发生了23起爆炸事件,共造成236人死亡。当时,以色列每天乘坐公共汽车的人数约为130万。对于任何一个旅行者来说,风险都是很小的,但公众的感受却不是这样的。人们尽可能地避开公共汽车,许多旅客在公共汽车上焦急地扫视邻居的包裹或笨重的衣服,以免藏有 。
I did not have much occasion to travel on buses, as I was driving a rented car, but I was chagrined to discover that my behavior was also affected. I found that I did not like to stop next to a bus at a red light, and I drove away more quickly than usual when the light changed. I was ashamed of myself, because of course I knew better. I knew that the risk was truly negligible, and that any effect at all on my actions would assign an inordinately high “decision weight” to a minuscule probability. In fact, I was more likely to be injured in a driving accident than by stopping near a bus. But my avoidance of buses was not motivated by a rational concern for survival. What drove me was the experience of the moment: being next to a bus made me think of bombs, and these thoughts were unpleasant. I was avoiding buses because I wanted to think of something else.
我没有太多机会乘坐公共汽车,因为我开的是租来的车,但我懊恼地发现,我的行为也受到了影响。我发现我不喜欢在红灯时停在公共汽车旁边,而且当红灯改变时,我比平时开得更快。我为自己感到羞愧,因为我当然知道得更好。我知道风险确实可以忽略不计,而且 ,对我的行动有任何影响都会给一个微不足道的概率赋予过高的 "决策权重"。事实上,我在驾驶事故中受伤的可能性比在公共汽车附近停车的可能性更大。但我对公共汽车的回避并不是出于对生存的理性关注。驱使我的是当时的体验:在公共汽车旁边让我想到了炸弹,而这些想法是 不愉快的。我避开公共汽车是因为我想想到别的东西。
My experience illustrates how terrorism works and why it is so effective: it induces an availability cascade. An extremely vivid image of death and damage, constantly reinforced by media attention and frequent conversations, becomes highly accessible, especially if it is associated with a specific situation such as the sight of a bus. The emotional arousal is associative, automatic, and uncontrolled, and it produces an impulse for protective action. System 2 may “know” that the probability is low, but this knowledge does not eliminate the self-generated discomfort and the wish to avoid it . System 1 cannot be turned off. The emotion is not only disproportionate to the probability, it is also insensitive to the exact level of probability. Suppose that two cities have been warned about the presence of suicide bombers. Residents of one city are told that two bombers are ready to strike. Residents of another city are told of a single bomber. Their risk is lower by half, but do they feel much safer?
我的经历说明了恐怖主义是如何运作的,以及它为什么如此有效:它诱发了一个可得性级联。一个极其生动的死亡和 ,不断被媒体关注和频繁的谈话所强化,变得非常容易接受,特别是如果它与特定的情况相关联,如 ,看到一辆公共汽车。这种情绪唤醒是联想的、自动的、不受控制的,它产生了保护行动的冲动。系统2可能 "知道 "这种可能性很低,但这种知识并不能消除自我产生的不适感和 避免这种不适的愿望 。系统1不能被关闭。这种情绪不仅与概率不相称,而且对 确切的概率水平也不敏感。假设有两个城市被警告有自杀式炸弹袭击者存在。一个城市的居民被告知有两名炸弹客准备好了要进行袭击。另一个城市的居民则被告知只有一个炸弹客。他们的风险降低了一半,但他们是否感到更安全?
Many stores in New York City sell lottery tickets, and business is good. The psychology of high-prize lotteries is similar to the psychology of terrorism. The thrilling possibility of winning the big prize is shared by the community and reinforced by conversations at work and at home. Buying a ticket is immediately rewarded by pleasant fantasies, just as avoiding a bus was immediately rewarded by relief from fear. In both cases, the actual probability is inconsequential; only possibility matters. The original formulation of prospect theory included the argument that “highly unlikely events are either ignored or overweighted,” but it did not specify the conditions under which one or the other will occur, nor did it propose a psychological interpretation of it. My current view of decision weights has been strongly influenced by recent research on the role of emotions and vividness in decision making . Overweighting of unlikely outcomes is rooted in System 1 features that are familiar by now. Emotion and vividness influence fluency, availability, and judgments of probability—and thus account for our excessive response to the few rare events that we do not ignore.
纽约市的许多商店都出售彩票,而且生意很好。大奖娱乐官网 彩票的心理学与恐怖主义的心理学相似。赢得大奖的激动人心的可能性被社会所共享,并通过工作和家庭中的谈话得到加强。购买彩票立即得到了愉快幻想的奖励,就像躲避公共汽车立即得到了从恐惧中解脱的奖励一样。在这两种情况下,实际的概率是不重要的;只有可能性才是重要的。 前景理论的最初表述包括 "高度不可能的事件要么被忽视,要么被加权 "的论点,但它并没有具体说明在什么条件下会出现这样或那样的情况,也没有提出对它的心理学解释。我目前对决策权重的看法受到了最近关于情绪和 生动性在决策中 的作用的研究的强烈影响 。对不可能发生的结果的过高权重植根于系统1的特征,这些特征现在已经很熟悉了。情绪和生动性影响了流畅性、可得性和对概率的判断--从而说明了我们对少数我们没有忽视的罕见事件的过度反应。
OVERESTIMATION AND OVERWEIGHTING
高估和超重
What is your judgment of the probability that the next president of the United States will be a third-party candidate?
你对美国下一任总统是 第三方候选人的概率有何判断?
How much will you pay for a bet in which you receive $1,000 if the next president of the United States is a third-party candidate, and no money otherwise?
如果下一任美国总统是第三党候选人,你可以得到1000美元,否则没有钱,你会为这个赌注支付多少钱?
The two questions are different but obviously related. The first asks you to assess the probability of an unlikely event. The second invites you to put a decision weight on the same event, by placing a bet on it.
这两个问题是不同的,但明显相关。第一个问题要求你评估一个不太可能的事件的概率。第二个问题是请你对同一事件进行决策权重,即对其下注。
How do people make the judgments and how do they assign decision weights? We start from two simple answers, then qualify them. Here are the oversimplified answers:
人们如何做出判断,如何分配决策权重?我们从两个简单的答案开始,然后对其进行限定。以下是过度简化的答案。
People overestimate the probabilities of unlikely events.
人们高估了不可能发生的事件的概率。
People overweight unlikely events in their decisions.
人们在决策中对不可能发生的事件的权重过高。
Although overestimation and overweighting are distinct phenomena, the same psychological mechanisms are involved in both: focused attention, confirmation bias, and cognitive ease.
虽然高估和超重是不同的现象,但两者都涉及相同的心理机制: 集中注意力、确认偏差和认知轻松。
Specific descriptions trigger the associative machinery of System 1. When you thought about the unlikely victory of a third-party candidate, your associative system worked in its usual confirmatory mode, selectively retrieving evidence, instances, and images that would make the statement true. The process was biased, but it was not an exercise in fantasy. You looked for a plausible scenario that conforms to the constraints of reality; you did not simply imagine the Fairy of the West installing a third-party president. Your judgment of probability was ultimately determined by the cognitive ease, or fluency, with which a plausible scenario came to mind.
具体的描述会触发系统1的联想机制。当你想到第三方候选人不太可能获胜时,你的联想系统以其通常的确认模式工作,有选择地检索证据、实例和图像,使陈述成为真实。这个过程是有偏见的,但它不是一个 幻想的练习。你寻找的是一个符合现实约束的合理场景;你并没有简单地想象西方的仙女们会安装一个第三方总统。你对概率的判断最终是由认知上的难易程度决定的,或者说是流畅程度,在这种情况下,你会想到一个合理的方案。
You do not always focus on the event you are asked to estimate. If the target event is very likely, you focus on its alternative. Consider this example:
你并不总是专注于你被要求估计的事件。如果目标 事件是非常可能的,你就关注它的替代品。考虑一下这个例子。
What is the probability that a baby born in your local hospital will be released within three days?
在你当地医院出生的婴儿在三天内出院的概率是多少?
You were asked to estimate the probability of the baby going home, but you almost certainly focused on the events that might cause a baby not to be released within the normal period. Our mind has a useful capability to focus spontaneously on whatever is odd, different, or unusual. You quickly realized that it is normal for babies in the United States (not all countries have the same standards) to be released within two or three days of birth, so your attention turned to the abnormal alternative. The unlikely event became focal. The availability heuristic is likely to be evoked: your judgment was probably determined by the number of scenarios of medical problems you produced and by the ease with which they came to mind. Because you were in confirmatory mode, there is a good chance that your estimate of the frequency of problems was too high.
你被要求估计婴儿回家的概率,但你几乎肯定关注的是可能导致婴儿在正常时期内 不 被释放的事件。我们的头脑有一种有用的能力, ,自发地关注任何奇怪的、不同的或不寻常的东西。你很快意识到,在美国,婴儿在出生后两三天内被释放是正常的(并不是所有国家都有相同的标准),所以你的注意力转向了不正常的选择。不可能的事件成为焦点。可得性启发式很可能被唤起:你的判断可能是由你产生的医疗问题的情景的数量和它们出现在脑海中的容易程度决定的 。因为你 ,处于确认模式,所以你对问题频率的估计很有可能过高。
The probability of a rare event is most likely to be overestimated when the alternative is not fully specified. My favorite example comes from a study that the psychologist Craig Fox conducted while he was Amos’s student . Fox recruited fans of professional basketball and elicited several judgments and decisions concerning the winner of the NBA playoffs. In particular, he asked them to estimate the probability that each of the eight participating teams would win the playoff; the victory of each team in turn was the focal event.
当备选方案不完全明确时,一个罕见事件的概率最容易被高估。我最喜欢的例子来自于心理学家 Craig Fox的一项研究,当时他是 Amos的学生 。福克斯招募了职业篮球的球迷,诱导他们对NBA季后赛的获胜者做出一些判断和决定。特别是,他要求他们估计八支参赛球队中每支球队赢得季后赛的概率;每支球队的胜利依次是焦点事件。
You can surely guess what happened, but the magnitude of the effect that Fox observed may surprise you. Imagine a fan who has been asked to estimate the chances that the Chicago Bulls will win the tournament. The focal event is well defined, but its alternative—one of the other seven teams winning—is diffuse and less evocative. The fan’s memory and imagination, operating in confirmatory mode, are trying to construct a victory for the Bulls. When the same person is next asked to assess the chances of the Lakers, the same selective activation will work in favor of that team. The eight best professional basketball teams in the United States are all very good, and it is possible to imagine even a relatively weak team among them emerging as champion. The result: the probability judgments generated successively for the eight teams added up to 240%! This pattern is absurd, of course, because the sum of the chances of the eight events must add up to 100%. The absurdity disappeared when the same judges were asked whether the winner would be from the Eastern or the Western conference. The focal event and its alternative were equally specific in that question and the judgments of their probabilities added up to 100%.
你肯定能猜到发生了什么, ,但福克斯观察到的效果之大可能会让你吃惊。想象一下,一个球迷被要求估计芝加哥公牛队赢得比赛的可能性。焦点事件是明确的,但它的备选方案--其他七支球队获胜--是分散的,不那么令人回味的。球迷的记忆和想象力,在确认模式下运作,正试图为 公牛队构建一个胜利。当同一个人接下来被要求评估湖人队的机会时,同样的选择性激活将有利于该队。美国最好的八支职业篮球队都非常优秀,甚至有可能想象其中有一支相对较弱的球队成为冠军。结果是:为这八支球队相继产生的概率判断 ,加起来是240%!当然,这种模式是荒谬的,因为八个事件的概率之和 必须 加到100%。当同样的评委被问及冠军是来自东部还是西部联盟时,这种荒谬的现象就消失了。在这个问题中, 焦点事件和它的 备选方案同样具体,它们的概率判断相加为100%。
To assess decision weights, Fox also invited the basketball fans to bet on the tournament result. They assigned a cash equivalent to each bet (a cash amount that was just as attractive as playing the bet). Winning the bet would earn a payoff of $160. The sum of the cash equivalents for the eight individual teams was $287. An average participant who took all eight bets would be guaranteed a loss of $127! The participants surely knew that there were eight teams in the tournament and that the average payoff for betting on all of them could not exceed $160, but they overweighted nonetheless. The fans not only overestimated the probability of the events they focused on—they were also much too willing to bet on them.
为了评估 决策权重,福克斯还邀请篮球迷对比赛结果进行投注。他们给每个赌注分配了一个现金等价物(这个现金数额和打赌一样有吸引力)。赢得赌注将获得160美元的回报。八支个人队伍的现金等价物之和为287美元。一个普通的参与者如果参加了所有八个赌局,将保证损失127美元! 参赛者肯定知道比赛中有八支球队,押注所有球队的平均回报不可能超过160美元,但他们还是超额投注了。球迷们不仅高估了他们关注的事件的概率--他们也太愿意下注了。
These findings shed new light on the planning fallacy and other manifestations of optimism. The successful execution of a plan is specific and placing easy to imagine when one tries to forecast the outcome of a project. In contrast, the alternative of failure is diffuse, because there are innumerable ways for things to go wrong. Entrepreneurs and the investors who evaluate their prospects are prone both to overestimate their chances and to overweight their estimates.
这些发现对计划谬误和其他表现形式的 乐观主义有了新的认识。一个计划的成功执行是具体的,当人们试图预测一个项目的结果时, 容易想象。与此相反,失败的选择是分散的,因为有无数种方式可以出错。企业家和评估其前景的投资者很容易高估他们的机会,也很容易高估他们的估计。
VIVID OUTCOMES
生动的 成果
As we have seen, prospect theory differs from utility theory in the relationship it suggests between probability and decision weight. In utility theory, decision weights and probabilities are the same. The decision weight of a sure thing is 100, and the weight that corresponds to a 90% chance is exactly 90, which is 9 times more than the decision weight for a 10% chance. In prospect theory, variations of probability have less effect on decision weights. An experiment that I mentioned earlier found that the decision weight for a 90% chance was 71.2 and the decision weight for a 10% chance was 18.6. The ratio of the probabilities was 9.0, but the ratio of the decision weights was only 3.83, indicating insufficient sensitivity to probability in that range. In both theories, the decision weights depend only on probability, not on the outcome. Both theories predict that the decision weight for a 90% chance is the same for winning $100, receiving a dozen roses , or getting an electric shock. This theoretical prediction turns out to be wrong.
正如我们所看到的,前景理论与效用理论的不同之处在于它所提出的概率与决策权重之间的关系。在效用理论中,决策权重和概率是一样的。一个确定的事情的决策权重是100,而对应于90%的机会的权重正好是90,是10%的机会的决策权重的9倍。在前景理论中, ,概率的变化对决策权重的影响较小。我之前提到的一个实验发现,90%的机会的决策权重是71.2,10%的机会的决策权重是18.6。概率之比为9.0,但决策权重之比仅为3.83,表明在该范围内对概率的敏感性不足。在这两个理论中,决策 权重只取决于概率,而不取决于结果。两种理论都预测,对于赢得100美元、 收到一打玫瑰花 或受到电击,90%的机会的决策权重是相同的。这个理论预测结果是错误的。
Psychologists at the University of Chicago published an article with the attractive title “Money, Kisses, and Electric Shocks: On the Affective Psychology of Risk.” Their finding was that the valuation of gambles was much less sensitive to probability when the (fictitious) outcomes were emotional (“meeting and kissing your favorite movie star” or “getting a painful, but not dangerous, electric shock”) than when the outcomes were gains or losses of cash. This was not an isolated finding. Other researchers had found, using physiological measures such as heart rate, that the fear of an impending electric shock was essentially uncorrelated with the probability of receiving the shock. The mere possibility of a shock triggered the full-blown fear response. The Chicago team proposed that “affect-laden imagery” overwhelmed the response to probability. Ten years later, a team of psychologists at Princeton challenged that conclusion.
芝加哥大学的心理学家发表了一篇文章,题目很吸引人,"金钱、亲吻和电击。关于风险的情感 Psychology"。他们的发现是,当(虚构的)结果是情感性的("遇见并亲吻你最喜欢的电影明星 "或 "受到痛苦但不危险的电击")时,赌博的估值对概率的敏感性要比结果是现金收益或损失时低得多。这并不是一个孤立的发现。其他研究人员利用生理学 ,如心率,发现对即将到来的电击的恐惧与接受电击的概率基本上不相关。仅仅是电击的可能性就引发了全面的恐惧反应。芝加哥小组提出,"充满情感的想象 "压倒了对概率的反应。十年后,普林斯顿大学的一个心理学家小组对这一结论提出了质疑。
The Princeton team argued that the low sensitivity to probability that had been observed for emotional outcomes is normal. Gambles on money are the exception. The sensitivity to probability is relatively high for these gambles, because they have a definite expected value.
普林斯顿大学的研究小组认为,已经观察到的情感结果对概率的低敏感性是正常的。对金钱的赌博是个例外。这些赌博对概率的敏感性相对较高,因为它们有明确的预期价值。
What amount of cash is as attractive as each of these gambles?
什么 ,现金的数量和这些赌博中的每一个一样有吸引力?
A. 84% chance to win $59
A.84%的机会赢得59美元
B. 84% chance to receive one dozen red roses in a glass vase
B.84%的机会收到一打 玻璃花瓶装的红玫瑰
What do you notice? The salient difference is that question A is much easier than question B. You did not stop to compute the expected value of the bet, but you probably knew quickly that it is not far from $50 (in fact it is $49.56), and the vague estimate was sufficient to provide a helpful anchor as you searched for an equally attractive cash gift. No such anchor is available for question B, which is therefore much harder to answer. Respondents also assessed the cash equivalent of gambles with a 21% chance to win the two outcomes. As expected, the difference between the high-probability and low-probability gambles was much more pronounced for the money than for the roses.
你注意到了什么?你没有停下来计算赌注的期望值,但你可能很快就知道它离50美元不远了(事实上是49.56美元),在你寻找同样有吸引力的现金礼物时,这个模糊的估计足以提供一个有用的锚。对于问题B,没有这样的锚 ,因此这个问题更难回答。受访者还评估了有21%机会赢得两种结果的赌博的现金等值。正如预期的那样,高概率和低概率的赌博之间的差异对于金钱来说比对于玫瑰花来说要明显得多。
To bolster their argument that insensitivity to probability is not caused by emotion, the Princeton team compared willingness to pay to avoid gambles:
为了支持他们的论点,即对概率不敏感不是由情绪引起的, ,普林斯顿大学的团队比较了为避免赌博而支付的意愿。
21% chance (or 84% chance) to spend a weekend painting someone’s three-bedroom apartment
21%的机会(或84%的机会)花一个周末来粉刷别人的三居室公寓
21% chance (or 84% chance) to clean three stalls in a dormitory bathroom after a weekend of use
21%的机会(或84%的机会),在一个周末的使用后,清洁宿舍浴室的三个隔间
The second outcome is surely much more emotional than the first, but the decision weights for the two outcomes did not differ. Evidently, the intensity of emotion is not the answer.
第二种结果肯定比第一种结果更加情绪化,但两种结果的决策权重并没有不同。可见, 情感的强度并不是答案。
Another experiment yielded a surprising result. The participants received explicit price information along with the verbal description of the prize. An example could be:
另一个实验产生了一个令人惊讶的结果。参与者在收到奖品的口头描述的同时,也收到了明确的价格信息。一个例子可以是。
84% chance to win: A dozen red roses in a glass vase. Value $59.
84%的获奖机会。一打玻璃花瓶装的红玫瑰。价值59美元。
21% chance to win: A dozen red roses in a glass vase. Value $59.
21%的获奖机会。一打玻璃花瓶装的红玫瑰。价值59美元。
It is easy to assess the expected monetary value of these gambles, but adding a specific monetary value did not alter the results: evaluations remained insensitive to probability even in that condition. People who thought of the gift as a chance to get roses did not use price information as an anchor in evaluating the gamble. As scientists sometimes say, this is a surprising finding that is trying to tell us something. What story is it trying to tell us?
评估 这些赌博的预期货币价值是很容易的,但是增加一个具体的货币价值并没有改变结果:即使在这种条件下,评价仍然对概率不敏感。把礼物看作是得到玫瑰花的机会的人,并没有把价格信息作为评估赌博的锚, 。正如科学家们有时所说,这是一个令人惊讶的发现,它试图告诉我们什么。它试图 ,告诉我们什么故事呢?
The story, I believe, is that a rich and vivid representation of the outcome, whether or not it is emotional, reduces the role of probability in the evaluation of an uncertain prospect. This hypothesis suggests a prediction, in which I have reasonably high confidence: adding irrelevant but vivid details to a monetary outcome also disrupts calculation. Compare your cash equivalents for the following outcomes:
我认为,这个故事是,对结果的丰富而生动的表述,无论它是否是感性的,都会减少概率在评估不确定前景中的作用。这个假设提出了一个预测,我对这个预测有相当高的信心:在货币结果中加入不相关但生动的细节也会扰乱计算。比较你的现金等价物, 以下的结果。
21% (or 84%) chance to receive $59 next Monday
21%(或84%)的机会在下周一收到59美元
21% (or 84%) chance to receive a large blue cardboard envelope containing $59 next Monday morning
21%(或84%)的机会在下周一早上收到一个装有59美元的蓝色大纸板信封
The new hypothesis is that there will be less sensitivity to probability in the second case, because the blue envelope evokes a richer and more fluent representation than the abstract notion of a sum of money. You constructed the event in your mind, and the vivid image of the outcome exists there even if you know that its probability is low. Cognitive ease contributes to the certainty effect as well: when you hold a vivid image of an event, the possibility of its not occurring is also represented vividly, and overweighted. The combination of an enhanced possibility effect with an enhanced certainty effect leaves little room for decision weights to change between chances of 21% and 84%.
新的假设是,在第二种情况下,对概率的敏感度会降低,因为蓝色信封比一笔钱的抽象概念唤起了更丰富、更流畅的表述。你在脑海中构建了 ,即使你知道其概率很低,但结果的生动形象还是存在的。认知的便利性也有助于确定性效应:当你持有一个事件的生动形象时,它不发生的可能性也会被生动地表现出来,并被加权。强化的可能性效应和强化的确定性效应的结合,为决策权重在21%和84%的机会之间的变化留下了很小的 。
VIVID PROBABILITIES
生动的概率
The idea that fluency, vividness, and the ease of imagining contribute to decision weights gains support from many other observations. Participants in a well-known experiment are given a choice of drawing a marble from one of two urns, in which red marbles win a prize:
流利性、生动性和想象的容易性有助于决策权重的想法从许多其他观察中获得支持。在一个著名的实验中,参与者可以选择从两个瓮中抽出一个弹珠,其中红色的弹珠可以获得奖励。
Urn A contains 10 marbles, of which 1 is red.
瓮A中有10颗弹珠,其中 1是红色的。
Urn B contains 100 marbles, of which 8 are red.
瓮B中有100颗弹珠,其中8颗是红色的。
Which urn would you choose? The chances of winning are 10% in urn A and 8% in urn B, so making the right choice should be easy, but it is not: about 30%–40% of students choose the urn with the larger number of winning marbles, rather than the urn that provides a better chance of winning. Seymour Epstein has argued that the results illustrate the superficial processing characteristic of System 1 (which he calls the experiential system).
你会选择哪个瓮?A瓮的中奖几率是10%,B瓮的中奖几率是8%,所以做出正确的选择应该很容易,但事实并非如此:大约30%-40%的学生选择中奖弹珠 数量 较多的瓮,而不是选择中奖几率较高的瓮。 西摩-爱泼斯坦(Seymour Epstein)认为,这些结果说明了 ,系统1(他称之为经验系统)的 表面处理 特征。
As you might expect, the remarkably foolish choices that people make in this situation have attracted the attention of many researchers. The bias has been given several names; following Paul Slovic I will call it denominator neglect . If your attention is drawn to the winning marbles, you do not assess the number of nonwinning marbles with the same care. Vivid imagery contributes to denominator neglect, at least as I experience it. When I think of the small urn, I see a single red marble on a vaguely defined background of white marbles. When I think of the larger urn, I see eight winning red marbles on an indistinct background of white marbles, which creates a more hopeful feeling. The distinctive vividness of the winning marbles increases the decision weight of that event, enhancing the possibility effect. Of course, the same will be true of the certainty effect. If I have a 90% chance of winning a prize, the event of not winning will be more salient if 10 of 100 marbles are “losers” than if 1 of 10 marbles yields the same outcome.
正如你所期望的,人们在这种情况下做出的非常愚蠢的选择已经引起了许多研究人员的注意。这种偏见被赋予了几个名字;按照保罗-斯洛维奇的说法,我将称之为 分母忽视 。如果你的注意力被吸引到中奖的弹珠上,你就不会以同样的谨慎来评估 非中奖弹珠的数量。至少在我的经历中,生动的想象力有助于分母忽视。当我想到小瓮时,我看到一个红色的弹珠在一个模糊的白色弹珠的背景上。当我想到较大的骨灰盒时,我看到在不明确的白色弹珠背景上有八个获胜的红色弹珠,这创造了一种更有希望的感觉。获奖弹珠的独特 ,增加了该事件的决策权重,增强了可能性效应。当然,确定性效应也会如此。如果我有90%的机会中奖,如果100颗弹珠中有10颗是 "失败者",那么不中奖的事件就会比10颗弹珠中的1颗产生同样的结果更突出。
The idea of denominator neglect helps explain why different ways of communicating risks vary so much in their effects. You read that “a vaccine that protects children from a fatal disease carries a 0.001% risk of permanent disability.” The risk appears small. Now consider another description of the same risk: “One of 100,000 vaccinated children will be permanently disabled.” The second statement does something to your mind that the first does not: it calls up the image of an individual child who is permanently disabled by a vaccine; the 99,999 safely vaccinated children have faded into the background. As predicted by denominator neglect, low-probability events are much more heavily weighted when described in terms of relative frequencies (how many) than when stated in more abstract terms of “chances,” “risk,” or “probability” (how likely). As we have seen, System 1 is much better at dealing with individuals than categories.
分母忽视的想法有助于解释为什么不同的 ,沟通风险的方式在效果上有很大的不同。你读到 "一种保护儿童免遭致命疾病的疫苗有0.001%的永久残疾风险"。这个风险似乎很小。现在考虑对同一风险的另一种描述。"10万名接种疫苗的儿童中,有一人将成为永久性残疾。"第二句话在你的脑海中起到了第一句话所没有的作用:它 ,唤起了一个因疫苗而永久残疾的儿童个体的形象;99,999个安全接种的儿童已经淡出了背景。正如分母忽略所预测的那样,当用相对频率(多少)来描述时,低概率事件比用 "机会"、"风险 "或 "概率"(可能性有多大)等更抽象的术语来表述时,其权重要大得多。正如 ,我们已经看到,系统1在处理个人方面比类别方面要好得多。
The effect of the frequency format is large. In one study, people who saw information about “a disease that kills 1,286 people out of every 10,000” judged it as more dangerous than people who were told about “a disease that kills 24.14% of the population.” The first disease appears more threatening than the second, although the former risk is only half as large as the latter! In an even more direct demonstration of denominator neglect, “a disease that kills 1,286 people out of every 10,000” was judged more dangerous than a disease that “kills 24.4 out of 100.” The effect would surely be reduced or eliminated if participants were asked for a direct comparison of the two formulations, a task that explicitly calls for System 2. Life, however, is usually a between-subjects experiment, in which you see only one formulation at a time. It would take an exceptionally active System 2 to generate alternative formulations of the one you see and to discover that they evoke a different response.
频率格式的影响是很大的。在一项研究中,看到关于 "一种每万人中有1286人死亡的疾病 "的信息的人比被告知 "一种24.14%的人口死亡的疾病 "的人 判断它更危险 。第一种疾病似乎比第二种疾病更具威胁性, ,尽管前者的风险只有后者的一半!在一个更直接的分母忽视的证明中,"一种每10000人中有1286人死亡的疾病 "被判断为比 "100人中有24.4人死亡 "的疾病更危险。如果要求参与者对这两种提法进行直接比较,这种效果肯定会减少或消除,这种任务明确地 ,需要系统2。然而,生活中, ,通常是一个被试者之间的实验,在这个实验中,你一次只看到一个配方。需要一个特别活跃的系统2来产生你所看到的那个配方的替代配方,并发现它们唤起了不同的反应。
Experienced forensic psychologists and psychiatrists are not immune to the effects of the format in which risks are expressed. In one experiment, professionals evaluated whether it was safe to discharge from the psychiatric hospital a patient, Mr. Jones, with a history of violence. The information they received included an expert’s assessment of the risk. The same statistics were described in two ways:
有经验的 法医心理学家 和精神病学家对风险 的表达形式的影响没有免疫力。在一个实验中,专业人员评估了让有暴力史的病人琼斯先生从精神病院出院是否安全。他们收到的信息包括一个专家对风险的评估。同样的统计数据被以两种方式描述。
Patients similar to Mr. Jones are estimated to have a 10% probability of committing an act of violence against others during the first several months after discharge.
与琼斯先生类似的病人估计有10%的概率在出院后的头几个月对他人实施暴力行为 。
Of every 100 patients similar to Mr. Jones, 10 are estimated to commit an act of violence against others during the first several months after discharge.
在每100名与琼斯先生类似的病人中,估计有10人在出院后的头几个月对他人实施暴力行为。
The professionals who saw the frequency format were almost twice as likely to deny the discharge (41%, compared to 21% in the probability format). The more vivid description produces a higher decision weight for the same probability.
看到频率格式的专业人员拒绝出院的可能性几乎是两倍(41%,而概率格式为21%)。更生动的描述产生 ,对于同样的概率,决定权重更高。
The power of format creates opportunities for manipulation, which people with an axe to grind know how to exploit. Slovic and his colleagues cite an article that states that “approximately 1,000 homicides a year are committed nationwide by seriously mentally ill individuals who are not taking their medication.” Another way of expressing the same fact is that “1,000 out of 273,000,000 Americans will die in this manner each year.” Another is that “the annual likelihood of being killed by such an individual is approximately 0.00036%.” Still another: “1,000 Americans will die in this manner each year, or less than one-thirtieth the number who will die of suicide and about one-fourth the number who will die of laryngeal cancer.” Slovic points out that “these advocates are quite open about their motivation: they want to frighten the general public about violence by people with mental disorder, in the hope that this fear will translate into increased funding for mental health services.”
形式的力量创造了操纵的机会,有心人知道如何利用这些机会。斯洛维奇和他的同事引用了一篇文章,其中指出:"全国范围内每年大约有1000起杀人案是由不服药的严重精神病患者犯下的。"同样的 ,另一种表达方式是:"每年273,000,000名美国人中有1,000人将以这种方式死亡。"另一种说法是:"每年被这种人杀害的可能性大约为0.00036%。"还有一个。"每年将有1000名美国人以这种方式死亡,即不到自杀死亡人数的三十分之一,约为喉癌死亡人数的四分之一。"斯洛维奇指出, ,"这些倡导者对他们的动机非常公开:他们 想 让公众对精神障碍患者的暴力感到恐惧,希望这种恐惧能转化为对心理健康服务的更多资助。"
A good attorney who wishes to cast doubt on DNA evidence will not tell the jury that “the chance of a false match is 0.1%.” The statement that “a false match occurs in 1 of 1,000 capital cases ” is far more likely to pass the threshold of reasonable doubt. The jurors hearing those words are invited to generate the image of the man who sits before them in the courtroom being wrongly convicted because of flawed DNA evidence. The prosecutor, of course, will favor the more abstract frame—hoping to fill the jurors’ minds with decimal points.
一个希望对DNA证据产生怀疑的好律师不会告诉陪审团 "错误匹配的机会是0.1%"。"在 1000个死刑案件中,有1个 发生错误的 匹配 "的说法更有可能通过 合理怀疑的门槛。听到这些话的陪审员会被邀请产生这样的印象:坐在他们面前的人在法庭上因为有缺陷的DNA证据而被错误定罪。当然,检察官会倾向于更抽象的框架--希望用小数点填满陪审员的头脑。
DECISIONS FROM GLOBAL IMPRESSIONS
决定 来自全球的印象
The evidence suggests the hypothesis that focal attention and salience contribute to both the overestimation of unlikely events and the overweighting of unlikely outcomes. Salience is enhanced by mere mention of an event, by its vividness, and by the format in which probability is described. There are exceptions, of course, in which focusing on an event does not raise its probability: cases in which an erroneous theory makes an event appear impossible even when you think about it, or cases in which an inability to imagine how an outcome might come about leaves you convinced that it will not happen. The bias toward overestimation and overweighting of salient events is not an absolute rule, but it is large and robust.
证据表明,焦点注意和突出性有助于高估不可能发生的事件和高估不可能发生的结果。仅仅提到一个事件,其生动性和描述概率的格式都会增强显著性。当然,也有一些例外情况,在这些情况下,对一个事件的关注并不能提高其 的概率:在一些情况下,一个错误的理论使一个事件看起来不可能发生,即使你想到了它,或者在一些情况下,无法想象一个结果是如何发生的,使你确信它不会发生。对突出事件的高估和超重的偏见不是一个绝对的规则,但它是巨大的和强大的。
There has been much interest in recent years in studies of choice from experience , which follow different rules from the choices from description that are analyzed in prospect theory. Participants in a typical experiment face two buttons. When pressed, each button produces either a monetary reward or nothing, and the outcome is drawn randomly according to the specifications of a prospect (for example, “5% to win $12” or “95% chance to win $1”). The process is truly random, so there is no guarantee that the sample a participant sees exactly represents the statistical setup. The expected values associated with the two buttons are approximately equal, but one is riskier (more variable) than the other. (For example, one button may produce $10 on 5% of the trials and the other $1 on 50% of the trials). Choice from experience is implemented by exposing the participant to many trials in which she can observe the consequences of pressing one button or another. On the critical trial, she chooses one of the two buttons, and she earns the outcome on that trial. Choice from description is realized by showing the subject the verbal description of the risky prospect associated with each button (such as “5% to win $12”) and asking her to choose one. As expected from prospect theory, choice from description yields a possibility effect—rare outcomes are overweighted relative to their probability. In sharp contrast, overweighting is never observed in choice from experience, and underweighting is common.
近年来,人们对 来自经验的选择 的 研究 很感兴趣, ,这与前景理论中分析的 来自描述的选择 遵循不同的规则。在一个典型的实验中,参与者面对两个按钮。当按下时,每个按钮要么产生金钱奖励,要么什么都不产生,结果是根据前景的规格随机抽取的(例如,"5%赢得12美元 "或 "95%的机会赢得 1美元")。这个过程是真正的随机的,所以不能保证参与者看到的样本完全代表统计设置。与两个按钮相关的预期值大致相等,但其中一个比另一个风险更大(变化更大)。(例如,一个按钮可能在5%的试验中产生10美元,另一个在50%的试验中产生1美元)。根据经验进行选择, ,让被试者接触许多试验,她可以观察到按下一个或另一个按钮的后果。在关键的试验中,她选择了两个按钮中的一个,她就能在该试验中获得结果。通过向被试展示与每个按钮相关的风险前景的口头描述(如 "5%赢得12美元"),并要求她 ,从而实现根据描述做出选择。正如前景理论所预期的那样,根据描述做出的选择会产生一种可能性效应--相对于其概率而言,罕见的结果会被加权。与此形成鲜明对比的是,在从经验出发的选择中从未观察到加权现象,而加权不足是很常见的。
The experimental situation of choice by experience is intended to represent many situations in which we are exposed to variable outcomes from the same source. A restaurant that is usually good may occasionally serve a brilliant or an awful meal. Your friend is usually good company, but he sometimes turns moody and aggressive. California is prone to earthquakes, but they happen rarely. The results of many experiments suggest that rare events are not overweighted when we make decisions such as choosing a restaurant or tying down the boiler to reduce earthquake damage.
经验选择的实验情况是为了代表许多情况,在这些情况下,我们接触到来自同一来源的可变 结果。一家通常很好的餐厅,偶尔也会提供美味或糟糕的食物。你的朋友通常是个好伙伴,但他有时会变得喜怒无常,具有攻击性。加州很容易发生地震,但地震很少发生。许多实验的结果表明,当我们做决定时,如选择餐馆或将 ,以减少地震的破坏,罕见的事件并没有被加权。
The interpretation of choice from experience is not yet settled , but there is general agreement on one major cause of underweighting of rare events, both in experiments and in the real world: many participants never experience the rare event! Most Californians have never experienced a major earthquake, and in 2007 no banker had personally experienced a devastating financial crisis. Ralph Hertwig and Ido Erev note that “chances of rare events (such as the burst of housing bubbles) receive less impact than they deserve according to their objective probabilities.” They point to the public’s tepid response to long-term environmental threats as an example.
对来自经验的选择的解释还 没有定论 ,但在实验和现实世界中,人们普遍同意罕见事件比重过低的一个主要原因:许多参与者从来没有经历过罕见的事件!这也是一个重要的原因。大多数加州人从未经历过大地震,2007年,没有一个银行家亲身经历过 一次毁灭性的金融危机。拉尔夫-赫特维格和伊多-埃雷夫指出, "罕见事件 (如住房泡沫破裂)的 几率 根据其客观概率得到的影响比它们应得的要少"。他们指出,公众对长期环境威胁的反应不温不火,就是一个例子。
These examples of neglect are both important and easily explained, but underweighting also occurs when people have actually experienced the rare event. Suppose you have a complicated question that two colleagues on your floor could probably answer. You have known them both for years and have had many occasions to observe and experience their character. Adele is fairly consistent and generally helpful, though not exceptional on that dimension. Brian is not quite as friendly and helpful as Adele most of the time, but on some occasions he has been extremely generous with his time and advice. Whom will you approach?
这些忽视的例子既重要又容易解释,但当人们真正经历了这一罕见的事件时,轻视也 。假设你有一个复杂的问题,你所在楼层的两个同事可能会回答。你认识他们俩多年,有很多机会观察和体验他们的性格。阿黛尔的性格相当稳定,一般来说很乐于助人,虽然在这个方面并不特别。布赖恩在大多数时候并不像阿黛尔那样友好和乐于助人 ,但在某些场合,他对自己的时间和建议都非常慷慨大方。你会找谁?
Consider two possible views of this decision:
请考虑对这一决定的两种可能的看法。
It is a choice between two gambles. Adele is closer to a sure thing; the prospect of Brian is more likely to yield a slightly inferior outcome, with a low probability of a very good one. The rare event will be overweighted by a possibility effect, favoring Brian.
这是在两个赌局之间的选择。阿黛尔更接近于一个肯定的结果;布莱恩的前景更有可能产生一个稍差的结果,而一个非常好的结果的概率很低。罕见的事件将被可能性效应加权 ,有利于布莱恩。
It is a choice between your global impressions of Adele and Brian. The good and the bad experiences you have had are pooled in your representation of their normal behavior. Unless the rare event is so extreme that it comes to mind separately (Brian once verbally abused a colleague who asked for his help), the norm will be biased toward typical and recent instances, favoring Adele.
这是一个在你对阿黛尔和布莱恩的总体印象之间的选择。你所拥有的好的和坏的经验都汇集在你对他们正常行为的表述中。除非罕见的事件是如此极端,以至于它单独出现在脑海中(布莱恩曾经辱骂一个要求他帮助的同事),否则常态会偏向于典型的和最近的 ,偏向于阿黛尔。
In a two-system mind, the second interpretation appears far more plausible. System 1 generates global representations of Adele and Brian, which include an emotional attitude and a tendency to approach or avoid. Nothing beyond a comparison of these tendencies is needed to determine the door on which you will knock. Unless the rare event comes to your mind explicitly, it will not be overweighted. Applying the same idea to the experiments on choice from experience is straightforward. As they are observed generating outcomes over time, the two buttons develop integrated “personalities” to which emotional responses are attached.
在一个双系统思维中,第二种解释似乎更有道理。系统1产生了阿黛尔和布莱恩的全局表征,其中包括情感态度和接近或回避的倾向。除了对这些倾向的比较之外,不需要其他东西来决定你要敲哪扇门。除非这个罕见的事件明确地出现在你的脑海中, ,否则它不会被加权。将同样的想法应用于从经验中选择的实验是直截了当的。当它们被观察到随着时间的推移产生结果时,这两个按钮就会发展出综合的 "个性",并对其作出情感反应。
The conditions under which rare events are ignored or overweighted are better understood now than they were when prospect theory was formulated. The probability of a rare event will (often, not always) be overestimated, because of the confirmatory bias of memory. Thinking about that event, you try to make it true in your mind. A rare event will be overweighted if it specifically attracts attention. Separate attention is effectively guaranteed when prospects are described explicitly (“99% chance to win $1,000, and 1% chance to win nothing”). Obsessive concerns (the bus in Jerusalem), vivid images (the roses), concrete representations (1 of 1,000), and explicit reminders (as in choice from description) all contribute to overweighting. And when there is no overweighting, there will be neglect. When it comes to rare probabilities, our mind is not designed to get things quite right. For the residents of a planet that may be exposed to events no one has yet experienced, this is not good news.
现在,人们对罕见事件被忽视或被高估的条件比前景理论制定时更了解。 。罕见事件的概率将被高估(通常,不总是),因为记忆的确认性偏差。想到这个事件,你就试图让它在你的脑海中成为现实。如果一个罕见的事件特别吸引人的注意,它就会被高估。当前景被明确描述时,单独的注意力得到有效保证("99%的机会赢得1000美元,1%的机会 ,什么都不赢")。强烈的关注(耶路撒冷的公共汽车)、生动的形象(玫瑰花)、具体的表述(1000分之一)和明确的提醒(如从描述中选择)都有助于加权。而当没有超重的时候,就会出现忽视的情况。当涉及到罕见的概率时,我们的头脑并不是被设计成能够完全正确地处理事情。对于一个可能会在 ,接触到没有人经历过的事件的星球的居民来说,这不是一个好消息。
SPEAKING OF RARE EVENTS
说到罕见的事件
“Tsunamis are very rare even in Japan, but the image is so vivid and compelling that tourists are bound to overestimate their probability.”
"即使在日本,海啸也是非常罕见的,但这种形象是如此生动和引人注目,以至于游客必然会高估其概率。"
“It’s the familiar disaster cycle. Begin by exaggeration and overweighting, then neglect sets in.”
"这是我们熟悉的灾难循环。开始是夸大和超重,然后是忽视。"
“We shouldn’t focus on a single scenario, or we will overestimate its probability. Let’s set up specific alternatives and make the probabilities add up to 100%.”
"我们不应该专注于单一的情景,否则我们会高估 其概率。让我们设置具体的备选方案,并使概率加起来达到100%。"
“They want people to be worried by the risk. That’s why they describe it as 1 death per 1,000. They’re counting on denominator neglect.”
"他们希望人们对风险感到担忧。这就是为什么他们把它描述为每1,000人中有1人死亡。他们指望着分母的忽视。"
31
31
Risk Policies
风险政策
Imagine that you face the following pair of concurrent decisions. First examine both decisions, then make your choices.
想象一下,你面临以下一对同时出现的决定。首先审查这两个决定,然后做出选择。
Decision (i): Choose between
决定(i):选择
A. sure gain of $240
A. 肯定收益为240美元
B. 25% chance to gain $1,000 and 75% chance to gain nothing
B.25%的机会获得1,000美元,75%的机会 ,一无所获
Decision (ii): Choose between
决定(二):在以下情况下选择
C. sure loss of $750
C. 肯定损失750美元
D. 75% chance to lose $1,000 and 25% chance to lose nothing
D.75%的机会失去1000美元,25%的机会一无所有
This pair of choice problems has an important place in the history of prospect theory, and it has new things to tell us about rationality. As you skimmed the two problems, your initial reaction to the sure things (A and C) was attraction to the first and aversion to the second. The emotional evaluation of “sure gain” and “sure loss” is an automatic reaction of System 1, which certainly occurs before the more effortful (and optional) computation of the expected values of the two gambles (respectively, a gain of $250 and a loss of $750). Most people’s choices correspond to the predilections of System 1, and large majorities prefer A to B and D to C. As in many other choices that involve moderate or high probabilities, people tend to be risk averse in the domain of gains and risk seeking in the domain of losses. In the original experiment that Amos and I carried out, 73% of respondents chose A in decision i and D in decision ii and only 3% favored the combination of B and C.
这对选择问题在前景理论的历史上有着重要的地位,它有新的东西可以告诉我们关于理性的问题。当你浏览这两个问题时,你对肯定的东西(A和C)的最初反应是对第一个的吸引,对第二个的厌恶 。对 "肯定收益 "和 "肯定损失 "的情感评价是系统1的自动反应,这肯定发生在对这两个赌博的预期价值(分别为收益250美元和损失750美元)进行更努力的(和可选的)计算之前。大多数人的选择对应于系统1的偏好,大多数人喜欢A而不是B,喜欢D而不是C。正如在许多 ,其他涉及中等或高概率的选择中,人们倾向于在收益领域厌恶风险,在损失领域寻求风险。 在我和阿莫斯进行的原始实验中,73%的受访者在决策一中选择了A,在决策二中选择了D,只有3%的人喜欢B和C的组合。
You were asked to examine both options before making your first choice, and you probably did so. But one thing you surely did not do: you did not compute the possible results of the four combinations of choices (A and C, A and D, B and C, B and D) to determine which combination you like best. Your separate preferences for the two problems were intuitively compelling and there was no reason to expect that they could lead to trouble. Furthermore, combining the two decision problems is a laborious exercise that you would need paper and pencil to complete. You did not do it. Now consider the following choice problem:
你被要求在做出第一个选择之前检查两个选项, ,你可能这样做了。但有一件事你肯定没有做:你没有计算四个选择组合(A和C、A和D、B和C、B和D)的可能结果,以确定你最喜欢哪个组合。你对这两个问题的单独偏好在直觉上是令人信服的,没有理由预期它们会导致麻烦。此外,将这两个决策 ,是一项费力的工作,你需要纸和笔来完成。你并没有这样做。现在考虑下面的选择问题。
AD. 25% chance to win $240 and 75% chance to lose $760
AD.25%的机会赢得240美元,75%的机会失去760美元
BC. 25% chance to win $250 and 75% chance to lose $750
BC。25%的机会赢得250美元,75%的机会失去750美元
This choice is easy! Option BC actually dominates option AD (the technical term for one option being unequivocally better than another). You already know what comes next. The dominant option in BC is the combination of the two rejected options in the first pair of decision problems, the one that only 3% of respondents favored in our original study. The inferior option AD was preferred by 73% of respondents.
这个选择很简单!选项BC实际上 主导了 选项AD(一个选项明确地优于另一个选项的技术术语)。 你已经知道接下来会发生什么。BC的主导选项是第一对决策问题中两个被拒绝的选项的组合,在我们最初的研究中,只有3%的受访者赞成这个选项。73%的受访者喜欢 劣质选项AD 。
BROAD OR NARROW?
广义还是狭义?
This set of choices has a lot to tell us about the limits of human rationality. For one thing, it helps us see the logical consistency of Human preferences for what it is—a hopeless mirage. Have another look at the last problem, the easy one. Would you have imagined the possibility of decomposing this obvious choice problem into a pair of problems that would lead a large majority of people to choose an inferior option? This is generally true: every simple choice formulated in terms of gains and losses can be deconstructed in innumerable ways into a combination of choices, yielding preferences that are likely to be inconsistent.
这组选择有很多东西可以告诉我们关于人类理性的局限性。首先,它有助于 ,让我们看到人类偏好的逻辑一致性是什么--一个无望的幻觉。再看一下最后一个问题,这个简单的问题。你能想象到把这个明显的选择问题分解成一对问题的可能性吗,这将导致大多数人选择一个较差的选项?这通常是真实的:每一个以收益和损失制定的简单选择 ,都可以以无数的方式分解成各种选择的组合,产生的偏好很可能是不一致的。
The example also shows that it is costly to be risk averse for gains and risk seeking for losses. These attitudes make you willing to pay a premium to obtain a sure gain rather than face a gamble, and also willing to pay a premium (in expected value) to avoid a sure loss. Both payments come out of the same pocket, and when you face both kinds of problems at once, the discrepant attitudes are unlikely to be optimal.
这个例子还表明,对收益厌恶风险,对损失寻求风险是有代价的。这些态度使你愿意支付溢价以获得肯定的收益,而不是面对一场赌博,同时也愿意支付溢价(预期价值)以避免肯定的 损失。这两种支付都是从同一个口袋里 ,当你同时面临这两种问题时,不一致的态度不太可能是最佳的。
There were two ways of construing decisions i and ii:
对决定一和决定二的解释有两种方式。
narrow framing: a sequence of two simple decisions, considered separately
狭义的框架:由两个简单的决定组成的序列,分别考虑。
broad framing: a single comprehensive decision, with four options
广义的框架:一个单一的综合决定,有四个选项
Broad framing was obviously superior in this case. Indeed, it will be superior (or at least not inferior) in every case in which several decisions are to be contemplated together. Imagine a longer list of 5 simple (binary) decisions to be considered simultaneously. The broad (comprehensive) frame consists of a single choice with 32 options. Narrow framing will yield a sequence of 5 simple choices. The sequence of 5 choices will be one of the 32 options of the broad frame. Will it be the best? Perhaps, but not very likely. A rational agent will of course engage in broad framing, but Humans are by nature narrow framers.
在本案中,广义的框架显然是优越的。 事实上,在每一个需要同时考虑几个决定的情况下,它都会有优势(或者至少不会有劣势)。想象一下,一个由5个简单(二元)决定组成的较长的清单,需要同时考虑。广义(全面)框架包括一个有32个选项的单一选择。狭义的框架将产生一个由5个简单选择组成的序列。5个选择的序列将是广义框架的 32个选项中的一个。这将是最好的吗?也许,但可能性不大。一个理性的代理人当然会参与广义的框架,但人类本质上是狭义的框架。
The ideal of logical consistency, as this example shows, is not achievable by our limited mind. Because we are susceptible to WY SIATI and averse to mental effort, we tend to make decisions as problems arise, even when we are specifically instructed to consider them jointly. We have neither the inclination nor the mental resources to enforce consistency on our preferences, and our preferences are not magically set to be coherent, as they are in the rational-agent model.
正如这个例子所显示的,逻辑一致性的理想是我们有限的心智无法实现的。因为我们容易受到WY SIATI的影响,并且厌恶精神上的努力,所以我们倾向于在问题出现时做出决定,即使 ,我们被特别指示要共同考虑这些问题。我们既没有倾向也没有精神资源来强制执行我们的偏好的一致性,而且我们的偏好并不像在理性代理模型中那样被神奇地设定为一致的。
SAMUELSON’S PROBLEM
萨缪尔森的问题
The great Paul Samuelson—a giant among the economists of the twentieth century—famously asked a friend whether he would accept a gamble on the toss of a coin in which he could lose $100 or win $200. His friend responded, “I won’t bet because I would feel the $100 loss more than the $200 gain. But I’ll take you on if you promise to let me make 100 such bets.” Unless you are a decision theorist, you probably share the intuition of Samuelson’s friend, that playing a very favorable but risky gamble multiple times reduces the subjective risk. Samuelson found his friend’s answer interesting and went on to analyze it. He proved that under some very specific conditions, a utility maximizer who rejects a single gamble should also reject the offer of many.
伟大的保罗-萨缪尔森(Paul Samuelson)是二十世纪经济学家中的巨人,他曾问过一个朋友,他是否会 ,接受抛掷硬币的赌博,他可能会损失100美元或赢得200美元。他的朋友回答说:"我不会赌,因为我对100美元的损失比200美元的收益感受更深。但如果你答应让我打100次这样的赌,我就接受你。"除非你是一个决策理论家,否则你可能和萨缪尔森的朋友一样有这样的直觉,即多次玩一个非常有利但有风险的赌博,可以减少 的主观风险。萨缪尔森发现他朋友的答案很有趣,于是继续分析。他证明,在一些非常具体的条件下,一个拒绝单一赌博的效用最大化者也应该拒绝许多赌博的提议。
Remarkably, Samuelson did not seem to mind the fact that his proof, which is of course valid, led to a conclusion that violates common sense, if not rationality: the offer of a hundred gambles is so attractive that no sane person would reject it. Matthew Rabin and Richard Thaler pointed out that “the aggregated gamble of one hundred 50–50 lose $100/gain $200 bets has an expected return of $5,000, with only a 1/2,300 chance of losing any money and merely a 1/62,000 chance of losing more than $1,000.” Their point, of course, is that if utility theory can be consistent with such a foolish preference under any circumstances, then something must be wrong with it as a model of rational choice. Samuelson had not seen Rabin’s proof of the absurd consequences of severe loss aversion for small bets, but he would surely not have been surprised by it. His willingness even to consider the possibility that it could be rational to reject the package testifies to the powerful hold of the rational model.
值得注意的是,萨缪尔森似乎并不介意他的证明, ,这当然是有效的,但却导致了一个违反常识,甚至违反理性的结论: ,一百次赌博的提议是如此有吸引力,以至于没有一个正常人会拒绝它。马修-拉宾和理查德-塞勒指出,"一百个50-50的输100美元/赢200美元的赌局,其预期收益为5000美元,只有1/2,300的机会输掉任何钱,仅有1/62,000的机会输掉超过1000美元。"当然,他们的观点是,如果效用理论在任何情况下都能与这种愚蠢的偏好 ,那么它作为理性选择的模型一定有问题。萨缪尔森没有看到拉宾对小赌注的严重损失厌恶的荒谬后果的证明,但他肯定不会对此感到惊讶。他甚至愿意考虑拒绝这套方案可能是理性的,这证明了 ,理性模型的强大影响力。
Let us assume that a very simple value function describes the preferences of Samuelson’s friend (call him Sam). To express his aversion to losses Sam first rewrites the bet, after multiplying each loss by a factor of 2 . He then computes the expected value of the rewritten bet. Here are the results, for one, two, or three tosses. They are sufficiently instructive to deserve some pupil-dilating effort.
让我们假设,一个非常简单的价值函数描述了萨缪尔森的朋友(称他为萨姆)的偏好。为了表达他对损失的厌恶,萨姆首先改写了赌注, 在将每次损失乘以2的系数后 。以下是一次、两次或三次抛掷的结果。它们有足够的指导意义, ,值得学生们做一些减法。
Expected Value
预期价值
One toss
一掷千金
(50% lose 100; 50% win 200)
(50%输100;50%赢200)
50
50
Losses doubled
损失翻倍
(50% lose 200; 50% win 200)
(50%输200;50%赢200)
0
0
Two tosses
两次投掷
(25% lose 200; 50% win 100; 25% win 400)
(25%输200;50%赢100;25%赢400)
100
100
Losses doubled
损失翻倍
(25% lose 400; 50% win 100; 25% win 400)
(25%输400;50%赢100;25%赢400)
50
50
Three tosses
三次投掷
(12.5% lose 300; 37.5% win 0; 37.5% win 300; 12.5% win 600)
(12.5%输300;37.5%赢0;37.5%赢300;12.5%赢600)
150
150
Losses doubled
损失翻倍
(12.5% lose 600; 37.5% win 0; 37.5% win 300; 12.5% win 600)
(12.5%输600;37.5%赢 0;37.5%赢300;12.5%赢600)
112.5
112.5
You can see in the display that the gamble has an expected value of 50. However, one toss is worth nothing to Sam because he feels that the pain of losing a dollar is twice as intense as the pleasure of winning a dollar. After rewriting the gamble to reflect his loss aversion, Sam will find that the value of the gamble is 0.
你可以在显示屏上看到,这场赌博的预期值为50。然而,一次抛掷对山姆来说一文不值,因为他觉得失去一美元的痛苦比赢得一美元的快乐要强烈一倍。在重写赌博以反映他的损失厌恶后,山姆会发现赌博的价值是0。
Now consider two tosses. The chances of losing have gone down to 25%. The two extreme outcomes (lose 200 or win 400) cancel out in value; they are equally likely, and the losses are weighted twice as much as the gain. But the intermediate outcome (one loss, one gain) is positive, and so is the compound gamble as a whole. Now you can see the cost of narrow framing and the magic of aggregating gambles. Here are two favorable gambles, which individually are worth nothing to Sam. If he encounters the offer on two separate occasions, he will turn it down both times. However, if he bundles the two offers together, they are jointly worth $50!
现在 ,考虑进行两次抛掷。 ,输的几率已经下降到25%。两个极端的结果(输200或赢400)在价值上抵消了;它们的可能性相同,而且损失的权重是收益的两倍。但中间的结果(一个损失,一个收益)是积极的,复合赌博的整体也是如此。现在你可以看到狭义框架的代价和聚合赌博的魅力。这里有两个有利的 赌博,这两个赌博对山姆来说没有任何价值。如果他在两个不同的场合遇到了这个提议,他两次都会拒绝它。但是,如果他把这两个提议捆绑在一起,它们就会共同值50美元!这时,他就会拒绝。
Things get even better when three gambles are bundled. The extreme outcomes still cancel out, but they have become less significant. The third toss, although worthless if evaluated on its own, has added $62.50 to the total value of the package. By the time Sam is offered five gambles, the expected value of the offer will be $250, his probability of losing anything will be 18.75%, and his cash equivalent will be $203.125. The notable aspect of this story is that Sam never wavers in his aversion to losses. However, the aggregation of favorable gambles rapidly reduces the probability of losing, and the impact of loss aversion on his preferences diminishes accordingly.
当三个赌博被捆绑在一起时,情况会变得更好。极端的结果仍然被抵消,但它们已经变得不那么重要。第三次抛掷,尽管如果单独评估 ,没有任何价值,但却为整套方案的总价值增加了62.5美元。当萨姆被提供五次赌博时,提供的预期价值将是250美元,他失去任何东西的概率将是18.75%,他的现金等价物将是203.125美元。这个故事值得注意的地方是,山姆对损失的厌恶从未动摇过。然而,有利的赌博的集合迅速减少了输掉的概率 ,损失厌恶对他的偏好的影响也相应地减少了。
Now I have a sermon ready for Sam if he rejects the offer of a single highly favorable gamble played once, and for you if you share his unreasonable aversion to losses:
现在我已经为山姆准备好了布道,如果他拒绝了一次高度有利的赌博提议,如果你和他一样对损失有不合理的厌恶,我也会为你准备布道。
I sympathize with your aversion to losing any gamble, but it is costing you a lot of money. Please consider this question: Are you on your deathbed? Is this the last offer of a small favorable gamble that you will ever consider? Of course, you are unlikely to be offered exactly this gamble again, but you will have many opportunities to consider attractive gambles with stakes that are very small relative to your wealth. You will do yourself a large financial favor if you are able to see each of these gambles as part of a bundle of small gambles and rehearse the mantra that will get you significantly closer to economic rationality: you win a few, you lose a few. The main purpose of the mantra is to control your emotional response when you do lose. If you can trust it to be effective, you should remind yourself of it when deciding whether or not to accept a small risk with positive expected value. Remember these qualifications when using the mantra:
我很同情你对输掉任何赌博的厌恶,但这让你损失了很多钱。请考虑这个问题。你是在你的 临终前吗?这是你最后一次考虑的小型有利赌博的提议吗?当然,你不太可能再得到这样的赌博机会,但你会有很多机会考虑相对于你的财富来说赌注非常小的有吸引力的赌博。如果你能把这些赌博中的每一个都看作是一捆小的 ,并反复念叨这个会让你大大接近经济理性的咒语:你赢了一些,你输了一些,那么你就会给自己带来很大的经济好处。这句口头禅的主要目的是控制你在输的时候的情绪反应。如果你能相信它是有效的,你应该在决定是否接受具有正预期价值的小风险时提醒自己。在使用该咒语时,请记住这些资格 。
• It works when the gambles are genuinely independent of each other; it does not apply to multiple investments in the same industry, which would all go bad together.
- 当赌博真正相互独立时,它是有效的;它不适用于同一行业的多项投资,这些投资会一起变坏。
• It works only when the possible loss does not cause you to worry about your total wealth. If you would take the loss as significant bad news about your economic future, watch it!
- 只有当可能的损失不会导致你对你的总财富感到担心时,它才会起作用 。如果你会把损失看作是对你经济未来的重大坏消息,那就小心点吧!
• It should not be applied to long shots, where the probability of winning is very small for each bet.
- 它不应该被应用于 ,因为长枪短炮,每一个赌注的赢钱概率都非常小。
If you have the emotional discipline that this rule requires, you will never consider a small gamble in isolation or be loss averse for a small gamble until you are actually on your deathbed—and not even then.
如果你有这一规则所要求的情感纪律,你永远不会孤立地考虑一个小赌博,也不会为一个小赌博而厌恶损失,直到你真的在临终前,甚至那时也不会。
This advice is not impossible to follow. Experienced traders in financial markets live by it every day, shielding themselves from the pain of losses by broad framing . As was mentioned earlier, we now know that experimental subjects could be almost cured of their loss aversion (in a particular context) by inducing them to “think like a trader,” just as experienced baseball card traders are not as susceptible to the endowment effect as novices are. Students made risky decisions (to accept or reject gambles in which they could lose) under different instructions. In the narrow-framing condition, they were told to “make each decision as if it were the only one” and to accept their emotions. The instructions for broad framing of a decision included the phrases “imagine yourself as a trader,” “you do this all the time,” and “treat it as one of many monetary decisions, which will sum together to produce a ‘portfolio.’” The experimenters assessed the subjects’ emotional response to gains and losses by physiological measures, including changes in the electrical conductance of the skin that are used in lie detection. As expected, broad framing blunted the emotional reaction to losses and increased the willingness to take risks.
这个建议并不是无法遵循。金融市场上有经验的交易者每天都在遵循它,通过 宽泛的框架 将 ,使自己免受损失的痛苦。如前所述,我们现在知道,通过诱导实验对象 "像交易员一样思考",几乎可以治愈他们的损失厌恶(在特定情况下),正如经验丰富的棒球卡交易员不像新手那样容易受到禀赋效应的影响一样。在不同的指示下,学生们做出了危险的决定(接受或拒绝他们可能会输的赌博 )。在狭义框架条件下,他们被告知要 "把每个决定当作唯一的决定 "并接受他们的情绪。广义决策的指令包括 "把自己想象成一个交易员"、"你一直在做这个"、"把它当作许多货币决策中的一个,这些决策加起来就会产生一个 '投资组合'"等短语。实验者通过生理措施评估受试者对收益和损失的情绪反应,包括用于测谎的皮肤电导率的变化。正如预期的那样,宽泛的框架削弱了对损失的情绪反应,增加了承担风险的意愿。
The combination of loss aversion and narrow framing is a costly curse. Individual investors can avoid that curse, achieving the emotional benefits of broad framing while also saving time and agony, by reducing the frequency with which they check how well their investments are doing. Closely following daily fluctuations is a losing proposition, because the pain of the frequent small losses exceeds the pleasure of the equally frequent small gains. Once a quarter is enough, and may be more than enough for individual investors. In addition to improving the emotional quality of life, the deliberate avoidance of exposure to short-term outcomes improves the quality of both decisions and outcomes. The typical short-term reaction to bad news is increased loss aversion. Investors who get aggregated feedback receive such news much less often and are likely to be less risk averse and to end up richer. You are also less prone to useless churning of your portfolio if you don’t know how every stock in it is doing every day (or every week or even every month). A commitment not to change one’s position for several periods ( the equivalent of “locking in” an investment) improves financial performance.
损失厌恶和狭义框架的结合是一个昂贵的诅咒。 个人投资者可以避免这种诅咒,通过减少检查他们的投资情况的频率,实现广泛框架的情感利益,同时也节省时间和痛苦。密切关注每天的波动是一个失败的提议,因为频繁的小损失带来的痛苦超过了同样频繁的小收益带来的快乐。一个季度一次就够了, ,对个人投资者来说可能绰绰有余。除了提高生活的情感质量外,刻意避免接触短期结果也能提高决策和结果的质量。对坏消息的典型短期反应是增加损失厌恶。得到汇总反馈的投资者收到这种消息的频率要低得多,而且很可能不那么 厌恶风险,并且 ,最终更富有。如果你不知道你的投资组合中的每只股票每天(或每周甚至每月)的表现如何,你也不容易对你的投资组合进行无用的搅动。承诺在几个时期内不改变自己的头寸 (相当于 "锁定 " 一项投资)可以提高财务业绩。
RISK POLICIES
风险政策
Decision makers who are prone to narrow framing construct a preference every time they face a risky choice. They would do better by having a risk policy that they routinely apply whenever a relevant problem arises. Familiar examples of risk policies are “always take the highest possible deductible when purchasing insurance” and “never buy extended warranties.” A risk policy is a broad frame. In the insurance examples, you expect the occasional loss of the entire deductible, or the occasional failure of an uninsured product. The relevant issue is your ability to reduce or eliminate the pain of the occasional loss by the thought that the policy that left you exposed to it will almost certainly be financially advantageous over the long run.
易受狭隘框架影响的决策者 ,每次面对风险选择时都会构建一个偏好。他们最好有一个 风险政策 ,每当相关问题出现时,他们都会例行公事地应用。熟悉的风险政策的例子是 "在购买保险时总是采取尽可能高的免赔额 "和 "从不购买延长保修"。风险政策是一个广泛的框架。在保险的例子中,你预计偶尔会有整个免赔额的损失 ,或者偶尔会有未投保产品的故障。相关的问题是你是否有能力减少或消除偶尔损失的痛苦,因为你想到让你暴露在这种情况下的政策几乎肯定在财务上是有利的,从长期来看。
A risk policy that aggregates decisions is analogous to the outside view of planning problems that I discussed earlier. The outside view shifts the focus from the specifics of the current situation to the statistics of outcomes in similar situations. The outside view is a broad frame for thinking about plans. A risk policy is a broad frame that embeds a particular risky choice in a set of similar choices.
汇总决策的风险政策类似于我之前讨论的规划问题的外部观点, 。外部观点将重点从当前情况的具体细节转移到类似情况下的结果统计。外部观点是思考计划的一个广泛框架。风险政策是一个广泛的框架,它将一个特定的风险选择嵌入一组类似的选择中。
The outside view and the risk policy are remedies against two distinct biases that affect many decisions: the exaggerated optimism of the planning fallacy and the exaggerated caution induced by loss aversion. The two biases oppose each other. Exaggerated optimism protects individuals and organizations from the paralyzing effects of loss aversion; loss aversion protects them from the follies of overconfident optimism. The upshot is rather comfortable for the decision maker. Optimists believe that the decisions they make are more prudent than they really are, and loss-averse decision makers correctly reject marginal propositions that they might otherwise accept. There is no guarantee, of course, that the biases cancel out in every situation. An organization that could eliminate both excessive optimism and excessive loss aversion should do so. The combination of the outside view with a risk policy should be the goal.
外部观点和风险政策是针对影响 许多决策的两种不同偏见的补救措施:计划谬误的夸大乐观和损失厌恶所引起的夸大谨慎。这两种偏见是相互对立的。夸张的乐观主义保护个人和组织免受损失规避的麻痹影响;损失规避则保护他们免受过度自信的乐观主义的愚弄。其结果是对决策者来说相当舒适。乐观主义者 ,相信他们所做的决定比实际情况更谨慎,而损失规避的决策者正确地拒绝了他们可能接受的边际主张。当然,不能保证这些偏见在每一种情况下都能被抵消。一个组织如果能够同时消除过度乐观和过度损失规避,就应该这样做。将外部观点 ,与风险政策相结合,应该是目标。
Richard Thaler tells of a discussion about decision making he had with the top managers of the 25 divisions of a large company. He asked them to consider a risky option in which, with equal probabilities, they could lose a large amount of the capital they controlled or earn double that amount. None of the executives was willing to take such a dangerous gamble. Thaler then turned to the CEO of the company, who was also present, and asked for his opinion. Without hesitation, the CEO answered, “I would like all of them to accept their risks.” In the context of that conversation, it was natural for the CEO to adopt a broad frame that encompassed all 25 bets. Like Sam facing 100 coin tosses, he could count on statistical aggregation to mitigate the overall risk.
理查德-塞勒(Richard Thaler)讲述了他与一家大公司25个部门的高级经理 ,讨论决策问题。他让他们考虑一个有风险的选项,在同等概率下,他们可能会损失大量他们控制的资本或赚取两倍的资本。没有一个高管愿意进行这样一场危险的赌博。 泰勒随后转向也在场的公司首席执行官,并询问他的意见。这位首席执行官毫不犹豫地回答说:"我希望他们所有人都能接受自己的风险"。在那次谈话的背景下,首席执行官很自然地采取了一个广泛的框架,包括所有25个赌注在内。就像山姆面对100次抛硬币一样,他可以依靠统计学上的汇总来减轻整体的 风险。
SPEAKING OF RISK POLICIES
谈及风险政策
“Tell her to think like a trader! You win a few, you lose a few.”
"告诉她要像个交易员一样思考!你赢了一些,你就输了一些。"
“I decided to evaluate my portfolio only once a quarter. I am too loss averse to make sensible decisions in the face of daily price fluctuations.”
"我决定每季度只评估一次我的投资组合。我太厌恶损失了,面对每天的价格波动,我无法做出明智的决定。"
“They never buy extended warranties. That’s their risk policy.”
"他们从不购买延长保修期。那是他们的风险政策。"
“Each of our executives is loss averse in his or her domain. That’s perfectly natural, but the result is that the organization is not taking enough risk.”
"我们的每个高管在他或她的领域都是厌恶损失的。这是很自然的, ,但结果是组织没有承担足够的风险。"
32
32
Keeping Score
记分
Except for the very poor, for whom income coincides with survival, the main motivators of money-seeking are not necessarily economic. For the billionaire looking for the extra billion, and indeed for the participant in an experimental economics project looking for the extra dollar, money is a proxy for points on a scale of self-regard and achievement. These rewards and punishments, promises and threats, are all in our heads. We carefully keep score of them. They shape our preferences and motivate our actions, like the incentives provided in the social environment. As a result, we refuse to cut losses when doing so would admit failure, we are biased against actions that could lead to regret, and we draw an illusory but sharp distinction between omission and commission, not doing and doing, because the sense of responsibility is greater for one than for the other. The ultimate currency that rewards or punishes is often emotional, a form of mental self-dealing that inevitably creates conflicts of interest when the individual acts as an agent on behalf of an organization.
除了非常贫穷的人,对他们来说,收入与生存是一致的,寻求金钱的主要动机不一定是经济的。对于寻找额外的10亿的亿万富翁来说,实际上对于在实验经济学项目中寻找额外的1美元的参与者 ,金钱是自我评价和成就的尺度上的一个代理点。这些奖励和惩罚,承诺和威胁,都在我们的脑子里。我们小心翼翼地记下它们的分数。它们塑造了我们的偏好,激励着我们的行动,就像社会环境中提供的激励一样。因此,我们拒绝在做 ,这样做会承认失败,我们对可能导致后悔的行动有偏见,我们在不作为和有作为,不做和做之间有一个虚幻但鲜明的区别,因为责任感对一个人来说比对另一个人要大。奖励或惩罚的最终货币往往是情绪化的,这是一种精神上的自我交易,当个人作为一个组织的代理人时,不可避免地产生利益冲突 。
MENTAL ACCOUNTS
心理账户
Richard Thaler has been fascinated for many years by analogies between the world of accounting and the mental accounts that we use to organize and run our lives, with results that are sometimes foolish and sometimes very helpful. Mental accounts come in several varieties. We hold our money in different accounts, which are sometimes physical, sometimes only mental. We have spending money, general savings, earmarked savings for our children’s education or for medical emergencies. There is a clear hierarchy in our willingness to draw on these accounts to cover current needs. We use accounts for self-control purposes, as in making a household budget, limiting the daily consumption of espressos, or increasing the time spent exercising. Often we pay for self-control, for instance simultaneously putting money in a savings account and maintaining debt on credit cards. The Econs of the rational-agent model do not resort to mental accounting: they have a comprehensive view of outcomes and are driven by external incentives. For Humans, mental accounts are a form of narrow framing; they keep things under control and manageable by a finite mind.
理查德-塞勒(Richard Thaler)多年来一直对会计世界和我们用来组织和管理生活的心理账户之间的类比感到着迷,其结果有时是愚蠢的,有时是非常有用的。心理账户有几个种类。我们在不同的账户中持有我们的钱, ,这些账户有时是物理的,有时只是心理的。 我们有花钱的地方,有一般的储蓄,有为孩子的教育或医疗紧急情况的专用储蓄。在我们愿意动用这些账户来满足当前需求方面,有一个明显的等级制度。我们将账户用于自我控制的目的,如制定家庭预算,限制每天的浓咖啡消费,或增加 锻炼的时间。我们经常为自我控制付出代价,例如,同时把钱存入储蓄账户和维持信用卡的债务。理性代理模型中的生态人不会诉诸于心理核算:他们对结果有一个全面的看法,并受到外部激励的驱动。对人类来说,心理账户是一种狭隘的框架;它们使事情处于 ,并由一个有限的头脑管理。
Mental accounts are used extensively to keep score. Recall that professional golfers putt more successfully when working to avoid a bogey than to achieve a birdie. One conclusion we can draw is that the best golfers create a separate account for each hole; they do not only maintain a single account for their overall success. An ironic example that Thaler related in an early article remains one of the best illustrations of how mental accounting affects behavior:
心理账户被广泛地用于记分。回顾一下,职业高尔夫球员在努力避免柏忌的时候,比在实现小鸟的时候推杆更成功。我们可以得出的一个结论是,最好的高尔夫球手为每个洞都建立了一个单独的账户;他们不仅为他们的整体成功保持一个单一的账户。Thaler在早期的一篇文章中提到的一个具有讽刺意味的例子 ,至今仍是心理账户如何影响行为的最佳说明之一。
Two avid sports fans plan to travel 40 miles to see a basketball game. One of them paid for his ticket; the other was on his way to purchase a ticket when he got one free from a friend. A blizzard is announced for the night of the game. Which of the two ticket holders is more likely to brave the blizzard to see the game?
两个狂热的体育迷计划旅行40英里去看一场篮球赛。其中一个人付了票钱;另一个人在去买票的路上,从朋友那里得到了一张免费的票。比赛当晚宣布会有暴风雪。这两个持票人中,哪一个更可能 ,冒着暴风雪去看比赛?
The answer is immediate: we know that the fan who paid for his ticket is more likely to drive. Mental accounting provides the explanation. We assume that both fans set up an account for the game they hoped to see. Missing the game will close the accounts with a negative balance. Regardless of how they came by their ticket, both will be disappointed—but the closing balance is distinctly more negative for the one who bought a ticket and is now out of pocket as well as deprived of the game. Because staying home is worse for this individual, he is more motivated to see the game and therefore more likely to make the attempt to drive into a blizzard . These are tacit calculations of emotional balance, of the kind that System 1 performs without deliberation. The emotions that people attach to the state of their mental accounts are not acknowledged in standard economic theory. An Econ would realize that the ticket has already been paid for and cannot be returned. Its cost is “sunk” and the Econ would not care whether he had bought the ticket to the game or got it from a friend (if Econs have friends). To implement this rational behavior, System 2 would have to be aware of the counterfactual possibility: “Would I still drive into this snowstorm if I had gotten the ticket free from a friend?” It takes an active and disciplined mind to raise such a difficult question.
答案是直接的:我们知道付钱买票的球迷更有可能开车。心理核算提供了解释。我们假设两个球迷都为他们希望看到的比赛设立了一个账户。错过比赛将使账户出现负数。无论他们是如何买到票的,都会感到失望--但对于买了票的人来说, ,结账时的余额显然是更多的负数,因为他现在既没有钱,也被剥夺了看比赛的机会。因为呆在家里对这个人来说更糟,他更有动力去看比赛,因此更有可能尝试 开车进入暴风雪 。这些都是情感平衡的默契计算,是系统1未经深思熟虑就进行的那种计算。 在标准的经济理论中,人们对其心理账户的状态所附加的情感并没有得到承认。一个经济学家会意识到,机票已经被支付了,而且不能被退回。它的成本是 "沉没的",而且经济学人不会关心他是 ,还是从朋友那里得到的比赛门票(如果经济学人有朋友的话)。为了实现这种理性行为,系统2将 ,必须意识到反事实的可能性。"如果我从朋友那里免费得到了门票,我还会开车进入这场暴风雪吗?"要提出这样一个困难的问题,需要一个积极和有纪律的头脑。
A related mistake afflicts individual investors when they sell stocks from their portfolio:
当个人投资者从其投资组合中出售股票时,一个相关的错误困扰着他们。
You need money to cover the costs of your daughter’s wedding and will have to sell some stock. You remember the price at which you bought each stock and can identify it as a “winner,” currently worth more than you paid for it, or as a loser. Among the stocks you own, Blueberry Tiles is a winner; if you sell it today you will have achieved a gain of $5,000. You hold an equal investment in Tiffany Motors, which is currently worth $5,000 less than you paid for it. The value of both stocks has been stable in recent weeks. Which are you more likely to sell?
你需要钱来支付你女儿的婚礼费用,并将不得不出售一些 股票。你记得你买入每支股票的价格,并能确定它是一个 "赢家",目前的价值超过你为它支付的价格,或者是一个输家。在你拥有的股票中,蓝莓瓷砖是赢家;如果你今天卖掉它,你将获得5,000美元的收益。你持有的蒂芙尼汽车的投资额相等,它目前的价值比你买它时少了5,000美元。这两只 股票的价值在最近几周一直很稳定。你更有可能卖掉哪一支?
A plausible way to formulate the choice is this: “I could close the Blueberry Tiles account and score a success for my record as an investor. Alternatively, I could close the Tiffany Motors account and add a failure to my record. Which would I rather do?” If the problem is framed as a choice between giving yourself pleasure and causing yourself pain, you will certainly sell Blueberry Tiles and enjoy your investment prowess. As might be expected, finance research has documented a massive preference for selling winners rather than losers—a bias that has been given an opaque label: the disposition effect .
一个合理的方式来制定选择是这样的。"我可以关闭蓝莓瓷砖的账户,为我的投资者记录添上一笔成功。或者,我可以关闭蒂芙尼汽车公司的账户,在我的记录中增加一个失败。我宁愿做哪一个?"如果这个问题被设定为在给自己带来快乐 ,和给自己带来痛苦之间的选择,你肯定会卖掉蓝莓瓷砖,享受你的投资能力。正如可以预料的那样,金融研究已经记录了大量对出售赢家而不是输家的偏好--这种偏见被赋予了一个不透明的标签: 处置效应 。
The disposition effect is an instance of narrow framing . The investor has set up an account for each share that she bought, and she wants to close every account as a gain. A rational agent would have a comprehensive view of the portfolio and sell the stock that is least likely to do well in the future, without considering whether it is a winner or a loser. Amos told me of a conversation with a financial adviser, who asked him for a complete list of the stocks in his portfolio, including the price at which each had been purchased. When Amos asked mildly, “Isn’t it supposed not to matter?” the adviser looked astonished. He had apparently always believed that the state of the mental account was a valid consideration.
处置效应是 狭义框架 的一个实例。投资者为她买入的每只股票设立了一个账户, ,她想把每个账户都作为收益关闭。一个理性的代理人会对投资组合有一个全面的看法,并卖掉未来最不可能做得好的股票,而不考虑它是赢家还是输家。阿莫斯告诉我与一位财务顾问的谈话,这位财务顾问要求他提供其投资组合中股票的完整清单,包括每个股票的购买价格 。当阿莫斯温和地问道:"这不是应该不重要吗?"这位顾问显得很惊讶。他显然一直认为,心理账户的状况是一个有效的考虑因素。
Amos’s guess about the financial adviser’s beliefs was probably right, but he was wrong to dismiss the buying price as irrelevant. The purchase price does matter and should be considered, even by Econs. The disposition effect is a costly bias because the question of whether to sell winners or losers has a clear answer, and it is not that it makes no difference. If you care about your wealth rather than your immediate emotions, you will sell the loser Tiffany Motors and hang on to the winning Blueberry Tiles. At least in the United States, taxes provide a strong incentive: realizing losses reduces your taxes, while selling winners exposes you to taxes. This elementary fact of financial life is actually known to all American investors, and it determines the decisions they make during one month of the year—investors sell more losers in December, when taxes are on their mind. The tax advantage is available all year, of course, but for 11 months of the year mental accounting prevails over financial common sense. Another argument against selling winners is the well-documented market anomaly that stocks that recently gained in value are likely to go on gaining at least for a short while. The net effect is large: the expected after-tax extra return of selling Tiffany rather than Blueberry is 3.4% over the next year. Closing a mental account with a gain is a pleasure, but it is a pleasure you pay for. The mistake is not one that an Econ would ever make, and experienced investors, who are using their System 2, are less susceptible to it than are novices.
阿莫斯对财务顾问的信念的猜测可能是正确的,但他认为购买价格无关紧要是错误的。购买价格确实很重要,应该被考虑, ,甚至是Econs。处置效应 ,因为卖出赢家还是输家的问题有一个明确的答案,并不是说它没有区别。如果你关心的是你的财富而不是你眼前的情绪,你会卖掉失败者的蒂芙尼汽车,而保留胜利者的蓝莓瓷砖。至少在美国,税收提供了一个强有力的激励:实现 损失可以减少你的税收,而出售赢家则会使你面临税收。这个金融生活的基本事实实际上是所有美国投资者都知道的,它决定了他们在一年中的某一个月所做的决定--投资者在12月卖出更多的失败者,这时他们会考虑到税收问题。当然,税收优势是全年都有的,但在一年中的11个月里,心理会计 ,胜过财务常识。反对卖出赢家的另一个论据是有据可查的市场反常现象,即最近增值的股票至少在短时间内可能会继续增值。净影响很大:卖出蒂芙尼而不是蓝莓的预期税后额外回报在未来一年是3.4%。以收益关闭心理账户是一种乐趣,但 ,这是一种你付出代价的乐趣。这个错误不是一个经济学人会犯的,有经验的投资者,在使用他们的系统2时,比新手 更不容易受其影响 。
A rational decision maker is interested only in the future consequences of current investments. Justifying earlier mistakes is not among the Econ’s concerns. The decision to invest additional resources in a losing account, when better investments are available, is known as the sunk-cost fallacy , a costly mistake that is observed in decisions large and small. Driving into the blizzard because one paid for tickets is a sunk-cost error.
一个理性的决策者只对当前投资的未来后果感兴趣。为早期的错误辩护并不在经济学的关注范围之内。当有更好的投资时,将额外的资源投资于一个亏损的 账户的决定被称为 " 沉没成本谬误 ",这是一个代价高昂的错误,在大大小小的决策中都可以看到。因为买了票而开车进入暴风雪中,就是一个沉没成本的错误。
Imagine a company that has already spent $50 million on a project. The project is now behind schedule and the forecasts of its ultimate returns are less favorable than at the initial planning stage. An additional investment of $60 million is required to give the project a chance. An alternative proposal is to invest the same amount in a new project that currently looks likely to bring higher returns. What will the company do? All too often a company afflicted by sunk costs drives into the blizzard, throwing good money after bad rather than accepting the humiliation of closing the account of a costly failure. This situation is in the top-right cell of the fourfold pattern ( here ), where the choice is between a sure loss and an unfavorable gamble, which is often unwisely preferred.
想象一下,一家公司已经在一个项目上花费了5000万美元。该项目现在已经落后于计划,对其最终回报的预测也不如在 最初的计划阶段那么有利。为了给这个项目一个机会,需要增加6000万美元的投资。另一个建议是在一个新的项目上投资同样的金额,这个项目目前看来可能会带来更高的回报。公司会怎么做?一个受沉没成本困扰的公司往往会驶入暴风雪中,把好的钱扔到坏的地方,而不是接受羞辱 ,关闭一个昂贵的失败账户。这种情况在四重模式的右上角 (这里 ),在肯定的损失和不利的赌博之间进行选择,而这往往是不明智的选择。
The escalation of commitment to failing endeavors is a mistake from the perspective of the firm but not necessarily from the perspective of the executive who “owns” a floundering project. Canceling the project will leave a permanent stain on the executive’s record, and his personal interests are perhaps best served by gambling further with the organization’s resources in the hope of recouping the original investment—or at least in an attempt to postpone the day of reckoning. In the presence of sunk costs, the manager’s incentives are misaligned with the objectives of the firm and its shareholders, a familiar type of what is known as the agency problem. Boards of directors are well aware of these conflicts and often replace a CEO who is encumbered by prior decisions and reluctant to cut losses. The members of the board do not necessarily believe that the new CEO is more competent than the one she replaces. They do know that she does not carry the same mental accounts and is therefore better able to ignore the sunk costs of past investments in evaluating current opportunities.
从公司的角度来看,对失败的努力的承诺升级是一个错误,但从 "拥有 "一个陷入困境的 项目的高管的角度来看却不一定。取消项目会在高管的记录上留下永久的污点,而他的个人利益 ,也许最好的办法是进一步用组织的资源进行赌博,希望能收回最初的投资,或者至少试图推迟清算的日子。在存在沉没成本的情况下,经理人的激励机制与 公司及其股东的目标不一致,这就是人们所熟悉的代理问题的类型。董事会清楚地意识到这些冲突,并经常更换受先前决定束缚和不愿意减少损失的首席执行官。董事会成员不一定认为新的CEO比她所取代的人更有能力。他们确实知道,她没有携带相同的心理账户 ,因此在评估当前的机会时能够更好地忽略过去投资的沉没成本。
The sunk-cost fallacy keeps people for too long in poor jobs, unhappy marriages, and unpromising research projects. I have often observed young scientists struggling to salvage a doomed project when they would be better advised to drop it and start a new one. Fortunately, research suggests that at least in some contexts the fallacy can be overcome . The sunk-cost fallacy is identified and taught as a mistake in both economics and business courses, apparently to good effect: there is evidence that graduate students in these fields are more willing than others to walk away from a failing project.
沉没成本谬论使人们在糟糕的工作、不幸福的婚姻和没有前途的研究项目中呆了太长时间。我经常看到年轻的科学家在努力挽救一个注定失败的项目,而他们最好是放弃这个项目,开始一个新的项目。幸运的是,研究表明 ,至少在某些情况下,这种 谬误是可以克服的 。在经济学和商业课程中,沉没成本谬误被认为是一个错误,并被作为一个错误来教授,显然效果不错:有证据表明,这些领域的研究生比其他人更愿意放弃一个失败的项目。
REGRET
遗憾的是
Regret is an emotion, and it is also a punishment that we administer to ourselves. The fear of regret is a factor in many of the decisions that people make (“Don’t do this, you will regret it” is a common warning), and the actual experience of regret is familiar. The emotional state has been well described by two Dutch psychologists, who noted that regret is “accompanied by feelings that one should have known better, by a sinking feeling , by thoughts about the mistake one has made and the opportunities lost, by a tendency to kick oneself and to correct one’s mistake, and by wanting to undo the event and to get a second chance.” Intense regret is what you experience when you can most easily imagine yourself doing something other than what you did.
后悔是一种情绪,它也是我们对自己的一种惩罚。 对后悔的恐惧是人们所做的许多决定中的一个因素("不要这样做,你会后悔的 "是一个常见的警告),而后悔的实际经历也很熟悉。两位荷兰心理学家对这种情绪状态进行了很好的描述,他们指出,后悔是 "伴随着一种感觉,即一个人应该知道的更多, 一种沉沦的感觉 ,对自己所犯的错误 ,失去的机会的思考,一种踢自己和纠正自己错误的倾向,以及想要撤销事件和获得第二次机会"。强烈的后悔是你所经历的,当你最容易想象自己做了什么以外的事情。
Regret is one of the counterfactual emotions that are triggered by the availability of alternatives to reality. After every plane crash there are special stories about passengers who “should not” have been on the plane—they got a seat at the last moment, they were transferred from another airline, they were supposed to fly a day earlier but had had to postpone. The common feature of these poignant stories is that they involve unusual events—and unusual events are easier than normal events to undo in imagination. Associative memory contains a representation of the normal world and its rules. An abnormal event attracts attention, and it also activates the idea of the event that would have been normal under the same circumstances.
遗憾是反事实的情绪之一,它是由现实的可替代性引发的。在每一次 飞机失事之后,都会有一些特殊的故事,讲述那些 "不应该 "在飞机上的乘客--他们在最后一刻得到了一个座位,他们是从另一家航空公司转来的, 他们本应提前一天飞,但不得不推迟。这些凄美的故事的共同特点是,它们涉及不寻常的事件,而不寻常的事件比正常的事件更容易在想象中撤销。联想 记忆包含了对正常世界及其规则的表述。一个不正常的事件吸引了人们的注意,它也激活了在相同情况下本来是正常的事件的想法。
To appreciate the link of regret to normality , consider the following scenario:
为了理解 遗憾与常态 的联系,请考虑以下情景。
Mr. Brown almost never picks up hitchhikers. Yesterday he gave a man a ride and was robbed.
布朗先生几乎从不接载搭车者。昨天他让一个人搭车,结果被抢了。
Mr. Smith frequently picks up hitchhikers. Yesterday he gave a man a ride and was robbed.
史密斯先生经常 ,接载搭便车的人。昨天他搭了一个人的车,结果被抢了。
Who of the two will experience greater regret over the episode?
两人中谁会对这一事件经历更大的遗憾?
The results are not surprising: 88% of respondents said Mr. Brown, 12% said Mr. Smith.
结果并不令人惊讶。88%的受访者说布朗先生,12%说史密斯先生。
Regret is not the same as blame. Other participants were asked this question about the same incident:
后悔与指责是不一样的。其他参与者被问到关于同一事件的这个问题。
Who will be criticized most severely by others?
谁会受到别人最严厉的批评?
The results: Mr. Brown 23%, Mr. Smith 77%.
结果是。布朗先生 23%,史密斯先生77%。
Regret and blame are both evoked by a comparison to a norm, but the relevant norms are different. The emotions experienced by Mr. Brown and Mr. Smith are dominated by what they usually do about hitchhikers. Taking a hitchhiker is an abnormal event for Mr. Brown, and most people therefore expect him to experience more intense regret. A judgmental observer, however, will compare both men to conventional norms of reasonable behavior and is likely to blame Mr. Smith for habitually taking unreasonable risks . We are tempted to say that Mr. Smith deserved his fate and that Mr. Brown was unlucky. But Mr. Brown is the one who is more likely to be kicking himself, because he acted out of character in this one instance.
遗憾和责备都是由与规范的比较引起的,但相关的规范是不同的。布朗先生和史密斯先生所经历的情绪是由他们通常对搭便车的行为所主导的。搭顺风车对布朗先生来说是一个不正常的事件,因此大多数人都期望他能经历更强烈的后悔。然而,一个有判断力的观察者会将 这两个人与传统的合理行为规范进行比较,并有可能指责史密斯先生 习惯性地承担不合理的风险 。我们很想说,史密斯先生是罪有应得,布朗先生是运气不好。但布朗先生才是更有可能自责的人,因为他在这一次的行为超出了他的性格。
Decision makers know that they are prone to regret, and the anticipation of that painful emotion plays a part in many decisions. Intuitions about regret are remarkably uniform and compelling, as the next example illustrates.
决策者知道他们容易后悔,而且 ,对这种痛苦情绪的预期在许多决策中都起着作用。正如下一个例子所表明的,关于后悔的直觉是 非常统一 和有说服力的。
Paul owns shares in company A. During the past year he considered switching to stock in company B, but he decided against it. He now learns that he would have been better off by $1,200 if he had switched to the stock of company B.
保罗拥有A公司的股票 。在过去的一年里,他曾考虑转为B公司的股票,但他决定不这样做。他现在得知,如果他换成B公司的股票,他的收入会增加1200美元, 。
George owned shares in company B. During the past year he switched to stock in company A. He now learns that he would have been better off by $1,200 if he had kept his stock in company B.
乔治拥有B公司的股票,在过去的一年里,他换成了A公司的股票。他现在得知,如果他保留B公司的股票,他的收入会增加1200美元。
Who feels greater regret?
谁会感到更大的遗憾?
The results are clear-cut: 8% of respondents say Paul, 92% say George.
结果很明显:8%的受访者说是保罗,92%说是乔治。
This is curious, because the situations of the two investors are objectively identical. They both now own stock A and both would have been better off by the same amount if they owned stock B. The only difference is that George got to where he is by acting, whereas Paul got to the same place by failing to act. This short example illustrates a broad story: people expect to have stronger emotional reactions (including regret) to an outcome that is produced by action than to the same outcome when it is produced by inaction. This has been verified in the context of gambling: people expect to be happier if they gamble and win than if they refrain from gambling and get the same amount. The asymmetry is at least as strong for losses, and it applies to blame as well as to regret. The key is not the difference between commission and omission but the distinction between default options and actions that deviate from the default . When you deviate from the default, you can easily imagine the norm—and if the default is associated with bad consequences, the discrepancy between the two can be the source of painful emotions. The default option when you own a stock is not to sell it, but the default option when you meet your colleague in the morning is to greet him. Selling a stock and failing to greet your coworker are both departures from the default option and natural candidates for regret or blame.
这很奇怪,因为这两个投资者的情况客观上是相同的。他们现在都拥有 股票A,如果他们拥有股票B,他们的情况都会好很多。唯一的区别是,乔治是通过行动达到他的目的,而保罗是通过不行动达到同一目的。这个简短的例子说明了一个广泛的故事:人们期望对一个由行动产生的结果产生更强烈的情感反应(包括后悔),而不是对同样的结果由不行动产生的 。这一点在赌博的背景下得到了验证:人们期望如果他们赌博并赢了,会比他们不赌博并得到相同的金额更高兴。这种不对称性对于损失来说至少是同样强烈的,而且它也 适用于责备 和后悔。关键不在于委托和不委托之间的区别,而在于默认选项和 偏离默认的行动 之间的区别 。当你偏离默认值时,你很容易想象到规范--如果默认值与不良后果有关,两者之间的差异就会成为痛苦情绪的来源。当你拥有一支股票时,默认选项是不卖掉它,但当你早上见到你的同事时,默认选项是向他打招呼。卖出股票和没有向你的同事打招呼 ,都是偏离了默认选项,自然会引起后悔或指责。
In a compelling demonstration of the power of default options, participants played a computer simulation of blackjack. Some players were asked “Do you wish to hit?” while others were asked “Do you wish to stand?” Regardless of the question, saying yes was associated with much more regret than saying no if the outcome was bad! The question evidently suggests a default response, which is, “I don’t have a strong wish to do it.” It is the departure from the default that produces regret. Another situation in which action is the default is that of a coach whose team lost badly in their last game. The coach is expected to make a change of personnel or strategy, and a failure to do so will produce blame and regret .
在一个关于默认选项的力量的引人注目的演示中,参与者玩了一个21点的计算机模拟。一些玩家被问到 "你想打吗?"而另一些人则被问到 "你想站着吗?"无论问题是什么,如果结果不好,说 "是 "比 说 "不 "要后悔得多!这个问题显然暗示了一个默认的回答,那就是:"我没有强烈的愿望去做这件事。"正是对默认的偏离产生了遗憾。另一种以行动为默认的情况是,一个教练的球队在上一场比赛中输得很惨。教练被期望做出人员或策略的改变,如果不这样做,就会 产生责备和遗憾 。
The asymmetry in the risk of regret favors conventional and risk-averse choices. The bias appears in many contexts. Consumers who are reminded that they may feel regret as a result of their choices show an increased preference for conventional options, favoring brand names over generics . The behavior of the managers of financial funds as the year approaches its end also shows an effect of anticipated evaluation: they tend to clean up their portfolios of unconventional and otherwise questionable stocks. Even life-or-death decisions can be affected. Imagine a physician with a gravely ill patient. One treatment fits the normal standard of care; another is unusual. The physician has some reason to believe that the unconventional treatment improves the patient’s chances, but the evidence is inconclusive. The physician who prescribes the unusual treatment faces a substantial risk of regret, blame, and perhaps litigation. In hindsight, it will be easier to imagine the normal choice; the abnormal choice will be easy to undo. True, a good outcome will contribute to the reputation of the physician who dared, but the potential benefit is smaller than the potential cost because success is generally a more normal outcome than is failure.
遗憾风险的不对称性有利于传统和风险规避的选择。这种偏见出现在许多情况下。消费者如果被提醒他们可能会因为自己的选择而感到后悔,就会表现出对传统选择的更多偏好,偏爱 品牌而不是非专利产品 。金融基金经理在接近年底时的行为也显示了预期 评估的影响:他们倾向于 清理其投资组合 中的非常规和其他有问题的股票。甚至生死攸关的决定也会受到影响。想象一下,一个医生面对一个重病患者。一种治疗方法符合正常的护理标准;另一种则是不寻常的。医生有理由相信,非常规的治疗方法可以改善病人的机会,但证据并不确定。 开出非常规治疗方法的医生面临着后悔、指责,甚至是诉讼的巨大风险。事后看来,想象正常的选择会比较容易;不正常的选择则很容易被撤销。诚然,一个好的结果将有助于提高敢于冒险的医生的声誉,但潜在的利益小于潜在的成本,因为成功通常是 ,比失败更正常的结果。
RESPONSIBILITY
责任
Losses are weighted about twice as much as gains in several contexts: choice between gambles, the endowment effect, and reactions to price changes. The loss-aversion coefficient is much higher in some situations. In particular, you may be more loss averse for aspects of your life that are more important than money, such as health. Furthermore, your reluctance to “sell” important endowments increases dramatically when doing so might make you responsible for an awful outcome. Richard Thaler’s early classic on consumer behavior included a compelling example, slightly modified in the following question:
在几种情况下,损失的权重大约是收益的两倍:赌博之间的选择,禀赋效应,以及对价格变化的反应。在某些情况下,损失厌恶系数要高得多。特别是, 对于你生活中 比金钱更重要的 方面 ,如健康,你可能会更 厌恶损失 。此外, ,当这样做可能使你对一个可怕的结果负责时,你不愿意 "出售 "重要的禀赋就会急剧增加。理查德-塞勒(Richard Thaler)早期 关于消费者行为的经典著作 包括一个引人注目的例子,在以下问题中稍作修改。
You have been exposed to a disease which if contracted leads to a quick and painless death within a week. The probability that you have the disease is 1/1,000. There is a vaccine that is effective only before any symptoms appear. What is the maximum you would be willing to pay for the vaccine?
你已经接触到一种疾病,如果被感染,会在一周内迅速无痛死亡。你已经 这种疾病的概率是1/1,000。有一种疫苗,只有在任何症状出现之前才会有效。你愿意为这种疫苗支付的最高金额是多少?
Most people are willing to pay a significant but limited amount. Facing the possibility of death is unpleasant, but the risk is small and it seems unreasonable to ruin yourself to avoid it. Now consider a slight variation:
大多数人都愿意支付一笔可观但有限的费用。面对死亡的可能性是令人不快的,但风险很小,为了避免它而毁掉自己似乎是不合理的。现在考虑一个轻微的变化。
Volunteers are needed for research on the above disease. All that is required is that you expose yourself to a 1/1,000 chance of contracting the disease. What is the minimum you would ask to be paid in order to volunteer for this program? (You would not be allowed to purchase the vaccine.)
需要志愿者 ,对上述疾病进行研究。所要求的是,你要将自己暴露在1/1000的机会中,感染这种疾病。为了成为这个项目的志愿者,你要求的最低报酬是什么?(你将不被允许购买疫苗)。
As you might expect, the fee that volunteers set is far higher than the price they were willing to pay for the vaccine. Thaler reported informally that a typical ratio is about 50:1. The extremely high selling price reflects two features of this problem. In the first place, you are not supposed to sell your health; the transaction is not considered legitimate and the reluctance to engage in it is expressed in a higher price. Perhaps most important, you will be responsible for the outcome if it is bad. You know that if you wake up one morning with symptoms indicating that you will soon be dead, you will feel more regret in the second case than in the first, because you could have rejected the idea of selling your health without even stopping to consider the price. You could have stayed with the default option and done nothing, and now this counterfactual will haunt you for the rest of your life.
如你所料,志愿者设定的费用远远高于他们愿意为疫苗支付的价格。 Thaler非正式地报告说,典型的比例大约是50:1。极高的售价反映了这个问题的两个特点。首先,你不应该出售你的健康;这种交易被认为是不合法的,不愿意从事这种交易的表现是价格更高。也许最重要的是,如果结果是坏的,你将对其负责。你知道 ,如果有一天早上你醒来,症状表明你很快就会死亡,在第二种情况下,你会比第一种情况下感到更多的遗憾,因为你本可以拒绝出售你的健康的想法,甚至不用停下来考虑价格。你本可以保持默认的选择,什么都不做,而现在这个反事实将困扰你的余生。
The survey of parents’ reactions to a potentially hazardous insecticide mentioned earlier also included a question about the willingness to accept increased risk. The respondents were told to imagine that they used an insecticide where the risk of inhalation and child poisoning was 15 per 10,000 bottles. A less expensive insecticide was available, for which the risk rose from 15 to 16 per 10,000 bottles. The parents were asked for the discount that would induce them to switch to the less expensive (and less safe) product. More than two-thirds of the parents in the survey responded that they would not purchase the new product at any price! They were evidently revolted by the very idea of trading the safety of their child for money. The minority who found a discount they could accept demanded an amount that was significantly higher than the amount they were willing to pay for a far larger improvement in the safety of the product.
前面提到的关于父母对一种潜在危险的杀虫剂的反应的 调查还包括一个关于是否愿意接受增加风险的问题。受访者被告知,想象他们使用一种杀虫剂,吸入和儿童中毒的风险是每10,000瓶15。有一种价格较低的杀虫剂,其风险从每10,000瓶15美元上升到16美元。 家长们被要求提供能促使他们改用较便宜(和较不安全)的产品的折扣。在调查中,超过三分之二的家长回答说他们不会以任何价格购买新产品!他们显然对新产品感到反感。显然,他们对用孩子的安全来换取金钱的想法感到反感。少数找到他们可以接受的折扣的家长要求 ,这个数额明显高于他们愿意为产品安全的更大改善而支付的数额。
Anyone can understand and sympathize with the reluctance of parents to trade even a minute increase of risk to their child for money. It is worth noting, however, that this attitude is incoherent and potentially damaging to the safety of those we wish to protect. Even the most loving parents have finite resources of time and money to protect their child (the keeping-my-child-safe mental account has a limited budget), and it seems reasonable to deploy these resources in a way that puts them to best use. Money that could be saved by accepting a minute increase in the risk of harm from a pesticide could certainly be put to better use in reducing the child’s exposure to other harms, perhaps by purchasing a safer car seat or covers for electric sockets. The taboo tradeoff against accepting any increase in risk is not an efficient way to use the safety budget. In fact, the resistance may be motivated by a selfish fear of regret more than by a wish to optimize the child’s safety. The what-if? thought that occurs to any parent who deliberately makes such a trade is an image of the regret and shame he or she would feel in the event the pesticide caused harm.
任何人都可以理解和同情父母不愿意用孩子哪怕是增加一分钟的风险来换取金钱。然而,值得注意的是,这种态度是不连贯的,对我们希望保护的人的安全有潜在的损害。 即使是最有爱心的父母也只有有限的时间和金钱资源来保护他们的孩子(保持我的孩子安全的心理账户的预算是有限的),以最有效的方式部署这些资源,似乎是合理的。通过接受杀虫剂伤害风险的微小增加而省下的钱,当然可以更好地用于减少孩子的 , ,也许可以通过购买更安全的汽车座椅或电插座的盖子来减少其他伤害。反对接受任何风险增加的 禁忌权衡 并不是使用安全预算的有效方式。事实上,抵制的动机可能是对后悔的自私恐惧,而不是希望优化孩子的安全。对于任何故意将这种 交易的父母来说,他们想到的是,如果农药造成伤害,他或她会感到后悔和羞愧。
The intense aversion to trading increased risk for some other advantage plays out on a grand scale in the laws and regulations governing risk. This trend is especially strong in Europe, where the precautionary principle , which prohibits any action that might cause harm, is a widely accepted doctrine. In the regulatory context, the precautionary principle imposes the entire burden of proving safety on anyone who undertakes actions that might harm people or the environment. Multiple international bodies have specified that the absence of scientific evidence of potential damage is not sufficient justification for taking risks. As the jurist Cass Sunstein points out, the precautionary principle is costly, and when interpreted strictly it can be paralyzing. He mentions an impressive list of innovations that would not have passed the test, including “airplanes, air conditioning, antibiotics, automobiles, chlorine, the measles vaccine, open-heart surgery, radio, refrigeration, smallpox vaccine, and X-rays.” The strong version of the precautionary principle is obviously untenable. But enhanced loss aversion is embedded in a strong and widely shared moral intuition; it originates in System 1. The dilemma between intensely loss-averse moral attitudes and efficient risk management does not have a simple and compelling solution.
对用增加的风险换取其他好处的强烈反感,在管理风险的法律和法规中得到了大规模的体现。这种趋势在欧洲尤其强烈, 在那里, 禁止任何可能造成伤害的行动的 预防原则 ,是一个被广泛接受的 理论。在监管方面,预防原则将证明安全的全部责任强加给任何采取可能伤害人类或环境的行动的人。多个国际机构明确指出,缺乏潜在损害的科学证据并不是承担风险的充分理由。正如法学家Cass Sunstein所指出的那样,预防 原则的成本很高,如果严格解释的话,它可能会造成瘫痪。他提到了一份令人印象深刻的创新清单,其中包括 "飞机、空调、抗生素、汽车、氯气、麻疹疫苗、开胸手术、无线电、制冷、天花疫苗和X射线"。强势版的预防原则显然是站不住脚的。 但是, 强化的损失厌恶 被嵌入到一个强大而广泛共享的道德直觉中;它起源于系统1。在强烈的损失规避的道德态度和有效的风险管理之间的困境并没有一个简单而令人信服的解决方案。
We spend much of our day anticipating, and trying to avoid, the emotional pains we inflict on ourselves. How seriously should we take these intangible outcomes, the self-administered punishments (and occasional rewards) that we experience as we score our lives? Econs are not supposed to have them, and they are costly to Humans. They lead to actions that are detrimental to the wealth of individuals, to the soundness of policy, and to the welfare of society. But the emotions of regret and moral responsibility are real, and the fact that Econs do not have them may not be relevant.
我们每天花很多时间来预测并试图避免我们对自己造成的情感痛苦。我们应该如何认真对待这些无形的 结果,即我们在生活中经历的自我管理的惩罚(和偶尔的奖励)?生态环境是不应该有这些结果的,而且它们对人类来说代价高昂。它们导致了不利于个人财富、政策健全和社会福利的行为。但是遗憾和道德责任的情绪是真实的,而Econs 没有这些情绪的事实可能并不重要。
Is it reasonable, in particular, to let your choices be influenced by the anticipation of regret? Susceptibility to regret, like susceptibility to fainting spells, is a fact of life to which one must adjust. If you are an investor, sufficiently rich and cautious at heart, you may be able to afford the luxury of a portfolio that minimizes the expectation of regret even if it does not maximize the accrual of wealth.
特别是,让你的选择受到后悔预期的影响,这合理吗?对后悔的敏感度,就像对昏厥的敏感度一样,是一个人必须适应的生活事实。如果你是一个投资者,有足够的财富和谨慎的心态,你可能有能力负担一个奢侈的 投资组合,即使不能最大限度地积累财富,也能最大限度地减少对 遗憾的预期。
You can also take precautions that will inoculate you against regret. Perhaps the most useful is to be explicit about the anticipation of regret. If you can remember when things go badly that you considered the possibility of regret carefully before deciding, you are likely to experience less of it. You should also know that regret and hindsight bias will come together, so anything you can do to preclude hindsight is likely to be helpful. My personal hindsight-avoiding policy is to be either very thorough or completely casual when making a decision with long-term consequences. Hindsight is worse when you think a little, just enough to tell yourself later, “I almost made a better choice.”
你也可以采取一些预防措施,使你免受后悔之苦。也许最有用的是明确说明对后悔的预期。如果你能在事情变糟时想起你在做决定前仔细考虑过后悔的可能性,那么你就有可能减少后悔的经历。你也应该知道,后悔和 事后诸葛亮的偏见会一起出现,所以你能做的任何事都可能有助于排除事后诸葛亮。我个人的事后诸葛亮政策是,在做出具有长期后果的决定时,要么非常彻底,要么完全随意。事后诸葛亮的情况更糟糕,当你稍微思考一下,就足以在事后告诉自己,"我几乎做出了一个更好的选择"。
Daniel Gilbert and his colleagues provocatively claim that people generally anticipate more regret than they will actually experience, because they underestimate the efficacy of the psychological defenses they will deploy—which they label the “psychological immune system.” Their recommendation is that you should not put too much weight on regret; even if you have some, it will hurt less than you now think.
丹尼尔-吉尔伯特和他的同事挑衅性地 ,声称人们通常预期的遗憾比他们实际会经历的更多,因为他们低估了他们将部署的心理防御措施的效力--他们称之为 "心理免疫系统"。 他们的建议是,你不应该把遗憾看得太重;即使你有一些遗憾,它的伤害也会比你现在想象的小。
SPEAKING OF KEEPING SCORE
说到记分
“He has separate mental accounts for cash and credit purchases. I constantly remind him that money is money.”
"他有 独立的心理账户,用于现金和信用卡购物。我不断提醒他,钱就是钱"。
“We are hanging on to that stock just to avoid closing our mental account at a loss. It’s the disposition effect.”
"我们挂在那支股票上,只是为了避免亏损时关闭我们的心理账户。这就是处置效应。"
“We discovered an excellent dish at that restaurant and we never try anything else, to avoid regret.”
"我们在那家餐厅发现了一道极好的菜,为了避免后悔,我们从不尝试其他东西。"
“The salesperson showed me the most expensive car seat and said it was the safest, and I could not bring myself to buy the cheaper model. It felt like a taboo tradeoff.”
"销售人员给我看了最贵的汽车座椅,并说它是 最安全的,我无法让自己购买更便宜的型号。这感觉像是一种禁忌的权衡。"
33
33
Reversals
反转
You have the task of setting compensation for victims of violent crimes. You consider the case of a man who lost the use of his right arm as a result of a gunshot wound. He was shot when he walked in on a robbery occurring in a convenience store in his neighborhood.
你的任务是为暴力犯罪的受害者确定赔偿。你要考虑一个因枪伤而失去右臂功能的人的案例。他是在撞见发生在他家附近的一家便利店的抢劫案时中枪的, 。
Two stores were located near the victim’s home, one of which he frequented more regularly than the other. Consider two scenarios:
受害人的家附近有两家商店,其中一家他比另一家更经常去。请考虑两种情况。
(i) The burglary happened in the man’s regular store .
(i) 入室盗窃发生 在该男子的正规商店 。
(ii) The man’s regular store was closed for a funeral, so he did his shopping in the other store, where he was shot.
(该男子的固定商店因葬礼而关闭,所以他在另一家商店购物,他在那里被枪杀。
Should the store in which the man was shot make a difference to his compensation?
该男子被枪击的商店是否应该对他的赔偿产生 ?
You made your judgment in joint evaluation, where you consider two scenarios at the same time and make a comparison. You can apply a rule. If you think that the second scenario deserves higher compensation, you should assign it a higher dollar value.
你在联合评价中做出了判断,即同时考虑两种情况并进行比较。你可以运用一个规则。如果你认为第二种情况应该得到更高的补偿,你应该给它一个更高的美元价值。
There is almost universal agreement on the answer: compensation should be the same in both situations. The compensation is for the crippling injury, so why should the location in which it occurred make any difference? The joint evaluation of the two scenarios gave you a chance to examine your moral principles about the factors that are relevant to victim compensation. For most people, location is not one of these factors. As in other situations that require an explicit comparison, thinking was slow and System 2 was involved.
答案几乎是一致的:两种情况下的赔偿应该是一样的。赔偿 ,是针对残缺的伤害,那么为什么发生的地点应该有任何区别?对这两种情况的联合评估给了你一个机会,来审视你对与受害者 赔偿有关的因素的道德原则。对大多数人来说,地点不是这些因素之一。正如在其他需要明确比较的情况下,思考是缓慢的,系统2 。
The psychologists Dale Miller and Cathy McFarland, who originally designed the two scenarios, presented them to different people for single evaluation. In their between-subjects experiment, each participant saw only one scenario and assigned a dollar value to it. They found, as you surely guessed, that the victim was awarded a much larger sum if he was shot in a store he rarely visited than if he was shot in his regular store. Poignancy (a close cousin of regret) is a counterfactual feeling, which is evoked because the thought “if only he had shopped at his regular store …” comes readily to mind. The familiar System 1 mechanisms of substitution and intensity matching translate the strength of the emotional reaction to the story onto a monetary scale, creating a large difference in dollar awards.
最初设计这两个场景的心理学家戴尔-米勒和凯西-麦克法兰将它们呈现给不同的人进行单一评价。在他们的受试者之间的实验中,每个受试者只看到一个场景,并为其分配一个美元价值。他们发现,正如你肯定猜到的那样,如果受害者是在他很少去的商店 ,那么他获得的金额要比在他经常去的商店被枪杀的金额大得多。尊严(后悔的近亲)是一种反事实的感觉,它被唤起是因为 "如果他在他的常规商店购物就好了...... "的想法很容易出现在脑海中。熟悉的系统1的替代和强度匹配机制将对故事的情绪反应的强度转化为货币规模,在美元奖励中创造了一个巨大的差异 。
The comparison of the two experiments reveals a sharp contrast. Almost everyone who sees both scenarios together (within-subject) endorses the principle that poignancy is not a legitimate consideration. Unfortunately, the principle becomes relevant only when the two scenarios are seen together, and this is not how life usually works. We normally experience life in the between-subjects mode, in which contrasting alternatives that might change your mind are absent, and of course WYSIATI. As a consequence, the beliefs that you endorse when you reflect about morality do not necessarily govern your emotional reactions, and the moral intuitions that come to your mind in different situations are not internally consistent.
这两个实验的比较揭示了一个鲜明的对比。几乎所有把这两个场景放在一起看的人(被试内)都赞同这样的原则:凄美不是一个合法的考虑。不幸的是,这个原则只有在两个场景一起看时才变得相关,而这并不是生活通常的运作方式。我们通常在主体间的 模式下体验生活,在这种模式下,可能改变你想法的对比性替代方案是不存在的,当然还有所见即所得。因此,当你思考道德问题时,你所认可的信念不一定会支配你的情绪反应,而且在不同情况下出现在你脑海中的道德直觉也不是内在一致的。
The discrepancy between single and joint evaluation of the burglary scenario belongs to a broad family of reversals of judgment and choice . The first preference reversals were discovered in the early 1970s, and many reversals of other kinds were reported over the years.
入室盗窃情景的单一评价和联合评价之间的差异,属于 判断和选择的 广泛的 逆转 系列。第一个偏好反转是在20世纪70年代初发现的,多年来有许多其他类型的反转被报道。
CHALLENGING ECONOMICS
挑战经济
Preference reversals have an important place in the history of the conversation between psychologists and economists . The reversals that attracted attention were reported by Sarah Lichtenstein and Paul Slovic, two psychologists who had done their graduate work at the University of Michigan at the same time as Amos. They conducted an experiment on preferences between bets, which I show in a slightly simplified version.
在 心理学家和经济学家的对话 史上,偏好逆转有着重要的地位。 引起关注的逆转是由萨拉-利希滕斯坦和保罗-斯洛维奇报告的,这两位心理学家与阿莫斯同时在密歇根大学完成了他们的研究生工作。他们进行了一个关于赌注之间的偏好的实验,我以一个略微简化的版本展示。
You are offered a choice between two bets, which are to be played on a roulette wheel with 36 sectors.
你 ,可以选择两个赌注,在一个有36个区的轮盘上进行。
Bet A: 11/36 to win $160, 25/36 to lose $15
赌注A:11/36赢160美元,25/36 ,输15美元
Bet B: 35/36 to win $40, 1/36 to lose $10
赌注B:35/36赢40美元,1/36输10美元
You are asked to choose between a safe bet and a riskier one: an almost certain win of a modest amount, or a small chance to win a substantially larger amount and a high probability of losing. Safety prevails, and B is clearly the more popular choice.
你被要求在一个安全的赌注和一个风险较大的赌注之间做出选择:是几乎肯定能赢取少量的钱,还是有很小的机会赢取大量的钱,但输的可能性很大。安全优先,B显然是更受欢迎的选择。
Now consider each bet separately: If you owned that bet, what is the lowest price at which you would sell it? Remember that you are not negotiating with anyone—your task is to determine the lowest price at which you would truly be willing to give up the bet. Try it. You may find that the prize that can be won is salient in this task, and that your evaluation of what the bet is worth is anchored on that value. The results support this conjecture, and the selling price is higher for bet A than for bet B. This is a preference reversal: people choose B over A, but if they imagine owning only one of them, they set a higher value on A than on B. As in the burglary scenarios, the preference reversal occurs because joint evaluation focuses attention on an aspect of the situation—the fact that bet A is much less safe than bet B—which was less salient in single evaluation. The features that caused the difference between the judgments of the options in single evaluation—the poignancy of the victim being in the wrong grocery store and the anchoring on the prize—are suppressed or irrelevant when the options are evaluated jointly. The emotional reactions of System 1 are much more likely to determine single evaluation; the comparison that occurs in joint evaluation always involves a more careful and effortful assessment, which calls for System 2.
现在分别考虑每个赌注。如果你拥有这个赌注, ,你会以怎样的最低价格出售它?记住,你不是在和任何人谈判--你的任务是确定你真正愿意放弃赌注的最低价格。试试吧。你可能会发现,在这项任务中,可以赢得的奖品是很突出的,你对赌注价值的评估是以这个价值为基础的。结果支持这一猜想, A赌注的售价比B赌注的售价高。这是一种偏好逆转:人们选择B而不是A,但如果他们想象只拥有其中一个,他们对A的价值就比B高。正如在入室盗窃的情景中,偏好逆转的发生是因为联合评价将注意力集中在情况的一个方面--A赌注比B赌注的安全性低很多--这在单一评价中不那么突出。在单项评价中造成对选项判断差异的特征 --受害者在错误的杂货店中的悲痛感和对奖品的锚定--在选项被联合评价时被抑制或不相关。系统1的情绪反应更有可能决定单一评价;在联合评价中发生的比较总是涉及 ,这是一个更谨慎和努力的评估,这需要系统2。
The preference reversal can be confirmed in a within-subject experiment, in which subjects set prices on both sets as part of a long list, and also choose between them. Participants are unaware of the inconsistency, and their reactions when confronted with it can be entertaining. A 1968 interview of a participant in the experiment, conducted by Sarah Lichtenstein, is an enduring classic of the field. The experimenter talks at length with a bewildered participant , who chooses one bet over another but is then willing to pay money to exchange the item he just chose for the one he just rejected, and goes through the cycle repeatedly.
这种偏好反转可以在被试内部实验中得到证实,在实验中,被试对两套产品都设定了价格,作为一个长长的清单的一部分,同时也在两套产品之间进行选择。参与者没有意识到这种不一致,而他们在面对这种不一致时的反应也是很有趣的。1968年对该实验参与者的采访, ,由萨拉-利希滕斯坦主持,是该领域的一个永恒的经典。实验者与一个 困惑的参与者 进行了长时间的交谈,他选择了一个赌注而不是另一个,但随后愿意花钱用他刚刚选择的物品交换他刚刚拒绝的物品,并反复进行循环。
Rational Econs would surely not be susceptible to preference reversals, and the phenomenon was therefore a challenge to the rational-agent model and to the economic theory that is built on this model. The challenge could have been ignored, but it was not. A few years after the preference reversals were reported, two respected economists, David Grether and Charles Plott, published an article in the prestigious American Economic Review , in which they reported their own studies of the phenomenon that Lichtenstein and Slovic had described. This was probably the first finding by experimental psychologists that ever attracted the attention of economists. The introductory paragraph of Grether and Plott’s article was unusually dramatic for a scholarly paper, and their intent was clear: “A body of data and theory has been developing within psychology which should be of interest to economists. Taken at face value the data are simply inconsistent with preference theory and have broad implications about research priorities within economics …. This paper reports the results of a series of experiments designed to discredit the psychologists’ works as applied to economics.”
理性的生态环境肯定不会受到偏好逆转的影响,因此, ,这是对理性代理人模型 ,以及建立在这个模型上的经济理论的挑战。这个挑战本可以被忽略,但它没有被忽略。在偏好逆转被报道的几年后,两位受人尊敬的经济学家大卫-格雷特和查尔斯-普洛特在 著名的《 美国经济评论》 上发表了一篇文章,其中他们报告了自己对利希滕斯坦和斯洛维奇所描述的 现象的研究。这可能是实验心理学家的第一个发现,引起了经济学家的注意。格雷特和普洛特的文章的导言段对于一篇学术论文来说异常引人注目,他们的意图也很明确:"心理学中正在发展一套数据和理论,经济学家应该对此感兴趣。 从表面上看,这些数据与偏好理论根本不一致,对经济学的研究重点有广泛的影响....。本文报告了一系列实验的结果,旨在否定心理学家的工作在经济学中的应用。"
Grether and Plott listed thirteen theories that could explain the original findings and reported carefully designed experiments that tested these theories. One of their hypotheses, which—needless to say—psychologists found patronizing, was that the results were due to the experiment being carried out by psychologists! Eventually, only one hypothesis was left standing: the psychologists were right. Grether and Plott acknowledged that this hypothesis is the least satisfactory from the point of view of standard preference theory, because “it allows individual choice to depend on the context in which the choices are made ”—a clear violation of the coherence doctrine.
格雷特和普洛特列出了十三种可以解释原始发现的理论,并报告了精心设计的 实验来检验这些理论。他们的假设之一,不用说,心理学家们觉得很光荣,那就是这些结果是由于实验是由心理学家进行的!最后,只剩下一个假设:心理学家是对的。最终,只剩下一个假设:心理学家是对的。格雷特和普洛特承认,从标准 偏好理论的角度来看,这个假说是最不令人满意的,因为 "它允许个人的选择取决于 做出选择的背景 "--明显违反了一致性原则。
You might think that this surprising outcome would cause much anguished soul-searching among economists, as a basic assumption of their theory had been successfully challenged. But this is not the way things work in social science, including both psychology and economics. Theoretical beliefs are robust, and it takes much more than one embarrassing finding for established theories to be seriously questioned. In fact, Grether and Plott’s admirably forthright report had little direct effect on the convictions of economists, probably including Grether and Plott. It contributed, however, to a greater willingness of the community of economists to take psychological research seriously and thereby greatly advanced the conversation across the boundaries of the disciplines.
你可能会认为这个令人惊讶的结果会在经济学家中引起许多痛苦的反省,因为他们理论的一个基本假设被成功地挑战了。但这不是社会科学的工作方式,包括 ,包括心理学和经济学。理论信念是强大的,要想让既定理论受到严重质疑,需要的不仅仅是 一个尴尬的发现 。事实上,格雷特和普洛特令人钦佩的直率报告对经济学家的信念没有什么直接影响,可能包括格雷特和普洛特。然而,它促成了经济学家群体更愿意认真对待心理学研究, ,从而大大推动了跨越学科界限的对话。
CATEGORIES
类别
“How tall is John?” If John is 5′ tall, your answer will depend on his age; he is very tall if he is 6 years old, very short if he is 16. Your System 1 automatically retrieves the relevant norm, and the meaning of the scale of tallness is adjusted automatically. You are also able to match intensities across categories and answer the question, “How expensive is a restaurant meal that matches John’s height?” Your answer will depend on John’s age: a much less expensive meal if he is 16 than if he is 6.
"约翰有多高?"如果约翰是5′高,你的答案将取决于他的年龄;如果他是6岁,他就很高,如果他是16岁,就很矮。你的系统1会自动 ,检索出相关的规范,身高尺度的含义也会自动调整。 你也能够在不同的类别中匹配强度,并回答问题:"符合约翰身高的餐厅饭菜有多贵?"你的答案将取决于约翰的年龄:如果他是16岁,那么这顿饭要比他6岁的时候便宜很多。
But now look at this:
但现在看看这个。
John is 6. He is 5′ tall.
约翰今年6岁,身高5英尺。
Jim is 16. He is 5′1″ tall.
吉姆今年16岁。他身高5′1″。
In single evaluations, everyone will agree that John is very tall and Jim is not, because they are compared to different norms. If you are asked a directly comparative question, “Is John as tall as Jim?” you will answer that he is not. There is no surprise here and little ambiguity. In other situations, however, the process by which objects and events recruit their own context of comparison can lead to incoherent choices on serious matters.
在单一评价中,每个人都会同意约翰很高,而吉姆 ,因为他们是与不同的标准相比较。如果你被问到一个直接比较的问题,"约翰和吉姆一样高吗?"你会回答说他不高。这里没有惊喜,也没有什么歧义。然而,在其他情况下,物体和事件招募自己的比较背景的过程可能会导致在严重问题上的不连贯的选择。
You should not form the impression that single and joint evaluations are always inconsistent, or that judgments are completely chaotic. Our world is broken into categories for which we have norms, such as six-year-old boys or tables. Judgments and preferences are coherent within categories but potentially incoherent when the objects that are evaluated belong to different categories. For an example, answer the following three questions:
你不应该形成这样的印象:单一评价和联合评价总是不一致的,或者判断是完全混乱的。 。我们的世界被分割成我们有规范的类别,如六岁的男孩或桌子。判断和偏好在类别内是连贯的,但当被评价的对象属于不同的类别时,就有可能不连贯了。以此为例,请回答以下三个 。
Which do you like more, apples or peaches?
你更喜欢哪个,苹果还是桃子?
Which do you like more, steak or stew?
你更喜欢哪一个,牛排还是炖肉?
Which do you like more, apples or steak?
你更喜欢哪个,苹果还是牛排?
The first and the second questions refer to items that belong to the same category, and you know immediately which you like more. Furthermore, you would have recovered the same ranking from single evaluation (“How much do you like apples?” and “How much do you like peaches?”) because apples and peaches both evoke fruit. There will be no preference reversal because different fruits are compared to the same norm and implicitly compared to each other in single as well as in joint evaluation. In contrast to the within-category questions, there is no stable answer for the comparison of apples and steak. Unlike apples and peaches, apples and steak are not natural substitutes and they do not fill the same need. You sometimes want steak and sometimes an apple, but you rarely say that either one will do just as well as the other.
第一个和第二个问题指的是属于同一类别的物品,你马上就知道你更喜欢哪个。此外,你会从单一评价中恢复相同的排名("你有多喜欢苹果?"和 "你有多喜欢桃子 ?"),因为苹果和桃子都会唤起水果。不会出现偏好逆转,因为不同的水果在单一评价以及联合评价中都是与同一规范相比较,并隐含地相互比较。与类别内的问题相比,苹果和牛排的比较没有稳定的答案。与苹果和桃子不同,苹果和牛排不是 自然的替代品,它们也不满足相同的需求。你有时想吃牛排,有时想吃苹果,但你很少说任何一个都能和另一个一样好。
Imagine receiving an e-mail from an organization that you generally trust, requesting a contribution to a cause:
想象一下 ,收到一封来自你普遍信任的组织的电子邮件,要求为某项事业捐款。
Dolphins in many breeding locations are threatened by pollution, which is expected to result in a decline of the dolphin population. A special fund supported by private contributions has been set up to provide pollution-free breeding locations for dolphins.
许多繁殖地的海豚受到了污染的威胁,预计这将导致海豚数量的下降 。一个由私人捐款支持的特别基金已经成立,为海豚提供无污染的繁殖地。
What associations did this question evoke? Whether or not you were fully aware of them, ideas and memories of related causes came to your mind. Projects intended to preserve endangered species were especially likely to be recalled. Evaluation on the GOOD–BAD dimension is an automatic operation of System 1, and you formed a crude impression of the ranking of the dolphin among the species that came to mind. The dolphin is much more charming than, say, ferrets, snails, or carp—it has a highly favorable rank in the set of species to which it is spontaneously compared.
这个问题唤起了什么联想?无论你是否完全意识到,你的脑海中都会出现相关事业的想法和记忆。旨在保护濒危物种的项目特别容易被回忆起来。在好坏维度上的评价 ,是系统1的自动操作,你对海豚在所想到的物种中的排名形成了一个粗略的印象。海豚比雪貂、蜗牛或鲤鱼更有魅力--它在与之自发比较的一系列物种中有着非常有利的排名。
The question you must answer is not whether you like dolphins more than carp; you have been asked to come up with a dollar value. Of course, you may know from the experience of previous solicitations that you never respond to requests of this kind. For a few minutes, imagine yourself as someone who does contribute to such appeals.
你必须回答的问题不是你是否喜欢海豚多于鲤鱼 ;你被要求拿出一个美元价值。当然,你可能从以前的募捐经验中知道,你从不回应这种要求。在几分钟内,想象自己是一个确实为这种呼吁作出贡献的人。
Like many other difficult questions, the assessment of dollar value can be solved by substitution and intensity matching. The dollar question is difficult, but an easier question is readily available. Because you like dolphins, you will probably feel that saving them is a good cause. The next step, which is also automatic, generates a dollar number by translating the intensity of your liking of dolphins onto a scale of contributions. You have a sense of your scale of previous contributions to environmental causes, which may differ from the scale of your contributions to politics or to the football team of your alma mater. You know what amount would be a “very large” contribution for you and what amounts are “large,” “modest,” and “small.” You also have scales for your attitude to species (from “like very much” to “not at all”). You are therefore able to translate your attitude onto the dollar scale, moving automatically from “like a lot” to “fairly large contribution” and from there to a number of dollars.
与其他许多难题一样,对美元价值的评估可以通过替换和强度匹配来解决。美元 ,这个问题很困难,但有一个更容易的问题是现成的。因为你喜欢海豚,你可能会觉得拯救海豚是一件好事。下一步,也是自动的,通过将你 喜欢海豚 的强度转化为捐款的规模,产生一个美元数字。你对自己以前对环境事业的贡献有一种感觉,这可能与你对政治或对你母校的足球队的贡献规模不同, 。你知道什么金额对你来说是 "非常大 "的贡献,什么金额是 "大"、"小 "和 "小"。你也有你对物种态度的标尺(从 "非常喜欢 "到 "根本不喜欢")。因此,你能够将你的态度转化为美元的刻度,从 "非常喜欢 "到 "相当大的贡献",再从那里到若干美元,自动移动 。
On another occasion, you are approached with a different appeal:
在另一个场合,有人向你提出了不同的呼吁。
Farmworkers, who are exposed to the sun for many hours, have a higher rate of skin cancer than the general population. Frequent medical check-ups can reduce the risk. A fund will be set up to support medical check-ups for threatened groups.
农民工由于长时间暴露在阳光下,其皮肤癌的发病率比一般人要高。经常进行体检可以 ,降低风险。将设立一个基金来支持受威胁群体的体检。
Is this an urgent problem? Which category did it evoke as a norm when you assessed urgency? If you automatically categorized the problem as a public-health issue, you probably found that the threat of skin cancer in farmworkers does not rank very high among these issues—almost certainly lower than the rank of dolphins among endangered species. As you translated your impression of the relative importance of the skin cancer issue into a dollar amount, you might well have come up with a smaller contribution than you offered to protect an endearing animal. In experiments, the dolphins attracted somewhat larger contributions in single evaluation than did the farmworkers.
这是一个紧急问题吗?当你评估紧迫性时,它唤起了哪一类的规范?如果你把这个问题自动归类为公共卫生问题,你可能会发现,农民工皮肤癌的威胁在这些问题中的排名并不高--几乎可以肯定比海豚在濒危物种中的排名低。当你把你对皮肤癌问题的相对 重要性的印象转化为美元数额时,你很可能得出的捐款比你为保护一种可爱的动物所提供的捐款要少。在实验中,海豚在单次评估中吸引的捐款比农场工人要多一些。
Next, consider the two causes in joint evaluation. Which of the two, dolphins or farmworkers, deserves a larger dollar contribution? Joint evaluation highlights a feature that was not noticeable in single evaluation but is recognized as decisive when detected: farmers are human, dolphins are not. You knew that, of course, but it was not relevant to the judgment that you made in single evaluation. The fact that dolphins are not human did not arise because all the issues that were activated in your memory shared that feature. The fact that farmworkers are human did not come to mind because all public-health issues involve humans. The narrow framing of single evaluation allowed dolphins to have a higher intensity score, leading to a high rate of contributions by intensity matching. Joint evaluation changes the representation of the issues: the “human vs. animal” feature becomes salient only when the two are seen together. In joint evaluation people show a solid preference for the farmworkers and a willingness to contribute substantially more to their welfare than to the protection of a likable nonhuman species. Here again, as in the cases of the bets and the burglary shooting, the judgments made in single and in joint evaluation will not be consistent.
接下来,在联合评估中考虑这两项事业。海豚和农民工这两者中,哪一个应该得到更大的美元 ?联合评价突出了一个在单一评价中没有注意到的特征,但在发现时被认为是决定性的:农民是人,海豚不是。你当然知道这一点,但它与你在单一评价中的判断无关。海豚不是人这一事实没有出现,因为在你的记忆中被激活的所有问题都共享 这一特征。农民工是人类这一事实没有出现,因为所有的公共卫生问题都涉及人类。单一评价的狭窄框架使海豚拥有更高的强度得分,导致强度匹配的高贡献率。联合评价改变了问题的表述:"人类与动物 "的特征只有在两者被 ,才会变得突出。在联合评价中,人们表现出对农民工的明确偏好,并愿意为他们的福利做出比保护一个可爱的非人类物种更多的贡献。在这里,就像在打赌和入室抢劫的案件中一样,在单一评价和联合评价中做出的判断不会一致。
Christopher Hsee, of the University of Chicago, has contributed the following example of preference reversal, among many others of the same type. The objects to be evaluated are secondhand music dictionaries.
芝加哥大学的Christopher Hsee, ,在许多其他同类型的例子中,贡献了下面这个偏好逆转的例子。要评估的对象是二手音乐词典。
Dictionary A
词典A
Dictionary B
词典B
Year of publication
出版年份
1993
1993
1993
1993
Number of entries
条目数
10,000
10,000
20,000
20,000
Condition
状况
Like new
像新的一样
Cover torn, otherwise like new
封面破损,其他方面如新
When the dictionaries are presented in single evaluation, dictionary A is valued more highly, but of course the preference changes in joint evaluation. The result illustrates Hsee’s evaluability hypothesis : The number of entries is given no weight in single evaluation, because the numbers are not “evaluable” on their own. In joint evaluation, in contrast, it is immediately obvious that dictionary B is superior on this attribute , and it is also apparent that the number of entries is far more important than the condition of the cover.
当词典在单一评价中呈现时,词典A的价值更 ,但在联合评价中,偏好当然会改变。这个结果说明了Hsee的 可评价性假说 。在单项评价中,词条的数量没有被赋予任何权重,因为这些数字本身是不可评价的。相反,在联合评价中,很明显,词典B 在这一属性上更胜一筹 ,而且也很明显,条目数 ,远比封面的条件更重要。
UNJUST REVERSALS
不公正的逆转
There is good reason to believe that the administration of justice is infected by predictable incoherence in several domains. The evidence is drawn in part from experiments, including studies of mock juries, and in part from observation of patterns in legislation, regulation, and litigation.
我们有充分的理由相信,在一些领域,司法行政受到了可预测的不一致的影响。这些证据部分来自实验,包括对模拟陪审团的研究,部分来自对立法、监管和诉讼模式的观察。
In one experiment, mock jurors recruited from jury rolls in Texas were asked to assess punitive damages in several civil cases. The cases came in pairs, each consisting of one claim for physical injury and one for financial loss. The mock jurors first assessed one of the scenarios and then they were shown the case with which it was paired and were asked to compare the two. The following are summaries of one pair of cases:
在一项实验中,模拟陪审员 ,他们被要求评估几个民事案件中的惩罚性赔偿。这些案件是成对出现的,每个案件包括一个身体伤害的索赔和一个经济损失的索赔。模拟陪审员首先评估其中一个场景,然后向他们展示与之配对的案件,并要求他们对这两个案件进行比较。以下是一对案件的摘要。
Case 1: A child suffered moderate burns when his pajamas caught fire as he was playing with matches. The firm that produced the pajamas had not made them adequately fire resistant.
案例1:一个孩子在玩火柴时,他的睡衣着火了,造成中度烧伤。生产该睡衣的公司没有对其进行充分的防火处理。
Case 2: The unscrupulous dealings of a bank caused another bank a loss of $10 million.
案例2:一家银行的不正当交易使另一家银行损失了1000万美元。
Half of the participants judged case 1 first (in single evaluation) before comparing the two cases in joint evaluation. The sequence was reversed for the other participants. In single evaluation, the jurors awarded higher punitive damages to the defrauded bank than to the burned child, presumably because the size of the financial loss provided a high anchor.
一半的参与者先判断案例1(在单一评价中),然后在联合评价中对两个案例进行比较。其他参与者的顺序 ,而其他参与者则相反。在单一评价中,陪审员对被骗银行的惩罚性赔偿比对被烧伤儿童的惩罚性赔偿高,可能是因为经济损失的大小提供了一个高的锚。
When the cases were considered together, however, sympathy for the individual victim prevailed over the anchoring effect and the jurors increased the award to the child to surpass the award to the bank. Averaging over several such pairs of cases, awards to victims of personal injury were more than twice as large in joint than in single evaluation. The jurors who saw the case of the burned child on its own made an offer that matched the intensity of their feelings. They could not anticipate that the award to the child would appear inadequate in the context of a large award to a financial institution. In joint evaluation, the punitive award to the bank remained anchored on the loss it had sustained, but the award to the burned child increased, reflecting the outrage evoked by negligence that causes injury to a child.
然而,当这些案件被放在一起考虑时,对单个受害者的同情战胜了锚定效应,陪审员增加了对儿童的赔偿 ,超过了对银行的赔偿。在几对这样的案件中平均来看,对人身伤害受害者的赔偿金在联合评估中是单一评估的两倍以上。 ,陪审员们看到被烧伤的孩子的案件本身就提出了一个与他们的感情强度相匹配的提议。他们无法预料,在 金融机构的巨额赔偿的情况下,对孩子的赔偿会显得不足。在联合评估中,对银行的惩罚性赔偿仍以其所遭受的损失为基础,但对被烧伤儿童的赔偿却增加了,这反映了因过失导致儿童受伤而引起的愤慨。
As we have seen, rationality is generally served by broader and more comprehensive frames, and joint evaluation is obviously broader than single evaluation. Of course, you should be wary of joint evaluation when someone who controls what you see has a vested interest in what you choose. Salespeople quickly learn that manipulation of the context in which customers see a good can profoundly influence preferences. Except for such cases of deliberate manipulation, there is a presumption that the comparative judgment, which necessarily involves System 2, is more likely to be stable than single evaluations, which often reflect the intensity of emotional responses of System 1. We would expect that any institution that wishes to elicit thoughtful judgments would seek to provide the judges with a broad context for the assessments of individual cases. I was surprised to learn from Cass Sunstein that jurors who are to assess punitive damages are explicitly prohibited from considering other cases. The legal system, contrary to psychological common sense, favors single evaluation.
正如我们所看到的,理性通常由更广泛和更全面的框架来服务,而联合评价显然比单一评价更广泛 。当然,当控制你所看到的东西的人对你的选择有既得利益时,你应该对联合评价保持警惕。销售人员很快就会了解到,操纵客户看到商品的环境可以深刻地影响偏好。 除了这种故意操纵的情况,有一种推测是,比较判断(必然涉及到系统2)比单一评价更可能是稳定的,后者往往反映了系统1的情绪反应的强度。我们期望,任何希望引起深思熟虑的判断的机构都会设法为法官提供一个广泛的背景,以便对个别案件进行评估。我很惊讶地从Cass Sunstein那里得知,要评估惩罚性 ,明确禁止陪审员考虑其他案件。与心理学常识相反,法律体系倾向于单一评估。
In another study of incoherence in the legal system, Sunstein compared the administrative punishments that can be imposed by different U.S. government agencies including the Occupational Safety and Health Administration and the Environmental Protection Agency. He concluded that “within categories, penalties seem extremely sensible, at least in the sense that the more serious harms are punished more severely. For occupational safety and health violations, the largest penalties are for repeated violations, the next largest for violations that are both willful and serious, and the least serious for failures to engage in the requisite record-keeping .” It should not surprise you, however, that the size of penalties varied greatly across agencies, in a manner that reflected politics and history more than any global concern for fairness. The fine for a “serious violation” of the regulations concerning worker safety is capped at $7,000, while a violation of the Wild Bird Conservation Act can result in a fine of up to $25,000. The fines are sensible in the context of other penalties set by each agency, but they appear odd when compared to each other. As in the other examples in this chapter, you can see the absurdity only when the two cases are viewed together in a broad frame. The system of administrative penalties is coherent within agencies but incoherent globally.
在另一项关于法律体系不连贯性的研究中,孙斯坦比较了不同的美国政府机构可以实施的行政处罚,包括职业安全与健康管理局和环境保护 机构。他的结论是:"在类别内,惩罚似乎是非常合理的,至少在更严重的伤害被惩罚的意义上是如此。对于职业安全和健康的违法行为,最大的惩罚是重复的违法行为,其次是故意和严重的违法行为,而最不严重的是没有进行 必要的记录 "。然而,你不应该感到惊讶的是,各机构的处罚规模差异很大,这种方式反映了政治和历史,而不是对公平的任何全球关注。对 "严重违反 "有关工人安全的规定的罚款上限为7,000美元,而违反《野生鸟类保护法》可导致高达25,000美元的罚款。从每个机构制定的其他处罚措施来看,这些罚款是合理的 ,但它们相互比较起来就显得很奇怪。正如本章中的其他例子一样,只有在 一个广泛的框架中一起看这两个案例时,你才能看到荒谬之处。行政处罚制度在机构内部是连贯的,但在全球范围内是不连贯的。
SPEAKING OF REVERSALS
说到反转
“The BTU units meant nothing to me until I saw how much air-conditioning units vary. Joint evaluation was essential.”
"BTU单位对我来说毫无意义,直到我看到 ,空调设备的差异有多大。联合评估是必不可少的"。
“You say this was an outstanding speech because you compared it to her other speeches. Compared to others, she was still inferior.”
"你说这是一次出色的演讲,因为你把它与她的其他演讲相比。与其他人相比,她还是逊色的。"
“It is often the case that when you broaden the frame, you reach more reasonable decisions.”
"通常的情况是,当你扩大框架时,你会达成更合理的决定。"
“When you see cases in isolation, you are likely to be guided by an emotional reaction of System 1.”
"当你孤立地看待案例时,你很可能被系统 1的情绪反应所引导。"
34
34
Frames and Reality
框架与现实
Italy and France competed in the 2006 final of the World Cup. The next two sentences both describe the outcome: “Italy won.” “France lost.” Do those statements have the same meaning? The answer depends entirely on what you mean by meaning .
意大利和法国在2006年的世界杯决赛中展开角逐。接下来的两个句子都描述了结果。"意大利赢了"。"法国输了"。这些语句有相同的含义吗?答案完全取决于 ,你的 意思 是什么。
For the purpose of logical reasoning, the two descriptions of the outcome of the match are interchangeable because they designate the same state of the world. As philosophers say, their truth conditions are identical: if one of these sentences is true, then the other is true as well. This is how Econs understand things. Their beliefs and preferences are reality-bound. In particular, the objects of their choices are states of the world, which are not affected by the words chosen to describe them.
就逻辑推理而言,对比赛结果的两种描述是可以互换的,因为它们指定的是世界的同一状态。正如哲学家所说,它们的真理条件是相同的:如果这些句子中的一个是真的,那么另一个也是真的。这就是Econs理解事物的方式。他们的信念和偏好是受现实约束的。 特别是,他们选择的对象是世界的状态,不受选择来描述它们的词语的影响。
There is another sense of meaning , in which “Italy won” and “France lost” do not have the same meaning at all. In this sense, the meaning of a sentence is what happens in your associative machinery while you understand it. The two sentences evoke markedly different associations. “Italy won” evokes thoughts of the Italian team and what it did to win. “France lost” evokes thoughts of the French team and what it did that caused it to lose, including the memorable head butt of an Italian player by the French star Zidane. In terms of the associations they bring to mind—how System 1 reacts to them—the two sentences really “mean” different things. The fact that logically equivalent statements evoke different reactions makes it impossible for Humans to be as reliably rational as Econs.
还有另一种 意义 ,在这种意义上,"意大利赢了 "和 "法国输了 "根本就没有相同的意义。在这个意义上,一个句子的意义是你在理解它时在你的联想机器中发生的事情。这两个句子唤起了明显不同的 联想。"意大利赢了 "唤起了人们对意大利队的思考,以及它是如何赢得比赛的。而 "法国队输了 "则让人想到法国队和它所做的导致它输掉比赛的事情,包括法国球星齐达内对一名意大利球员令人难忘的头部撞击。就它们所带来的联想而言--系统1对它们的反应--这两个句子确实 "意味着 "不同的事情。逻辑上 相等的语句唤起了不同的反应,这一事实使得人类不可能像生态人那样可靠地理性。
EMOTIONAL FRAMING
情感框架
Amos and I applied the label of framing effects to the unjustified influences of formulation on beliefs and preferences. This is one of the examples we used:
阿莫斯和我将 "框架效应 "这一标签用于 表述 对信念和偏好的 不合理影响 。这是我们使用的例子之一。
Would you accept a gamble that offers a 10% chance to win $95 and a 90% chance to lose $5?
你会接受一个有10%的机会赢95美元,90%的机会输5美元的赌博吗?
Would you pay $5 to participate in a lottery that offers a 10% chance to win $100 and a 90% chance to win nothing?
你会 ,支付5美元来参加一个有10%的机会赢得100美元,90%的机会什么都没有的彩票吗?
First, take a moment to convince yourself that the two problems are identical. In both of them you must decide whether to accept an uncertain prospect that will leave you either richer by $95 or poorer by $5. Someone whose preferences are reality-bound would give the same answer to both questions, but such individuals are rare. In fact, one version attracts many more positive answers: the second. A bad outcome is much more acceptable if it is framed as the cost of a lottery ticket that did not win than if it is simply described as losing a gamble. We should not be surprised: losses evokes stronger negative feelings than costs . Choices are not reality-bound because System 1 is not reality-bound.
首先,花点时间让自己相信,这两个问题是相同的。在这两个问题中,你必须决定是否接受一个不确定的前景,这个前景将使你要么更富有95美元,要么更贫穷5美元。一个偏好受现实约束的人将对这两个问题给出相同的答案, ,但这样的人很少。事实上,一个版本吸引了更多积极的答案:第二个版本。如果把坏结果说成是一张没有中奖的彩票的成本,要比把它简单地说成是输掉一场赌博更容易被接受。我们不应该感到惊讶: 损失 比 成本 更能唤起人们的负面情绪。选择不受现实的约束,因为系统1不受现实的约束。
The problem we constructed was influenced by what we had learned from Richard Thaler, who told us that when he was a graduate student he had pinned on his board a card that said COSTS ARE NOT LOSSES . In his early essay on consumer behavior, Thaler described the debate about whether gas stations would be allowed to charge different prices for purchases paid with cash or on credit . The credit-card lobby pushed hard to make differential pricing illegal, but it had a fallback position: the difference, if allowed, would be labeled a cash discount, not a credit surcharge. Their psychology was sound: people will more readily forgo a discount than pay a surcharge. The two may be economically equivalent, but they are not emotionally equivalent.
我们构建的问题受到了我们从理查德-塞勒那里学到的东西的影响,他告诉我们,当他还是一个研究生的时候,他在他的黑板上钉了一张卡片,上面写着 成本不是损失 。在他早期关于消费者行为的文章中,塞勒描述了关于是否允许加油站 对用现金或信用卡支付 的消费收取不同价格的辩论。信用卡 游说团极力要求将差别定价定为非法,但他们有一个后备立场:如果允许,这种差别将被标记为现金折扣,而不是信贷附加费。他们的心理学是正确的:人们会更乐意放弃折扣而不是支付附加费。两者在经济上可能是对等的,但在情感上是不对等的。
In an elegant experiment, a team of neuroscientists at University College London combined a study of framing effects with recordings of activity in different areas of the brain. In order to provide reliable measures of the brain response, the experiment consisted of many trials. Figure 14 illustrates the two stages of one of these trials.
在一个优雅的实验中,伦敦大学学院的一个神经科学家团队 ,将框架效应的研究与大脑不同区域的活动记录相结合。为了提供对大脑反应的可靠测量,该实验包括许多试验。 图14 说明了其中一个试验的两个阶段。
First, the subject is asked to imagine that she received an amount of money, in this example £50.
首先,受试者被要求想象她收到一笔钱,在这个例子中是50英镑。
The subject is then asked to choose between a sure outcome and a gamble on a wheel of chance. If the wheel stops on white she “receives” the entire amount; if it stops on black she gets nothing. The sure outcome is simply the expected value of the gamble, in this case a gain of £20.
然后要求受试者 ,在肯定的结果和 ,在机会之轮上的赌博之间做出选择。如果转盘停在白色上,她就 "得到 "全部金额;如果转盘停在黑色上,她就什么也得不到。确定的结果只是赌博的预期值,在这种情况下是20英镑的收益。

Figure 14
图14
As shown, the same sure outcome can be framed in two different ways: as KEEP £20 or as LOSE £30. The objective outcomes are precisely identical in the two frames, and a reality-bound Econ would respond to both in the same way—selecting either the sure thing or the gamble regardless of the frame—but we already know that the Human mind is not bound to reality. Tendencies to approach or avoid are evoked by the words, and we expect System 1 to be biased in favor of the sure option when it is designated as KEEP and against that same option when it is designated as LOSE.
如图所示,同一个确定的结果可以用两种不同的方式来框定:作为KEEP 20英镑或作为LOSE 30英镑。在这两种框架中,客观结果是完全相同的,一个受现实约束的经济学人将以同样的方式对这两种结果做出反应--无论框架如何,都会选择肯定的事情或赌博--但我们已经知道,人类的思想并不受现实约束。接近 或回避的倾向是由文字唤起的,我们期望系统1在被指定为KEEP时偏向于肯定的选项,在被指定为LOSE时偏向于同样的选项。
The experiment consisted of many trials, and each participant encountered several choice problems in both the KEEP and the LOSE frames. As expected, every one of the 20 subjects showed a framing effect: they were more likely to choose the sure thing in the KEEP frame and more likely to accept the gamble in the LOSE frame. But the subjects were not all alike. Some were highly susceptible to the framing of the problem. Others mostly made the same choice regardless of the frame—as a reality-bound individual should do. The authors ranked the 20 subjects accordingly and gave the ranking a striking label: the rationality index.
实验由许多次试验组成,每个被试都在KEEP和LOSE框架中遇到了几个选择问题。正如预期的那样,20名受试者中的每一个人都表现出了框架效应:他们 ,更有可能在KEEP框架中选择确定的东西,而在LOSE框架中更有可能接受赌博。但这些受试者并不都是一样的。有些人非常容易受到问题框架的影响。而其他被试者无论在哪种框架下都会做出同样的选择--就像一个受现实约束的人应该做的那样。作者对这20名受试者进行了相应的排名,并给这个排名贴上了一个醒目的标签: 理性指数。
The activity of the brain was recorded as the subjects made each decision. Later, the trials were separated into two categories:
当受试者做出每个决定时,大脑的活动被记录下来。后来,这些试验被分成两类。
Trials on which the subject’s choice conformed to the frame
被试的选择符合框架要求的试验
preferred the sure thing in the KEEP version
更喜欢 "保留 "版本中的 "肯定"。
preferred the gamble in the LOSS version
在 "输 "的版本中更倾向于 "赌"。
Trials in which the choice did not conform to the frame.
选择不符合框架的试验。
The remarkable results illustrate the potential of the new discipline of neuroeconomics—the study of what a person’s brain does while he makes decisions. Neuroscientists have run thousands of such experiments, and they have learned to expect particular regions of the brain to “light up”—indicating increased flow of oxygen, which suggests heightened neural activity—depending on the nature of the task. Different regions are active when the individual attends to a visual object, imagines kicking a ball, recognizes a face, or thinks of a house. Other regions light up when the individual is emotionally aroused, is in conflict, or concentrates on solving a problem. Although neuroscientists carefully avoid the language of “this part of the brain does such and such …,” they have learned a great deal about the “personalities” of different brain regions, and the contribution of analyses of brain activity to psychological interpretation has greatly improved. The framing study yielded three main findings:
这些显著的结果说明了神经经济学这门新学科的潜力--研究一个人在做决定时大脑做什么。神经科学家们已经进行了数千次这样的实验,他们已经学会了期望大脑的特定区域 "亮起来"--表明氧气流量增加,这表明神经活动增强--取决于任务的性质。当 ,个人关注一个视觉物体、想象踢球、识别脸部或想到房子时,不同的区域会活跃起来。当个体情绪激动、处于冲突中或集中精力解决一个问题时,其他区域就会亮起来。尽管神经科学家们小心翼翼地避免使用 "大脑的这个部分做这样那样的事情...... "的语言,但他们已经学到了很多关于不同 大脑区域的 "个性",而且大脑活动分析对心理学解释的贡献已经大大改善。该框架研究产生了三个主要的发现。
A region that is commonly associated with emotional arousal (the amygdala) was most likely to be active when subjects’ choices conformed to the frame. This is just as we would expect if the emotionally loaded words KEEP and LOSE produce an immediate tendency to approach the sure thing (when it is framed as a gain) or avoid it (when it is framed as a loss). The amygdala is accessed very rapidly by emotional stimuli—and it is a likely suspect for involvement in System 1.
当受试者的选择符合框架时,一个通常与情绪唤醒有关的区域(杏仁核)最有可能被激活。这就像我们所期望的那样,如果充满情感的词汇KEEP和LOSE ,就会立即产生接近肯定的东西(当它被框定为收益)或避免它(当它被框定为损失)的倾向。杏仁核会被情绪刺激迅速进入--它可能是参与系统1的嫌疑人。
A brain region known to be associated with conflict and self-control (the anterior cingulate) was more active when subjects did not do what comes naturally—when they chose the sure thing in spite of its being labeled LOSE. Resisting the inclination of System 1 apparently involves conflict.
一个已知与冲突和自我控制有关的脑区(前扣带)在受试者不做自然的事情时更加活跃--当 ,他们选择了肯定的事情,尽管它被标记为LOSE。抵制系统1的倾向显然涉及冲突。
The most “rational” subjects—those who were the least susceptible to framing effects—showed enhanced activity in a frontal area of the brain that is implicated in combining emotion and reasoning to guide decisions. Remarkably, the “rational” individuals were not those who showed the strongest neural evidence of conflict. It appears that these elite participants were (often, not always) reality-bound with little conflict.
最 "理性 "的受试者--那些最不容易受到框架效应影响的人--显示出大脑前额区域的活动增强,该区域与结合情感和推理来指导决策有关。值得注意的是,这些 "理性 "的人并不是那些 ,他们表现出最强烈的冲突的神经证据。似乎这些精英参与者(经常,而不是总是)被现实所束缚,很少有冲突。
By joining observations of actual choices with a mapping of neural activity, this study provides a good illustration of how the emotion evoked by a word can “leak” into the final choice.
通过将对实际选择的观察与神经活动的映射结合起来,这项研究很好地说明了一个词所唤起的情绪是如何 "泄漏 "到最终选择中的。
An experiment that Amos carried out with colleagues at Harvard Medical School is the classic example of emotional framing. Physician participants were given statistics about the outcomes of two treatments for lung cancer: surgery and radiation. The five-year survival rates clearly favor surgery, but in the short term surgery is riskier than radiation. Half the participants read statistics about survival rates, the others received the same information in terms of mortality rates. The two descriptions of the short-term outcomes of surgery were:
阿莫斯与哈佛大学医学院的同事 ,进行的一项实验是情感框架的典型例子。参与实验的医生得到了关于两种肺癌治疗方法结果的统计数据:手术和放射。五年生存率显然有利于手术,但在短期内,手术比放疗更有风险。一半的参与者阅读了有关生存率的统计数字,其他的参与者则收到了相同的信息, ,即死亡率。关于手术的短期结果的两个描述是:。
The one-month survival rate is 90%.
一个月的存活率为90%。
There is 10% mortality in the first month.
第一个月有10%的死亡率。
You already know the results: surgery was much more popular in the former frame (84% of physicians chose it) than in the latter (where 50% favored radiation). The logical equivalence of the two descriptions is transparent, and a reality-bound decision maker would make the same choice regardless of which version she saw. But System 1, as we have gotten to know it, is rarely indifferent to emotional words: mortality is bad, survival is good, and 90% survival sounds encouraging whereas 10% mortality is frightening . An important finding of the study is that physicians were just as susceptible to the framing effect as medically unsophisticated people (hospital patients and graduate students in a business school). Medical training is, evidently, no defense against the power of framing.
你已经知道了结果:在前一个框架中,手术要比后一个框架中更受欢迎(84%的医生选择了手术)(50%的医生倾向于放射)。两种描述的逻辑等同性是透明的, ,一个受现实约束的决策者无论看到哪个版本都会做出相同的选择。但是系统1,正如我们所了解的那样,很少对情绪化的词语无动于衷:死亡是坏事,生存是好事,90%的生存听起来令人鼓舞,而 10%的死亡则令人恐惧 。该研究的一个重要发现是,医生和 ,在医学上不成熟的人(医院病人和商学院的研究生)一样容易受到框架效应的影响。显然,医学培训并不能抵御框架效应的力量。
The KEEP–LOSE study and the survival–mortality experiment differed in one important respect. The participants in the brain-imaging study had many trials in which they encountered the different frames. They had an opportunity to recognize the distracting effects of the frames and to simplify their task by adopting a common frame, perhaps by translating the LOSE amount into its KEEP equivalent. It would take an intelligent person (and an alert System 2) to learn to do this, and the few participants who managed the feat were probably among the “rational” agents that the experimenters identified. In contrast, the physicians who read the statistics about the two therapies in the survival frame had no reason to suspect that they would have made a different choice if they had heard the same statistics framed in terms of mortality. Reframing is effortful and System 2 is normally lazy. Unless there is an obvious reason to do otherwise, most of us passively accept decision problems as they are framed and therefore rarely have an opportunity to discover the extent to which our preferences are frame-bound rather than reality-bound .
KEEP-LOSE研究和生存-死亡实验在一个重要方面有所不同。在脑成像研究中,参与者有很多试验,他们遇到了不同的框架。他们有机会 ,认识到这些框架的干扰作用,并通过采用一个共同的框架来简化他们的任务,也许是通过将LOSE的数量转换为KEEP的等价物。这需要一个聪明的人(和一个警觉的系统2)来学习这样做,而少数能够完成这一壮举的参与者可能是实验者确定的 "理性 "代理人之一。相比之下,那些在生存框架中阅读关于两种疗法的统计数据的医生 ,没有理由怀疑,如果他们听到以死亡率为框架的相同统计数据,他们会做出不同的选择。重新构思是费力的,系统2通常是懒惰的。除非有明显的理由,否则我们大多数人都会被动地接受决策问题的框架,因此很少有机会 ,以发现我们的偏好在多大程度上是 受框架约束 而不是 受现实约束 。
EMPTY INTUITIONS
空 直觉
Amos and I introduced our discussion of framing by an example that has become known as the “Asian disease problem” :
阿莫斯和我通过一个被称为 "亚洲疾病问题 " 的例子来介绍我们关于框架的讨论 。
Imagine that the United States is preparing for the outbreak of an unusual Asian disease, which is expected to kill 600 people. Two alternative programs to combat the disease have been proposed. Assume that the exact scientific estimates of the consequences of the programs are as follows:
想象一下,美国正在为一种不寻常的亚洲疾病的爆发做准备,这种疾病预计将导致600人死亡。已经提出了两个备选方案 ,以防治该疾病。假设对这些方案的后果的确切科学估计如下。
If program A is adopted, 200 people will be saved.
如果方案A被采纳,将有200人获救。
If program B is adopted, there is a one-third probability that 600 people will be saved and a two-thirds probability that no people will be saved.
如果采用方案B,有三分之一的概率会有600人获救,三分之二的概率不会有人获救。
A substantial majority of respondents choose program A: they prefer the certain option over the gamble.
相当多的受访者选择了方案A:他们 ,比起赌博,他们更愿意选择一定的方案。
The outcomes of the programs are framed differently in a second version:
在第二个版本中,方案的结果被赋予了不同的框架。
If program A′ is adopted, 400 people will die.
如果方案A′被采纳,将有400人死亡。
If program B′ is adopted, there is a one-third probability that nobody will die and a two-thirds probability that 600 people will die.
如果采用B′方案,有三分之一的概率无人死亡,三分之二的概率有600人死亡。
Look closely and compare the two versions: the consequences of programs A and A′ are identical; so are the consequences of programs B and B′. In the second frame, however, a large majority of people choose the gamble.
仔细观察并比较这两个版本:方案A和A′的后果是相同的; ,方案B和B′的后果也是如此。然而,在第二个框架中,绝大多数人都选择了赌博。
The different choices in the two frames fit prospect theory, in which choices between gambles and sure things are resolved differently, depending on whether the outcomes are good or bad. Decision makers tend to prefer the sure thing over the gamble (they are risk averse) when the outcomes are good. They tend to reject the sure thing and accept the gamble (they are risk seeking) when both outcomes are negative. These conclusions were well established for choices about gambles and sure things in the domain of money. The disease problem shows that the same rule applies when the outcomes are measured in lives saved or lost. In this context, as well, the framing experiment reveals that risk-averse and risk-seeking preferences are not reality-bound. Preferences between the same objective outcomes reverse with different formulations.
两个框架中的不同选择符合前景理论,其中赌博和确定的事情之间的选择以不同方式解决,取决于结果是好是坏。当 ,结果是好的,决策者倾向于选择肯定的东西而不是赌博(他们是风险厌恶者)。当两种结果都是负面的时候,他们倾向于拒绝肯定的事情而接受赌博(他们是寻求风险的)。这些结论对于金钱领域中关于赌博和确定的事情的选择来说是很成熟的。疾病问题表明,当结果是以拯救或失去的生命来衡量时,同样的规则也适用。在这种情况下,框架实验 ,揭示了规避风险和寻求风险的偏好并不受现实约束。相同的客观结果之间的偏好在不同的表述中会发生逆转。
An experience that Amos shared with me adds a grim note to the story. Amos was invited to give a speech to a group of public-health professionals—the people who make decisions about vaccines and other programs. He took the opportunity to present them with the Asian disease problem: half saw the “lives-saved” version, the others answered the “lives-lost” question. Like other people, these professionals were susceptible to the framing effects. It is somewhat worrying that the officials who make decisions that affect everyone’s health can be swayed by such a superficial manipulation—but we must get used to the idea that even important decisions are influenced, if not governed, by System 1.
阿莫斯与我分享的一段经历为这个故事增添了一个严峻的注解。 阿莫斯应邀为一群公共卫生专业人员--对疫苗和其他项目作出决定的人--作了一次演讲。他利用这个机会 ,向他们展示了亚洲疾病问题:一半人看到的是 "拯救生命 "的版本,其他人回答的是 "失去生命 "的问题。像其他人一样,这些专业人士很容易受到框架效应的影响。令人有些担心的是,做出影响每个人健康的决定的官员会被这种肤浅的操纵所左右--但我们必须习惯于这样的想法,即即使是重要的 决定也会受到系统1的影响,如果不是被支配。
Even more troubling is what happens when people are confronted with their inconsistency: “You chose to save 200 lives for sure in one formulation and you chose to gamble rather than accept 400 deaths in the other. Now that you know these choices were inconsistent, how do you decide?” The answer is usually embarrassed silence. The intuitions that determined the original choice came from System 1 and had no more moral basis than did the preference for keeping £20 or the aversion to losing £30. Saving lives with certainty is good, deaths are bad. Most people find that their System 2 has no moral intuitions of its own to answer the question.
更令人不安的是,当人们面对他们的不一致时,会发生什么。"在一种表述中,你选择肯定地拯救200条生命,而在另一种表述中,你选择赌博而不是接受400条死亡。现在你知道这些选择是不一致的,你如何决定?"答案通常是尴尬的沉默。决定最初选择的直觉 ,来自于系统1,与对保留20英镑的偏好或对失去30英镑的厌恶相比,没有更多的道德基础。有把握地拯救生命是好事,死亡是坏事。大多数人发现,他们的系统2没有自己的道德直觉来回答这个问题。
I am grateful to the great economist Thomas Schelling for my favorite example of a framing effect, which he described in his book Choice and Consequence . Schelling’s book was written before our work on framing was published, and framing was not his main concern. He reported on his experience teaching a class at the Kennedy School at Harvard, in which the topic was child exemptions in the tax code. Schelling told his students that a standard exemption is allowed for each child, and that the amount of the exemption is independent of the taxpayer’s income. He asked their opinion of the following proposition:
我很感谢伟大的经济学家托马斯-谢林(Thomas Schelling)提供了我最喜欢的框架效应的例子, ,他在他的《 选择与后果》 一书中描述了这种效应。谢林的书是在我们关于框架的工作发表之前写的,而且框架不是他的主要关注点。他报告了他在哈佛大学肯尼迪学院教课的经历,主题是税法中的儿童豁免。谢林告诉他的学生,每个孩子都有一个标准的免税额,而且 ,免税额与纳税人的收入无关。他问他们对以下命题的看法。
Should the child exemption be larger for the rich than for the poor?
富人的儿童免税额应该比穷人的大吗?
Your own intuitions are very likely the same as those of Schelling’s students: they found the idea of favoring the rich by a larger exemption completely unacceptable.
你自己的直觉很可能与谢林的学生一样:他们发现通过更大的豁免权来偏袒富人的想法是完全不可接受的。
Schelling then pointed out that the tax law is arbitrary. It assumes a childless family as the default case and reduces the tax by the amount of the exemption for each child. The tax law could of course be rewritten with another default case: a family with two children. In this formulation, families with fewer than the default number of children would pay a surcharge. Schelling now asked his students to report their view of another proposition:
Schelling随后指出,税收 法律是任意的。它假定一个没有孩子的家庭为默认情况,并按每个孩子的免税额减少税收。当然,税法可以用另一种默认情况来重写:有两个孩子的家庭。在这种表述中, ,孩子少于默认数量的家庭将支付附加费。Schelling现在要求他的学生报告他们对另一个 命题的看法。
Should the childless poor pay as large a surcharge as the childless rich?
无子女的穷人是否应该像无子女的富人一样支付大量的附加费?
Here again you probably agree with the students’ reaction to this idea, which they rejected with as much vehemence as the first. But Schelling showed his class that they could not logically reject both proposals. Set the two formulations next to each other. The difference between the tax due by a childless family and by a family with two children is described as a reduction of tax in the first version and as an increase in the second. If in the first version you want the poor to receive the same (or greater) benefit as the rich for having children, then you must want the poor to pay at least the same penalty as the rich for being childless.
在这里,你可能又会同意学生们对这个想法的反应,他们和第一个想法一样激烈地拒绝。但是谢林向他的同学们表明,他们不可能从逻辑上拒绝这两个提议。把这两个提法放在一起。一个没有孩子的家庭 ,和一个有两个孩子的家庭所应缴纳的税款之间的差异,在第一个版本中被描述为减税,在第二个版本中被描述为增税。如果在第一个版本中,你希望穷人因有孩子而获得与富人相同(或更大)的利益,那么你一定希望穷人因无孩子而至少支付与富人相同的惩罚。
We can recognize System 1 at work. It delivers an immediate response to any question about rich and poor: when in doubt, favor the poor. The surprising aspect of Schelling’s problem is that this apparently simple moral rule does not work reliably. It generates contradictory answers to the same problem, depending on how that problem is framed. And of course you already know the question that comes next. Now that you have seen that your reactions to the problem are influenced by the frame, what is your answer to the question: How should the tax code treat the children of the rich and the poor?
我们可以认识到系统1在工作。它对任何关于富人和穷人的问题都会立即做出 :在有疑问的时候,偏向穷人。谢林问题的令人惊讶之处在于,这个看似简单的道德规则并没有可靠地发挥作用。它对同一个问题产生了相互矛盾的答案,这取决于这个问题是如何提出的。当然,你已经知道接下来的问题了。现在你已经看到,你对 问题的反应受到框架的影响,那么你对这个问题的答案是什么。税法应该如何对待富人和穷人的孩子?
Here again, you will probably find yourself dumbfounded. You have moral intuitions about differences between the rich and the poor, but these intuitions depend on an arbitrary reference point, and they are not about the real problem. This problem—the question about actual states of the world—is how much tax individual families should pay, how to fill the cells in the matrix of the tax code. You have no compelling moral intuitions to guide you in solving that problem. Your moral feelings are attached to frames, to descriptions of reality rather than to reality itself. The message about the nature of framing is stark: framing should not be viewed as an intervention that masks or distorts an underlying preference. At least in this instance—and also in the problems of the Asian disease and of surgery versus radiation for lung cancer—there is no underlying preference that is masked or distorted by the frame. Our preferences are about framed problems, and our moral intuitions are about descriptions, not about substance.
在这里,你可能又会发现自己哑然失笑。你对富人和穷人之间的差异有道德上的直觉,但这些直觉取决于一个任意的参考点,而且它们不是关于真正的问题。这个问题--关于世界实际状态的问题 --是个人家庭应该交多少税,如何填补税法矩阵中的单元格。你没有令人信服的道德直觉来指导你解决这个问题。你的道德感觉依附于框架,依附于对现实的描述,而不是依附于现实本身。关于框架性质的信息是严酷的:框架不应该被视为 ,因为它是一种掩盖或扭曲潜在偏好的干预。至少在这个例子中,以及在亚洲疾病和肺癌手术与放射治疗的问题中,没有任何潜在的偏好被框架所掩盖或扭曲。我们的偏好是关于框架内的问题,而我们的道德直觉是关于描述的,而不是关于实质。
GOOD FRAMES
好的框架
Not all frames are equal, and some frames are clearly better than alternative ways to describe (or to think about) the same thing. Consider the following pair of problems:
不是所有的 框架都是平等的,有些框架显然比描述(或思考)同一事物的其他方式更好。请考虑以下一对问题。
A woman has bought two $80 tickets to the theater. When she arrives at the theater, she opens her wallet and discovers that the tickets are missing. Will she buy two more tickets to see the play?
一个女人买了两张80美元的戏票。当她到达剧院时,她打开钱包,发现门票不见了。她会再买两张票去看戏吗?
A woman goes to the theater, intending to buy two tickets that cost $80 each. She arrives at the theater, opens her wallet, and discovers to her dismay that the $160 with which she was going to make the purchase is missing. She could use her credit card. Will she buy the tickets?
一个女人去剧院,打算 ,买两张各80美元的票。她到了剧院,打开钱包,惊奇地发现,她准备购买的160美元不见了。她可以用她的信用卡。她会买票吗?
Respondents who see only one version of this problem reach different conclusions, depending on the frame. Most believe that the woman in the first story will go home without seeing the show if she has lost tickets, and most believe that she will charge tickets for the show if she has lost money.
只看到这个问题的一个版本的受访者,根据框架的不同,得出不同的结论。大多数人认为,第一个故事中的女人 ,如果她丢了票,就会不看演出就回家,而大多数人认为,如果她丢了钱,就会收票看演出。
The explanation should already be familiar—this problem involves mental accounting and the sunk-cost fallacy. The different frames evoke different mental accounts, and the significance of the loss depends on the account to which it is posted. When tickets to a particular show are lost, it is natural to post them to the account associated with that play. The cost appears to have doubled and may now be more than the experience is worth. In contrast, a loss of cash is charged to a “general revenue” account—the theater patron is slightly poorer than she had thought she was, and the question she is likely to ask herself is whether the small reduction in her disposable wealth will change her decision about paying for tickets. Most respondents thought it would not.
这个解释应该已经很熟悉了--这个问题涉及心理会计和沉没成本谬误。不同的框架唤起了不同的心理账户,而损失的意义则取决于它被记入的账户。当 某场演出的门票丢失时,很自然地将其记入与该剧有关的账户。成本似乎增加了一倍,现在可能超过了体验的价值。与此相反,现金的损失被记入 "一般收入 "账户--剧院的顾客比她想象的要稍微贫穷一些,她可能会问自己的问题是,可支配财富的小幅减少 ,是否会改变她支付门票的决定。大多数受访者认为这不会。
The version in which cash was lost leads to more reasonable decisions. It is a better frame because the loss, even if tickets were lost, is “sunk,” and sunk costs should be ignored. History is irrelevant and the only issue that matters is the set of options the theater patron has now, and their likely consequences. Whatever she lost, the relevant fact is that she is less wealthy than she was before she opened her wallet. If the person who lost tickets were to ask for my advice, this is what I would say: “Would you have bought tickets if you had lost the equivalent amount of cash? If yes, go ahead and buy new ones.” Broader frames and inclusive accounts generally lead to more rational decisions.
丢失现金的版本会导致更合理的决定。这是一个更好的框架,因为损失,即使是门票损失,也是 "沉没 "的,而沉没成本应该被忽略。历史是不相关的,唯一重要的问题是剧院顾客现在拥有的一系列选择 ,以及它们可能的后果。无论她失去了什么,相关的事实是,她的财富比她打开钱包之前要少。如果那个丢了票的人要征求我的意见,我会这样说。"如果你丢了同等数量的现金,你还会买票吗?如果会,就去 ,买新的吧。"更广泛的框架和包容性的说法通常会导致 更理性的决定。
In the next example, two alternative frames evoke different mathematical intuitions, and one is much superior to the other. In an article titled “The MPG Illusion,” which appeared in Science magazine in 2008, the psychologists Richard Larrick and Jack Soll identified a case in which passive acceptance of a misleading frame has substantial costs and serious policy consequences. Most car buyers list gas mileage as one of the factors that determine their choice; they know that high-mileage cars have lower operating costs. But the frame that has traditionally been used in the United States—miles per gallon—provides very poor guidance to the decisions of both individuals and policy makers. Consider two car owners who seek to reduce their costs:
在下一个例子中,两个替代框架唤起了不同的数学直觉,而其中一个比另一个要好得多。心理学家理查德-拉里克和杰克-索尔在2008年发表在《 科学 》杂志上的一篇题为 "MPG幻觉 "的文章中,指出了一个案例,即被动接受一个 误导性的框架 会带来巨大的成本和严重的政策后果。 大多数汽车购买者将汽油里程列为决定他们选择的因素之一;他们知道高里程的汽车有较低的运营成本。但美国传统上使用的框架--每加仑英里数--对个人和政策制定者的决定提供了非常糟糕的指导。考虑一下两位寻求减少成本的车主。
Adam switches from a gas-guzzler of 12 mpg to a slightly less voracious guzzler that runs at 14 mpg.
亚当从12mpg的油耗子 ,换成了稍微不那么贪婪的油耗子,运行速度为14mpg。
The environmentally virtuous Beth switches from a 30 mpg car to one that runs at 40 mpg.
具有环境美德的贝丝从一辆30英里的汽车换成一辆40英里的汽车。
Suppose both drivers travel equal distances over a year. Who will save more gas by switching? You almost certainly share the widespread intuition that Beth’s action is more significant than Adam’s: she reduced mpg by 10 miles rather than 2, and by a third (from 30 to 40) rather than a sixth (from 12 to 14). Now engage your System 2 and work it out. If the two car owners both drive 10,000 miles, Adam will reduce his consumption from a scandalous 833 gallons to a still shocking 714 gallons, for a saving of 119 gallons. Beth’s use of fuel will drop from 333 gallons to 250, saving only 83 gallons. The mpg frame is wrong, and it should be replaced by the gallons-per-mile frame (or liters-per–100 kilometers, which is used in most other countries). As Larrick and Soll point out, the misleading intuitions fostered by the mpg frame are likely to mislead policy makers as well as car buyers.
假设两位司机在一年内的旅行距离相等。谁会因为换车而节省更多的汽油?你几乎可以肯定的是,人们普遍认为贝丝的行动比亚当的行动更重要:她减少了10英里而不是 2,减少了三分之一(从30到40)而不是六分之一(从12到14)。现在启动你的系统2并计算一下。如果这两位车主都开了10,000英里,亚当将把他的消耗量从可怕的833加仑减少到仍然令人震惊的714加仑,节省119加仑。贝丝的用油量将从333加仑降至250加仑,仅节省83加仑。mpg框架是错误的,它应该被每英里加仑框架(或每100公里升数,在大多数其他国家使用)取代 。正如Larrick和Soll所指出的,mpg框架所产生的误导性直觉很可能会误导政策制定者和汽车买家。
Under President Obama, Cass Sunstein served as administrator of the Office of Information and Regulatory Affairs. With Richard Thaler, Sunstein coauthored Nudge , which is the basic manual for applying behavioral economics to policy. It was no accident that the “fuel economy and environment” sticker that will be displayed on every new car starting in 2013 will for the first time in the United States include the gallons-per-mile information. Unfortunately, the correct formulation will be in small print, along with the more familiar mpg information in large print, but the move is in the right direction. The five-year interval between the publication of “The MPG Illusion” and the implementation of a partial correction is probably a speed record for a significant application of psychological science to public policy.
在奥巴马总统时期,卡斯-桑斯坦曾担任信息和监管事务办公室的主任。与理查德-塞勒(Richard Thaler )合作,孙斯坦 ,该书是将行为经济学应用于政策的基本手册。从2013年开始,每辆新车都会贴上 "燃油经济性和环境 "的标签,这并非偶然,在美国,这将首次包括每英里加仑的信息。 不幸的是,正确的表述将是小字,与更熟悉的大字的mpg 信息一起,但此举的方向是正确的。从 "MPG错觉 "的发表到部分更正的实施之间的五年间隔,可能是心理学科学在公共政策中的重要应用的速度记录。
A directive about organ donation in case of accidental death is noted on an individual’s driver license in many countries. The formulation of that directive is another case in which one frame is clearly superior to the other. Few people would argue that the decision of whether or not to donate one’s organs is unimportant, but there is strong evidence that most people make their choice thoughtlessly. The evidence comes from a comparison of the rate of organ donation in European countries , which reveals startling differences between neighboring and culturally similar countries. An article published in 2003 noted that the rate of organ donation was close to 100% in Austria but only 12% in Germany, 86% in Sweden but only 4% in Denmark.
在许多 国家的个人驾驶执照上都注明了关于意外死亡时器官捐赠的指令。该指令的制定是另一个框架明显优于其他框架的案例。很少有人会认为是否捐献器官的决定是不重要的,但有强有力的证据表明,大多数人的选择是不经意的。这些证据来自于对 欧洲国家器官捐赠率 的比较,它揭示了相邻的、文化相似的国家之间惊人的 。2003年发表的一篇文章指出,奥地利的器官捐赠率接近100%,但在德国只有12%,瑞典有86%,但丹麦只有4%。
These enormous differences are a framing effect, which is caused by the format of the critical question. The high-donation countries have an opt-out form, where individuals who wish not to donate must check an appropriate box. Unless they take this simple action, they are considered willing donors. The low-contribution countries have an opt-in form: you must check a box to become a donor. That is all. The best single predictor of whether or not people will donate their organs is the designation of the default option that will be adopted without having to check a box.
这些巨大的差异是一种框架效应,它是由关键问题的格式造成的。高捐赠率国家有一个选择退出的表格,希望不捐赠的个人 ,必须勾选一个适当的方框。除非他们采取这个简单的行动,否则他们被认为是自愿捐赠的。低捐款国家有一个选择进入的表格:你必须勾选一个方框才能成为捐款人。这就是全部。预测人们是否会捐献器官的最佳单一因素是指定默认选项,无需勾选方框就能采用。
Unlike other framing effects that have been traced to features of System 1, the organ donation effect is best explained by the laziness of System 2. People will check the box if they have already decided what they wish to do. If they are unprepared for the question, they have to make the effort of thinking whether they want to check the box. I imagine an organ donation form in which people are required to solve a mathematical problem in the box that corresponds to their decision. One of the boxes contains the problem 2 + 2 = ? The problem in the other box is 13 × 37 = ? The rate of donations would surely be swayed.
与其他被追溯到系统1特征的框架效应不同,器官捐赠效应最好用系统2的懒惰来解释。如果人们已经决定了他们想做什么,就会勾选这个方框。如果他们对这个问题毫无准备,他们就必须努力思考是否要勾选这个方框。我想象一张器官捐赠表格,其中人们被要求在与他们的决定相对应的盒子里解决一个数学问题, 。其中一个盒子里的问题是2+2=?另一个盒子里的问题是13×37=?捐赠率肯定会被动摇。
When the role of formulation is acknowledged, a policy question arises: Which formulation should be adopted? In this case, the answer is straightforward. If you believe that a large supply of donated organs is good for society, you will not be neutral between a formulation that yields almost 100% donations and another formulation that elicits donations from 4% of drivers.
当配方的作用被承认时,就会出现一个政策问题。应该采用哪种提法?在这种情况下,答案是直截了当的。如果你相信 ,大量的器官捐献对社会有益,那么你就不会在一个能产生几乎100%捐献的表述和另一个能引起4%的司机捐献的表述之间保持中立。
As we have seen again and again, an important choice is controlled by an utterly inconsequential feature of the situation. This is embarrassing—it is not how we would wish to make important decisions. Furthermore, it is not how we experience the workings of our mind, but the evidence for these cognitive illusions is undeniable.
正如 ,我们一再看到,一个重要的选择被情况中一个完全不重要的特征所控制。这很令人尴尬--这不是我们希望做出重要 决定的方式。此外,这也不是我们体验心灵运作的方式,但这些认知幻觉的证据是不可否认的。
Count that as a point against the rational-agent theory. A theory that is worthy of the name asserts that certain events are impossible—they will not happen if the theory is true. When an “impossible” event is observed, the theory is falsified. Theories can survive for a long time after conclusive evidence falsifies them, and the rational-agent model certainly survived the evidence we have seen, and much other evidence as well.
这也算是对理性代理理论的一种反对。一个名副其实的理论断言,某些事件是不可能的--如果理论是真的,它们就不会发生。当一个 "不可能 "的事件被观察到时,该理论就被证伪了。理论在被确凿的证据证伪后,可以 ,而理性-代理人模型在我们所看到的证据以及许多其他证据面前肯定是幸存的。
The case of organ donation shows that the debate about human rationality can have a large effect in the real world. A significant difference between believers in the rational-agent model and the skeptics who question it is that the believers simply take it for granted that the formulation of a choice cannot determine preferences on significant problems. They will not even be interested in investigating the problem—and so we are often left with inferior outcomes.
器官捐赠的案例表明,关于人类理性的辩论在现实世界中会产生很大的影响。理性代理模式的信奉者和质疑它的怀疑论者之间的一个重要区别是 ,信奉者只是想当然地认为,选择的表述不能决定对重大问题的偏好。他们甚至不会有兴趣去调查这个问题--因此,我们往往只能得到劣质的结果。
Skeptics about rationality are not surprised. They are trained to be sensitive to the power of inconsequential factors as determinants of preference—my hope is that readers of this book have acquired this sensitivity.
对合理性持怀疑态度的人并不感到惊讶。他们被训练成对无关紧要的因素作为偏好决定因素的力量很敏感--我 ,希望这本书的读者能获得这种敏感性。
SPEAKING OF FRAMES AND REALITY
说到框架和现实
“They will feel better about what happened if they manage to frame the outcome in terms of how much money they kept rather than how much they lost.”
"如果他们设法将结果定格在他们保留了多少钱而不是失去了多少钱,他们会对发生的事情感觉更好。"
“Let’s reframe the problem by changing the reference point. Imagine we did not own it; how much would we think it is worth?”
"让我们通过改变参考点来重塑问题。假设我们不拥有它;我们会认为它值多少钱?"
“Charge the loss to your mental account of ‘general revenue’—you will feel better!”
"把损失记在你的 "一般收入 "的心理账户上--你会感觉好些!"
“They ask you to check the box to opt out of their mailing list. Their list would shrink if they asked you to check a box to opt in!”
"他们要求你勾选退出他们的邮件列表的方框。如果他们要求你打勾选择加入,他们的名单就会缩减!"
35
35
Two Selves
两个自我
The term utility has had two distinct meanings in its long history. Jeremy Bentham opened his Introduction to the Principles of Morals and Legislation with the famous sentence “Nature has placed mankind under the governance of two sovereign masters, pain and pleasure . It is for them alone to point out what we ought to do, as well as to determine what we shall do.” In an awkward footnote, Bentham apologized for applying the word utility to these experiences, saying that he had been unable to find a better word. To distinguish Bentham’s interpretation of the term, I will call it experienced utility .
在其漫长的历史中, 效用 一词有两种不同的含义。杰里米-边沁在他的《 道德与立法原则》导言 中,以一句著名的话作为开篇:"大自然把人类置于两个主宰的管理之下 ,即 痛苦 和 快乐 。只有他们才能指出我们应该做什么,以及决定我们应该做什么"。在一个尴尬的脚注中,边沁为将 效用 一词用于这些经验而道歉,说他无法找到一个更好的词。为了区别于边沁对这个词的解释,我将把它称为 经验效用 。
For the last 100 years, economists have used the same word to mean something else. As economists and decision theorists apply the term, it means “wantability” —and I have called it decision utility . Expected utility theory, for example, is entirely about the rules of rationality that should govern decision utilities; it has nothing at all to say about hedonic experiences. Of course, the two concepts of utility will coincide if people want what they will enjoy, and enjoy what they chose for themselves—and this assumption of coincidence is implicit in the general idea that economic agents are rational. Rational agents are expected to know their tastes, both present and future, and they are supposed to make good decisions that will maximize these interests.
在过去的100 年里,经济学家们用同一个词来表示别的意思。正如经济学家和决策理论家所应用的术语,它的意思是 "可望性"-- 我称之为 决策效用 。例如,预期效用理论完全是关于应该支配决策效用的理性规则;它对享乐体验完全没有说法。当然,如果人们想要他们将享受的东西,并享受他们为自己选择的东西,那么这两个效用概念 ,就会重合--这种重合的假设隐含在经济主体是理性的一般想法中。理性的代理人应该知道他们的品味,包括现在和未来,他们应该做出良好的决定,使这些利益最大化。
EXPERIENCED UTILITY
有经验的公用事业
My fascination with the possible discrepancies between experienced utility and decision utility goes back a long way. While Amos and I were still working on prospect theory, I formulated a puzzle, which went like this: imagine an individual who receives one painful injection every day. There is no adaptation; the pain is the same day to day. Will people attach the same value to reducing the number of planned injections from 20 to 18 as from 6 to 4? Is there any justification for a distinction?
我对 经验效用和决策效用之间可能存在的差异的迷恋可以追溯到很早以前。当阿莫斯和我还在研究前景理论时,我提出了一个难题,它是这样的:想象一个人每天接受一次痛苦的注射。没有任何适应;疼痛每天都是一样的。人们会不会对把计划中的注射次数从 20次减少到18次与从6次减少到4次赋予同样的价值?有什么理由进行区分吗?
I did not collect data, because the outcome was evident. You can verify for yourself that you would pay more to reduce the number of injections by a third (from 6 to 4) than by one tenth (from 20 to 18). The decision utility of avoiding two injections is higher in the first case than in the second, and everyone will pay more for the first reduction than for the second. But this difference is absurd. If the pain does not change from day to day, what could justify assigning different utilities to a reduction of the total amount of pain by two injections, depending on the number of previous injections? In the terms we would use today, the puzzle introduced the idea that experienced utility could be measured by the number of injections. It also suggested that, at least in some cases, experienced utility is the criterion by which a decision should be assessed. A decision maker who pays different amounts to achieve the same gain of experienced utility (or be spared the same loss) is making a mistake. You may find this observation obvious, but in decision theory the only basis for judging that a decision is wrong is inconsistency with other preferences. Amos and I discussed the problem but we did not pursue it. Many years later, I returned to it.
我没有收集数据,因为结果是显而易见的。你可以自己验证,减少三分之一的注射次数(从6次到4次)比减少十分之一的注射次数(从20次到18次)付出更多。在第一种情况下,避免两次注射的决策效用高于第二种情况,而且每个人都会为第一次减少支付比第二次更多的 。但这种差异是荒谬的。如果疼痛不会每天都发生变化,那么有什么理由根据以前的注射次数,对减少两次疼痛的总量分配不同的效用呢?用我们今天使用的术语来说,这个谜题提出了这样一个观点:经验的效用可以通过注射的次数来衡量 。它还表明,至少在某些情况下,经验效用是评估一个决定的标准。一个决策者如果为获得相同的经验效用(或避免相同的损失)而支付不同的金额,那就是犯了一个错误。你可能觉得这个观察很明显,但在决策理论中,判断一个决定是错误的唯一依据 ,就是与其他偏好的不一致。阿莫斯和我讨论了这个问题,但我们并没有继续讨论。许多年后,我又回到了这个问题。
EXPERIENCE AND MEMORY
经验和记忆
How can experienced utility be measured? How should we answer questions such as “How much pain did Helen suffer during the medical procedure?” or “How much enjoyment did she get from her 20 minutes on the beach?” The British economist Francis Edgeworth speculated about this topic in the nineteenth century and proposed the idea of a “hedonimeter,” an imaginary instrument analogous to the devices used in weather-recording stations, which would measure the level of pleasure or pain that an individual experiences at any moment .
如何衡量经验效用?我们应该如何回答这样的问题:"海伦在医疗过程中遭受了多少痛苦?"或者 "她从海滩上的20分钟中获得了多少享受?"英国经济学家Francis Edgeworth在19世纪推测了这个问题,并提出了 "享乐计 "的想法,这是一种类似于天气记录站所使用的设备的想象中的仪器,它可以测量个人 在任何时刻 所经历的快乐或痛苦程度。
Experienced utility would vary, much as daily temperature or barometric pressure do, and the results would be plotted as a function of time. The answer to the question of how much pain or pleasure Helen experienced during her medical procedure or vacation would be the “area under the curve.” Time plays a critical role in Edgeworth’s conception. If Helen stays on the beach for 40 minutes instead of 20, and her enjoyment remains as intense, then the total experienced utility of that episode doubles, just as doubling the number of injections makes a course of injections twice as bad. This was Edgeworth’s theory, and we now have a precise understanding of the conditions under which his theory holds .
经历的效用会有变化,就像每天的温度或气压一样,结果会被 ,作为时间的函数。 对于海伦在医疗过程或度假期间经历了多少痛苦或快乐的问题,答案将是 "曲线下的面积"。时间在埃奇沃斯的构思中起着关键作用。如果海伦在海滩上呆了40分钟而不是20分钟,而她的享受仍然那么强烈,那么那段经历的总效用就会增加一倍,就像注射次数增加一倍会使一个疗程的注射效果增加一倍一样, 。这就是埃奇沃斯的理论,我们现在对 他的理论成立的 条件有了精确的理解。
The graphs in figure 15 show profiles of the experiences of two patients undergoing a painful colonoscopy, drawn from a study that Don Redelmeier and I designed together. Redelmeier, a physician and researcher at the University of Toronto, carried it out in the early 1990s. This procedure is now routinely administered with an anesthetic as well as an amnesic drug, but these drugs were not as widespread when our data were collected. The patients were prompted every 60 seconds to indicate the level of pain they experienced at the moment. The data shown are on a scale where zero is “no pain at all” and 10 is “intolerable pain.” As you can see, the experience of each patient varied considerably during the procedure, which lasted 8 minutes for patient A and 24 minutes for patient B (the last reading of zero pain was recorded after the end of the procedure). A total of 154 patients participated in the experiment; the shortest procedure lasted 4 minutes, the longest 69 minutes.
图15 中的图表显示了两名接受痛苦的结肠镜检查的病人的经历,这是从唐-雷德尔梅尔和我共同设计的一项研究中得出的。Redelmeier是多伦多大学的 医生和研究员 ,他在20世纪90年代初进行了这项研究。这种手术现在常规使用麻醉剂和失忆药,但在我们收集数据时,这些药物还没有那么广泛地使用。每隔60秒就会提示病人指出他们此刻的疼痛程度。显示的数据是在一个量表上,零是 "完全没有疼痛 ",10是 "无法忍受的疼痛"。正如你所看到的,每个病人在手术过程中的体验差异很大,病人A持续了8分钟,病人B持续了24分钟(最后一次读数为零疼痛是在手术结束后记录的)。共有154名病人参加了实验;最短的手术持续了4分钟,最长的是69分钟。
Next, consider an easy question: Assuming that the two patients used the scale of pain similarly, which patient suffered more? No contest. There is general agreement that patient B had the worse time. Patient B spent at least as much time as patient A at any level of pain, and the “area under the curve” is clearly larger for B than for A. The key factor, of course, is that B’s procedure lasted much longer. I will call the measures based on reports of momentary pain hedonimeter totals.
接下来,考虑 一个简单的问题。假设这两个病人对疼痛量表的使用相似,哪个病人的痛苦更大?没有争议。人们普遍认为,病人B的情况更糟糕。患者B在任何程度的疼痛中所花费的时间至少与患者A一样多,而且B的 "曲线下面积 "明显大于A。当然,关键因素是B的 程序持续的时间更长。我 ,将把基于瞬间疼痛报告的衡量标准称为享乐指数总量。

Figure 15
图15
When the procedure was over, all participants were asked to rate “the total amount of pain” they had experienced during the procedure. The wording was intended to encourage them to think of the integral of the pain they had reported, reproducing the hedonimeter totals. Surprisingly, the patients did nothing of the kind. The statistical analysis revealed two findings, which illustrate a pattern we have observed in other experiments:
当手术结束后,所有参与者都被要求对他们在手术中经历的 "疼痛总量 "进行评价。这样的措辞是为了鼓励他们思考他们所报告的疼痛的整体性,重现享乐计的总量。令人惊讶的是,病人没有做任何类似的事情。统计分析显示了两个结果,它们说明了 我们在其他实验中观察到的模式。
Peak-end rule: The global retrospective rating was well predicted by the average of the level of pain reported at the worst moment of the experience and at its end.
峰值结束规则。全球回顾性评价是由经历中最糟糕的时刻和结束时报告的疼痛程度的平均值来预测的。
Duration neglect: The duration of the procedure had no effect whatsoever on the ratings of total pain.
持续时间被忽视。手术的持续时间对总疼痛的评分没有任何影响。
You can now apply these rules to the profiles of patients A and B. The worst rating (8 on the 10-point scale) was the same for both patients, but the last rating before the end of the procedure was 7 for patient A and only 1 for patient B. The peak-end average was therefore 7.5 for patient A and only 4.5 for patient B. As expected, patient A retained a much worse memory of the episode than patient B. It was the bad luck of patient A that the procedure ended at a bad moment, leaving him with an unpleasant memory.
现在你可以把这些规则应用到病人A和B的档案中。两个病人的最差 (10分制的8分)是一样的,但是在手术结束前的最后一次评分,病人A是7分,而病人B只有1分。因此,病人A的高峰期平均数是7.5分,而病人B只有4.5分。正如所料,病人A比病人B保留了更糟糕的记忆。这是病人A的厄运,手术在一个糟糕的时刻结束, ,给他留下不愉快的回忆。
We now have an embarrassment of riches: two measures of experienced utility—the hedonimeter total and the retrospective assessment—that are systematically different. The hedonimeter totals are computed by an observer from an individual’s report of the experience of moments. We call these judgments duration-weighted, because the computation of the “area under the curve” assigns equal weights to all moments: two minutes of pain at level 9 is twice as bad as one minute at the same level of pain. However, the findings of this experiment and others show that the retrospective assessments are insensitive to duration and weight two singular moments, the peak and the end, much more than others. So which should matter? What should the physician do? The choice has implications for medical practice. We noted that:
我们现在有了一个财富的尴尬:对经验效用的两种测量方法--享乐计总和和回顾性评估--是系统性的不同。享乐计总数是由观察者从个人对瞬间体验的报告中计算出来的。我们称这些判断为持续时间加权,因为 " 曲线下的面积 "的计算对所有的时刻都赋予了同等的权重:在第9级的两分钟的疼痛是相同级别的一分钟的疼痛的两倍。然而,这个实验和其他实验的结果表明,回顾性评估对持续时间不敏感,而且对两个单一的时刻,即峰值和终点的权重比其他时刻高得多。那么,哪些应该是重要的?医生应该怎么做?这个选择 ,对医疗实践有影响。我们注意到,。
If the objective is to reduce patients’ memory of pain, lowering the peak intensity of pain could be more important than minimizing the duration of the procedure. By the same reasoning, gradual relief may be preferable to abrupt relief if patients retain a better memory when the pain at the end of the procedure is relatively mild.
如果目标是减少病人对疼痛的记忆,那么降低疼痛的峰值强度可能比尽量缩短手术的时间更重要。根据同样的推理,如果病人在手术结束时的疼痛相对较轻,那么逐渐缓解可能比突然缓解更可取。
If the objective is to reduce the amount of pain actually experienced, conducting the procedure swiftly may be appropriate even if doing so increases the peak pain intensity and leaves patients with an awful memory.
如果 , ,目的是减少实际经历的疼痛量,那么迅速进行手术可能是合适的,即使这样做会增加疼痛的峰值强度,给病人留下可怕的记忆。
Which of the two objectives did you find most compelling? I have not conducted a proper survey, but my impression is that a strong majority will come down in favor of reducing the memory of pain. I find it helpful to think of this dilemma as a conflict of interests between two selves (which do not correspond to the two familiar systems). The experiencing self is the one that answers the question: “Does it hurt now?” The remembering self is the one that answers the question: “How was it, on the whole?” Memories are all we get to keep from our experience of living, and the only perspective that we can adopt as we think about our lives is therefore that of the remembering self.
在这两个目标中,你认为哪一个最令人信服?我没有进行过适当的调查,但我的印象是,大多数人都会支持减少疼痛的记忆 。我发现把这种困境看作是两个自我(与两个熟悉的系统 并不 对应)之间的利益冲突是有帮助的。 体验的自我 是回答问题的那个。"现在疼吗?" 记忆中的自我 是回答这个问题的人。"总的来说,情况如何?"记忆是我们从生活经验中得到的所有保留,因此,我们在思考我们的生活时,唯一可以采用的 ,就是记忆中的自我。
A comment I heard from a member of the audience after a lecture illustrates the difficulty of distinguishing memories from experiences. He told of listening raptly to a long symphony on a disc that was scratched near the end, producing a shocking sound, and he reported that the bad ending “ruined the whole experience.” But the experience was not actually ruined, only the memory of it. The experiencing self had had an experience that was almost entirely good, and the bad end could not undo it, because it had already happened. My questioner had assigned the entire episode a failing grade because it had ended very badly, but that grade effectively ignored 40 minutes of musical bliss. Does the actual experience count for nothing?
我在一次讲座后听到一位听众的评论,说明了区分记忆和经验的困难。他说,他全神贯注地听着一张光盘上的长篇交响乐,该光盘在接近尾声时被划伤,产生了令人震惊的声音,他报告说,这个糟糕的结尾 "毁掉了 "整个经历。但实际上体验并没有被毁掉,只是对它的记忆。体验者的自我经历几乎完全是好的,糟糕的结局无法挽回它,因为它已经发生了。我的提问者给整个情节打了一个不及格的分数,因为它的结局非常糟糕,但这个分数实际上忽略了40分钟的音乐快乐。 ,实际的体验是否算不上什么?
Confusing experience with the memory of it is a compelling cognitive illusion—and it is the substitution that makes us believe a past experience can be ruined. The experiencing self does not have a voice. The remembering self is sometimes wrong, but it is the one that keeps score and governs what we learn from living, and it is the one that makes decisions. What we learn from the past is to maximize the qualities of our future memories, not necessarily of our future experience. This is the tyranny of the remembering self.
将经验与对它的记忆混为一谈,是一种引人注目的认知错觉--正是这种替代使我们相信过去的经验可以被毁掉。体验中的自我没有发言权。记忆中的自我有时是错误的,但它是记分的人,支配着我们从生活中学到的东西,它是做决定的人。 我们从过去学到的东西是为了最大限度地提高我们未来记忆的质量,而不一定是我们未来经验的质量。这就是记忆中的自我的暴政。
WHICH SELF SHOULD COUNT?
哪个自己应该算?
To demonstrate the decision-making power of the remembering self, my colleagues and I designed an experiment, using a mild form of torture that I will call the cold-hand situation (its ugly technical name is cold-pressor). Participants are asked to hold their hand up to the wrist in painfully cold water until they are invited to remove it and are offered a warm towel. The subjects in our experiment used their free hand to control arrows on a keyboard to provide a continuous record of the pain they were enduring, a direct communication from their experiencing self. We chose a temperature that caused moderate but tolerable pain: the volunteer participants were of course free to remove their hand at any time, but none chose to do so.
为了证明记忆中的自我的决策能力,我和我的同事设计了一个实验,使用一种温和的酷刑形式,我将其称为冷手情况(其丑陋的技术名称 是冷压器)。 参与者被要求将他们的手握在痛苦的冷水中,直到他们被邀请将手拿开,并被提供一条温暖的毛巾。在我们的实验中,受试者用空闲的手来控制键盘上的箭头,以连续记录他们所承受的痛苦,这是他们体验自我的直接交流。我们选择了一个能引起 中度但可以忍受的疼痛的温度:当然,志愿参与者可以在任何时候自由地移开他们的手,但没有人选择这样做。
Each participant endured two cold-hand episodes:
每个参与者都经受了两次冷手的考验。
The short episode consisted of 60 seconds of immersion in water at 14° Celsius, which is experienced as painfully cold, but not intolerable. At the end of the 60 seconds, the experimenter instructed the participant to remove his hand from the water and offered a warm towel.
短暂的插曲包括在14摄氏度的水中浸泡60秒,这被认为是痛苦的寒冷,但不是不能忍受的。在60秒结束时,实验者指示被试者 ,将他的手从水中拿出来,并提供一条温暖的毛巾。
The long episode lasted 90 seconds. Its first 60 seconds were identical to the short episode. The experimenter said nothing at all at the end of the 60 seconds. Instead he opened a valve that allowed slightly warmer water to flow into the tub. During the additional 30 seconds, the temperature of the water rose by roughly 1°, just enough for most subjects to detect a slight decrease in the intensity of pain.
长集持续了90秒。它的前60秒与短情节相同。实验者在这60秒结束时什么都没说。相反,他打开了一个阀门,让稍暖的水流入浴缸。在额外的30秒内,水温大约上升了1°,这对大多数受试者来说刚好够 ,以检测疼痛强度的轻微下降。
Our participants were told that they would have three cold-hand trials, but in fact they experienced only the short and the long episodes, each with a different hand. The trials were separated by seven minutes. Seven minutes after the second trial, the participants were given a choice about the third trial. They were told that one of their experiences would be repeated exactly, and were free to choose whether to repeat the experience they had had with their left hand or with their right hand. Of course, half the participants had the short trial with the left hand, half with the right; half had the short trial first, half began with the long, etc. This was a carefully controlled experiment.
我们的参与者被告知他们将有三次冷手试验,但事实上他们只经历了短时和长时的发作,每次都是用不同的手。这些试验相隔7分钟。在第二次试验的七分钟后,被试者被给予关于第三次试验的选择。他们被告知 ,他们的一个经历将被完全重复,并且可以 自由选择 是用左手还是用右手重复他们的经历。当然,一半的参与者用左手进行短的试验,一半用右手;一半先进行短的试验,一半从长的开始,等等。这是一个精心控制的实验。
The experiment was designed to create a conflict between the interests of the experiencing and the remembering selves, and also between experienced utility and decision utility. From the perspective of the experiencing self, the long trial was obviously worse. We expected the remembering self to have another opinion. The peak-end rule predicts a worse memory for the short than for the long trial, and duration neglect predicts that the difference between 90 seconds and 60 seconds of pain will be ignored. We therefore predicted that the participants would have a more favorable (or less unfavorable) memory of the long trial and choose to repeat it. They did. Fully 80% of the participants who reported that their pain diminished during the final phase of the longer episode opted to repeat it, thereby declaring themselves willing to suffer 30 seconds of needless pain in the anticipated third trial.
实验的设计 ,在体验自我和记忆自我的利益之间产生冲突,也在体验效用和决策效用之间产生冲突。从体验自我的角度来看,长期试验显然更糟糕。我们预期记忆中的自我会有另一种看法。峰值结束法则预示着对短的试验的记忆比对长的试验的记忆要差,而持续时间忽视 ,预示着90秒和60秒的疼痛之间的差异会被忽略。因此,我们预测参与者会对长的试验有更有利的(或 不太不利的)记忆,并选择重复它。他们做到了。足足有80%的参与者报告说他们的疼痛在较长事件的最后阶段有所减轻,他们选择重复这一事件,从而宣布 ,他们愿意在预期的第三次试验中承受30秒的无谓疼痛。
The subjects who preferred the long episode were not masochists and did not deliberately choose to expose themselves to the worse experience; they simply made a mistake. If we had asked them, “Would you prefer a 90-second immersion or only the first part of it?” they would certainly have selected the short option. We did not use these words, however, and the subjects did what came naturally: they chose to repeat the episode of which they had the less aversive memory. The subjects knew quite well which of the two exposures was longer—we asked them—but they did not use that knowledge. Their decision was governed by a simple rule of intuitive choice: pick the option you like the most, or dislike the least. Rules of memory determined how much they disliked the two options, which in turn determined their choice. The cold-hand experiment, like my old injections puzzle, revealed a discrepancy between decision utility and experienced utility.
那些喜欢长情节的受试者并不是受虐狂,他们并没有故意选择让自己暴露在更糟糕的体验中;他们只是犯了一个错误。如果我们问他们:"你喜欢90秒的沉浸还是只喜欢它的第一部分?"他们肯定会选择短的 。然而,我们没有使用这些词,受试者自然而然地做了一件事:他们选择重复他们有较少厌恶性记忆的那段情节。受试者非常清楚地知道两种曝光中哪一种更长--我们问过他们--但他们并没有使用这种知识。他们的决定是由一个简单的直觉选择规则支配的:选择你最喜欢的选项,或最不喜欢的 。记忆规则决定了他们对这两个选项的厌恶程度,这反过来又决定了他们的选择。这个冷手实验,就像我以前的注射谜题一样,揭示了决策效用和经验效用之间的差异。
The preferences we observed in this experiment are another example of the less-is-more effect that we have encountered on previous occasions. One was Christopher Hsee’s study in which adding dishes to a set of 24 dishes lowered the total value because some of the added dishes were broken. Another was Linda, the activist woman who is judged more likely to be a feminist bank teller than a bank teller. The similarity is not accidental. The same operating feature of System 1 accounts for all three situations: System 1 represents sets by averages, norms, and prototypes, not by sums. Each cold-hand episode is a set of moments, which the remembering self stores as a prototypical moment. This leads to a conflict. For an objective observer evaluating the episode from the reports of the experiencing self, what counts is the “area under the curve” that integrates pain over time; it has the nature of a sum. The memory that the remembering self keeps, in contrast, is a representative moment, strongly influenced by the peak and the end.
我们在这个实验中观察到的偏好是我们在以前的场合中遇到的少即是多效应的另一个例子。一个是Christopher Hsee的研究,在一套24个盘子中加入盘子会降低总价值,因为有些加入的盘子是坏的。另一个是琳达,这位活跃的女性被判断为比银行出纳员更有可能成为女权主义银行出纳员。这种相似性并不是偶然的。系统1的相同操作特征说明了这三种情况。系统1通过平均数、规范、 和原型来表示集合,而不是通过总和。每个冷手的事件都是一组时刻,记忆中的自我将其储存为一个原型时刻。这导致了一种冲突。对于一个客观的观察者来说,从体验自我的报告中评估这一事件,重要的是 "曲线下的面积",它综合了一段时间的疼痛;它具有总和的性质。相比之下,记忆中的自己所保留的记忆, ,是一个有代表性的时刻,受到高峰和终点的强烈影响。
Of course, evolution could have designed animals’ memory to store integrals, as it surely does in some cases. It is important for a squirrel to “know” the total amount of food it has stored, and a representation of the average size of the nuts would not be a good substitute. However, the integral of pain or pleasure over time may be less biologically significant. We know, for example, that rats show duration neglect for both pleasure and pain. In one experiment, rats were consistently exposed to a sequence in which the onset of a light signals that an electric shock will soon be delivered. The rats quickly learned to fear the light, and the intensity of their fear could be measured by several physiological responses. The main finding was that the duration of the shock has little or no effect on fear—all that matters is the painful intensity of the stimulus.
当然,进化可以将动物的记忆设计为存储积分,因为在某些情况下它肯定会这样做。对一只松鼠来说,"知道 "它所储存的食物总量是很重要的,而坚果的平均大小的表示法不会是一个很好的替代品。然而,随着时间的推移,痛苦或 快乐的积分可能在生物学上不那么重要。 例如,我们知道,大鼠对快乐和痛苦都表现出持续时间的忽视。在一个实验中,大鼠被持续地暴露在一个序列中,在这个序列中,光的开始预示着电击即将来临。大鼠很快学会了害怕灯光,它们的恐惧强度可以通过几种生理学 。主要发现是 电击的持续时间 对恐惧几乎没有影响,重要的是刺激的疼痛强度。
Other classic studies showed that electrical stimulation of specific areas in the rat brain (and of corresponding areas in the human brain) produce a sensation of intense pleasure, so intense in some cases that rats who can stimulate their brain by pressing a lever will die of starvation without taking a break to feed themselves. Pleasurable electric stimulation can be delivered in bursts that vary in intensity and duration. Here again, only intensity matters. Up to a point, increasing the duration of a burst of stimulation does not appear to increase the eagerness of the animal to obtain it. The rules that govern the remembering self of humans have a long evolutionary history.
其他经典研究表明,对大鼠大脑中的特定区域(以及人脑中的相应区域)的电刺激会产生强烈的快感,在某些情况下是如此强烈,以至于能够通过按压杠杆来刺激大脑的大鼠 ,而不需要休息一下就会饿死。令人愉悦的电刺激可以以不同的强度和持续时间爆发出来。在这里,同样只有强度是重要的。在一定程度上,增加 一阵刺激 的持续时间似乎不会增加动物对获得刺激的渴望。管理人类记忆自我 的规则有很长的进化历史。
BIOLOGY VS. RATIONALITY
生物学 VS.理性
The most useful idea in the injections puzzle that preoccupied me years ago was that the experienced utility of a series of equally painful injections can be measured, by simply counting the injections. If all injections are equally aversive, then 20 of them are twice as bad as 10, and a reduction from 20 to 18 and a reduction from 6 to 4 are equally valuable. If the decision utility does not correspond to the experienced utility, then something is wrong with the decision. The same logic played out in the cold-hand experiment: an episode of pain that lasts 90 seconds is worse than the first 60 seconds of that episode. If people willingly choose to endure the longer episode, something is wrong with their decision. In my early puzzle, the discrepancy between the decision and the experience originated from diminishing sensitivity: the difference between 18 and 20 is less impressive, and appears to be worth less, than the difference between 6 and 4 injections. In the cold-hand experiment, the error reflects two principles of memory: duration neglect and the peak-end rule. The mechanisms are different but the outcome is the same: a decision that is not correctly attuned to the experience.
几年前困扰我的注射难题中最有用的想法是,一系列同样痛苦的注射的经验效用可以被衡量,只需计算注射次数。如果所有的注射都是同样令人厌恶的,那么20次的注射是10次的两倍,从20次减少到18次和从6次减少到4次 ,是同样有价值的。如果决策效用与经验效用不一致,那么这个决策就有问题。同样的逻辑在冷手实验中发挥了作用:持续90秒的疼痛发作比该发作的前60秒更糟糕。如果人们心甘情愿地选择忍受更长的发作,那么他们的决定就有问题了。在我的 早期的谜题中,决定和经验之间的差异源于敏感度的降低:18次和20次之间的差异没有那么令人印象深刻,似乎比6次和4次注射之间的差异更有价值。在冷手实验中,这个错误反映了记忆的两个原则:持续时间忽略和峰值结束规则。机制是不同的,但结果 是相同的:一个没有正确适应经验的决定。
Decisions that do not produce the best possible experience and erroneous forecasts of future feelings—both are bad news for believers in the rationality of choice. The cold-hand study showed that we cannot fully trust our preferences to reflect our interests, even if they are based on personal experience, and even if the memory of that experience was laid down within the last quarter of an hour! Tastes and decisions are shaped by memories, and the memories can be wrong. The evidence presents a profound challenge to the idea that humans have consistent preferences and know how to maximize them, a cornerstone of the rational-agent model. An inconsistency is built into the design of our minds. We have strong preferences about the duration of our experiences of pain and pleasure. We want pain to be brief and pleasure to last. But our memory, a function of System 1, has evolved to represent the most intense moment of an episode of pain or pleasure (the peak) and the feelings when the episode was at its end. A memory that neglects duration will not serve our preference for long pleasure and short pains.
不能产生最佳体验的决定和对未来感受的错误预测--对于选择的合理性的信仰者来说都是坏消息。冷手研究表明,我们不能完全相信 ,我们的偏好反映了我们的利益,即使它们是基于个人的经验,即使 ,这种经验的记忆是在最后一刻钟内奠定的!品味和决定是由记忆决定的,而记忆可能是错误的。这些证据对人类有一致的偏好并知道如何使其最大化的想法提出了深刻的挑战,而这是理性代理人模型的基石。我们的思维设计中就有一个不一致的地方。我们对痛苦和快乐体验的持续时间有强烈的偏好, 。我们希望痛苦是短暂的,快乐是持久的。但是我们的记忆,作为系统1的一个功能,已经进化到代表痛苦或快乐的最强烈的时刻(高峰)和该事件结束时的感觉。忽视持续时间的记忆将不能满足我们对长的快乐和短的痛苦的偏爱。
SPEAKING OF TWO SELVES
说:" "的两个自我
“You are thinking of your failed marriage entirely from the perspective of the remembering self. A divorce is like a symphony with a screeching sound at the end—the fact that it ended badly does not mean it was all bad.”
"你完全是从记忆中的自己的角度来思考你失败的婚姻。离婚就像一首交响乐,在结尾处有刺耳的声音--事实是它结束得很糟糕,但并不意味着它都是坏的。"
“This is a bad case of duration neglect. You are giving the good and the bad part of your experience equal weight, although the good part lasted ten times as long as the other.”
"这是一个糟糕的持续时间忽视案例。你对你的经历中好的和坏的部分给予同等的重视,尽管好的部分持续的时间是另一个部分的十倍 。"
36
36
Life as a Story
生命是一个故事
Early in the days of my work on the measurement of experience, I saw Verdi’s opera La Traviata . Known for its gorgeous music, it is also a moving story of the love between a young aristocrat and Violetta, a woman of the demimonde. The young man’s father approaches Violetta and convinces her to give up her lover, to protect the honor of the family and the marriage prospects of the young man’s sister. In an act of supreme self-sacrifice, Violetta pretends to reject the man she adores. She soon relapses into consumption (the nineteenth-century term for tuberculosis). In the final act, Violetta lies dying, surrounded by a few friends. Her beloved has been alerted and is rushing to Paris to see her. Hearing the news, she is transformed with hope and joy, but she is also deteriorating quickly.
在我从事经验测量工作的早期,我看过威尔第的歌剧 《茶花女 》。这部歌剧以其华丽的音乐而闻名,也是一个动人的故事,讲述了一个年轻的贵族和维奥莱塔之间的爱情,维奥莱塔是一个来自贫民区的女人 。年轻人的父亲找到维奥莱塔,说服她放弃她的情人,以保护家族的荣誉和年轻人妹妹的婚姻前景。在一个最高的自我牺牲的行为中,维奥莱塔假装拒绝了她所爱慕的男人。她很快就患上了肺结核(十九世纪对肺结核的称呼)。在最后一幕中,维奥莱塔奄奄一息,身边只有几个朋友, 。她的爱人已经被惊动,正赶往巴黎看她。听到这个消息,她被希望和喜悦所感染,但她的病情也在迅速恶化。
No matter how many times you have seen the opera, you are gripped by the tension and fear of the moment: Will the young lover arrive in time? There is a sense that it is immensely important for him to join his beloved before she dies. He does, of course, some marvelous love duets are sung, and after 10 minutes of glorious music Violetta dies.
无论你看过多少次歌剧,你都会被这一时刻的紧张和恐惧紧紧抓住。年轻的爱人会及时赶到吗?有一种感觉是,在他心爱的 ,在她死前与她团聚是非常重要的。当然,他做到了,一些奇妙的爱情二重唱被唱了出来,在10分钟的辉煌音乐之后,维奥莱塔死了。
On my way home from the opera, I wondered: Why do we care so much about those last 10 minutes? I quickly realized that I did not care at all about the length of Violetta’s life. If I had been told that she died at age 27, not age 28 as I believed, the news that she had missed a year of happy life would not have moved me at all, but the possibility of missing the last 10 minutes mattered a great deal. Furthermore, the emotion I felt about the lovers’ reunion would not have changed if I had learned that they actually had a week together, rather than 10 minutes. If the lover had come too late, however, La Traviata would have been an altogether different story. A story is about significant events and memorable moments, not about time passing. Duration neglect is normal in a story, and the ending often defines its character. The same core features appear in the rules of narratives and in the memories of colonoscopies, vacations, and films. This is how the remembering self works: it composes stories and keeps them for future reference.
在从歌剧院回家的路上,我在想。为什么我们如此关心最后的10分钟?我很快意识到,我根本不关心维奥莱塔生命的长度。如果我被告知她死于27岁,而不是我所相信的28岁,那么她错过了一年的幸福生活的消息 ,根本不会让我感动,但错过最后10分钟的可能性却非常重要。此外,如果我得知他们实际上有一周的时间在一起, ,而不是10分钟,我对这对恋人重逢的情感也不会改变。然而,如果这对恋人来得太晚, 《茶花女 》将是一个完全不同的故事。 一个故事是关于重大事件和难忘的时刻,而不是关于时间的流逝。在一个故事中忽略时间是正常的,结局往往决定了故事的特点。同样的核心特征出现在叙事的规则和结肠镜检查、度假和电影的记忆中。这就是记忆中的自我是如何工作的:它构成了故事,并把它们保留下来供将来参考。
It is not only at the opera that we think of life as a story and wish it to end well. When we hear about the death of a woman who had been estranged from her daughter for many years, we want to know whether they were reconciled as death approached. We do not care only about the daughter’s feelings—it is the narrative of the mother’s life that we wish to improve. Caring for people often takes the form of concern for the quality of their stories, not for their feelings. Indeed, we can be deeply moved even by events that change the stories of people who are already dead. We feel pity for a man who died believing in his wife’s love for him, when we hear that she had a lover for many years and stayed with her husband only for his money. We pity the husband although he had lived a happy life. We feel the humiliation of a scientist who made an important discovery that was proved false after she died, although she did not experience the humiliation. Most important, of course, we all care intensely for the narrative of our own life and very much want it to be a good story, with a decent hero.
不仅仅是 ,在歌剧中,我们把生命当作一个故事,希望它有一个好的结局。当我们听到一个与女儿疏远多年的妇女去世时,我们想知道他们是否在死亡临近时和解了。我们并不只关心女儿的感受--我们希望改善的是母亲的生活叙述。对人的关怀往往采取关心的形式, ,关心他们故事的质量,而不是关心他们的感受。事实上,我们甚至可以被那些改变已经死亡的人的故事的事件深深打动。我们对一个死后还相信妻子对他的爱的人感到同情,当我们听说她 有一个多年的情人 ,只是为了丈夫的钱而和他在一起。我们怜悯这位丈夫,尽管他曾过着幸福的生活。我们感到屈辱 ,一个科学家做出了一个重要的发现,但在她死后被证明是假的,尽管她没有经历过这种屈辱。当然,最重要的是,我们都非常关心自己的生活叙事,非常希望它是一个好故事,有一个体面的英雄。
The psychologist Ed Diener and his students wondered whether duration neglect and the peak-end rule would govern evaluations of entire lives. They used a short description of the life of a fictitious character called Jen, a never-married woman with no children, who died instantly and painlessly in an automobile accident. In one version of Jen’s story, she was extremely happy throughout her life (which lasted either 30 or 60 years), enjoying her work, taking vacations, spending time with her friends and on her hobbies. Another version added 5 extra years to Jen’s life, who now died either when she was 35 or 65. The extra years were described as pleasant but less so than before. After reading a schematic biography of Jen, each participant answered two questions: “Taking her life as a whole, how desirable do you think Jen’s life was?” and “How much total happiness or unhappiness would you say that Jen experienced in her life?”
心理学家埃德-迪纳(Ed Diener)和他的学生想知道,持续时间的忽视和高峰期规则是否会支配对整个生命的评价 。他们用一个简短的描述来描述一个叫珍的虚构人物的生活,她是一个没有孩子的未婚女子,在一次车祸中即时无痛地死去。在仁的故事的一个版本中,她的一生(持续了30年或60年)都非常快乐,享受她的工作,度假,花时间与她的朋友和她的爱好在一起。 另一个版本是在珍的生命中增加了5年,现在她在35岁或65岁时去世。这些额外的岁月被描述为令人愉快,但不如以前。在阅读了仁的示意图传记后,每个参与者都回答了两个问题。"把她的一生作为一个整体,你认为仁的生活有多理想?"和 "你说仁在她的一生中经历了多少总的幸福或不幸福 ?"
The results provided clear evidence of both duration neglect and a peak-end effect. In a between-subjects experiment (different participants saw different forms), doubling the duration of Jen’s life had no effect whatsoever on the desirability of her life, or on judgments of the total happiness that Jen experienced. Clearly, her life was represented by a prototypical slice of time, not as a sequence of time slices. As a consequence, her “total happiness” was the happiness of a typical period in her lifetime, not the sum (or integral) of happiness over the duration of her life.
结果提供了关于持续时间忽视和峰值效应的明确证据。在一个被试者之间的实验中(不同的参与者看到不同的形式),Jen的生活持续时间翻倍对她的生活的可取性没有任何影响,也对Jen 的总幸福感的判断没有任何影响。显然,她的生活是由一个典型的时间片代表的, ,而不是作为一个时间片的序列。因此,她的 "总幸福 "是她一生中一个典型时期的幸福,而不是她一生中幸福的总和(或积分)。
As expected from this idea, Diener and his students also found a less-is-more effect, a strong indication that an average (prototype) has been substituted for a sum. Adding 5 “slightly happy” years to a very happy life caused a substantial drop in evaluations of the total happiness of that life.
正如从这个想法中所预期的那样,迪纳和他的学生还发现了一个少即是多的效应,这是一个强烈的迹象,表明一个平均数(原型)已经被替换为一个总和。在一个非常幸福的生活中增加5个 "稍微幸福 "的年份 ,导致对该生活的总幸福感的评价大幅下降。
At my urging, they also collected data on the effect of the extra 5 years in a within-subject experiment; each participant made both judgments in immediate succession. In spite of my long experience with judgment errors, I did not believe that reasonable people could say that adding 5 slightly happy years to a life would make it substantially worse. I was wrong. The intuition that the disappointing extra 5 years made the whole life worse was overwhelming.
在我的敦促下,他们还在一个被试者范围内的实验中收集了关于额外5年的影响的数据;每个被试者都立即连续进行了这两项判断。尽管我对判断错误有长期的经验,但我不相信有理智的人可以说,在生活中增加5年略带幸福感的 ,会使它大大恶化。我错了。令人失望的额外5年使整个生活变得更糟,这种直觉是压倒性的。
The pattern of judgments seemed so absurd that Diener and his students initially thought that it represented the folly of the young people who participated in their experiments. However, the pattern did not change when the parents and older friends of students answered the same questions. In intuitive evaluation of entire lives as well as brief episodes , peaks and ends matter but duration does not.
这种判断模式似乎非常荒谬,以至于迪纳和他的学生最初认为这代表了参与他们实验的年轻人的愚蠢。然而,当学生的父母和年长的 朋友回答同样的问题时,这种模式并没有改变。在对 整个生命以及短暂的事件 进行直观的评价时,高峰和终点很重要,但持续时间并不重要。
The pains of labor and the benefits of vacations always come up as objections to the idea of duration neglect: we all share the intuition that it is much worse for labor to last 24 than 6 hours, and that 6 days at a good resort is better than 3. Duration appears to matter in these situations, but this is only because the quality of the end changes with the length of the episode. The mother is more depleted and helpless after 24 hours than after 6, and the vacationer is more refreshed and rested after 6 days than after 3. What truly matters when we intuitively assess such episodes is the progressive deterioration or improvement of the ongoing experience, and how the person feels at the end.
分娩的痛苦和休假的好处总是作为对忽视持续时间的想法的反对意见出现:我们都有这样的直觉,即分娩持续24小时比6小时要糟糕得多,在一个好的度假胜地呆6天比3天要好 。在这些情况下,持续时间似乎很重要,但这只是因为结局的质量随着时间的推移而改变。母亲在24小时后比6小时后更耗费精力和无助,而度假者在6天后比3天后更有活力和休息。当我们凭直觉评估这种发作时,真正重要的是正在进行的体验的逐渐恶化或改善 ,以及人在结束时的感觉。
AMNESIC VACATIONS
失忆的假期
Consider the choice of a vacation. Do you prefer to enjoy a relaxing week at the familiar beach to which you went last year? Or do you hope to enrich your store of memories? Distinct industries have developed to cater to these alternatives: resorts offer restorative relaxation; tourism is about helping people construct stories and collect memories. The frenetic picture taking of many tourists suggests that storing memories is often an important goal, which shapes both the plans for the vacation and the experience of it. The photographer does not view the scene as a moment to be savored but as a future memory to be designed. Pictures may be useful to the remembering self—though we rarely look at them for very long, or as often as we expected, or even at all—but picture taking is not necessarily the best way for the tourist’s experiencing self to enjoy a view.
考虑一下度假的选择。你是喜欢在去年去的那个熟悉的海滩上享受一个轻松的星期?还是希望充实自己的记忆库?不同的行业已经发展起来,以满足这些选择:度假村提供恢复性的放松;旅游业是帮助人们构建 ,收集记忆。许多游客狂热的拍照 ,这表明储存记忆往往是一个重要的目标,它塑造了假期的计划和假期的体验。摄影者并不把场景看作是需要品味的时刻,而是看作是需要设计的未来记忆。照片对记忆中的自己可能是有用的,尽管我们很少看很长时间 ,或者像我们预期的那样经常看,甚至根本不看,但拍照不一定是游客的体验者享受风景的最佳方式。
In many cases we evaluate touristic vacations by the story and the memories that we expect to store. The word memorable is often used to describe vacation highlights, explicitly revealing the goal of the experience. In other situations—love comes to mind—the declaration that the present moment will never be forgotten, though not always accurate, changes the character of the moment. A self-consciously memorable experience gains a weight and a significance that it would not otherwise have.
在许多情况下,我们通过故事和我们期望储存的记忆来评价旅游度假。 令人难忘 这个词经常被用来描述假期的亮点,明确地揭示了体验的目标。 一个自我意识到的难忘的经历获得了一种重量和意义,否则它就不会有。
Ed Diener and his team provided evidence that it is the remembering self that chooses vacations. They asked students to maintain daily diaries and record a daily evaluation of their experiences during spring break. The students also provided a global rating of the vacation when it had ended. Finally, they indicated whether or not they intended to repeat or not to repeat the vacation they had just had. Statistical analysis established that the intentions for future vacations were entirely determined by the final evaluation—even when that score did not accurately represent the quality of the experience that was described in the diaries. As in the cold-hand experiment, right or wrong, people choose by memory when they decide whether or not to repeat an experience.
埃德-迪纳和他的团队提供了证据,表明是记忆中的自我选择了假期。他们要求学生保持每日日记并记录 ,对他们在春假期间的经历进行每日评估。当假期结束时,学生们还提供了一份对假期的总体评价。最后,他们指出他们是否打算重复或不重复他们刚刚度过的假期。统计分析表明,对未来假期的意向完全由最后的评价决定--即使该分数 ,也不能准确代表日记中描述的体验质量。正如在冷手实验中,无论对错,人们在决定是否重复一次经历时,都 是通过记忆来选择 的。
A thought experiment about your next vacation will allow you to observe your attitude to your experiencing self.
关于你的下一个假期的思想实验将使你能够观察到你对体验自我的态度。
At the end of the vacation, all pictures and videos will be destroyed. Furthermore, you will swallow a potion that will wipe out all your memories of the vacation.
在假期结束时,所有的照片和视频都将 ,被销毁。此外,你将吞下一种药水,抹去你对这个假期的所有记忆。
How would this prospect affect your vacation plans? How much would you be willing to pay for it, relative to a normally memorable vacation?
这种前景会如何影响你的度假计划?相对于通常令人难忘的假期,你愿意为此支付多少钱?
While I have not formally studied the reactions to this scenario, my impression from discussing it with people is that the elimination of memories greatly reduces the value of the experience. In some cases, people treat themselves as they would treat another amnesic, choosing to maximize overall pleasure by returning to a place where they have been happy in the past. However, some people say that they would not bother to go at all, revealing that they care only about their remembering self, and care less about their amnesic experiencing self than about an amnesic stranger. Many point out that they would not send either themselves or another amnesic to climb mountains or trek through the jungle—because these experiences are mostly painful in real time and gain value from the expectation that both the pain and the joy of reaching the goal will be memorable.
虽然我没有正式研究过对这种情况的反应,但我从与人讨论中得到的印象是,记忆的消除 ,大大降低了体验的价值。在某些情况下,人们对待自己就像对待另一个失忆者一样,选择通过回到一个他们过去曾经快乐过的地方,最大限度地提高 整体的快乐。然而,有些人说他们根本不会费心去,这表明他们只关心记忆中的自己,而对失忆的经历 ,比对失忆的陌生人关心得少。许多人指出,他们不会让自己或另一个失忆者去爬山或在丛林中跋涉--因为这些经历在现实中大多是痛苦的,并且从期望达到目标的痛苦和喜悦都是值得纪念的中获得价值。
For another thought experiment, imagine you face a painful operation during which you will remain conscious. You are told you will scream in pain and beg the surgeon to stop. However, you are promised an amnesia-inducing drug that will completely wipe out any memory of the episode. How do you feel about such a prospect? Here again, my informal observation is that most people are remarkably indifferent to the pains of their experiencing self. Some say they don’t care at all. Others share my feeling, which is that I feel pity for my suffering self but not more than I would feel for a stranger in pain. Odd as it may seem, I am my remembering self, and the experiencing self, who does my living, is like a stranger to me.
对于另一个思想实验,想象一下,在 ,你将面临一个痛苦的手术,你将保持清醒。你被告知你会因疼痛而尖叫,并乞求外科医生停止。然而,你被许诺使用一种失忆诱导药物,将完全消除对这一事件的任何记忆。你对这样的前景有什么感觉?在这里,我的非正式观察是,大多数人对他们所经历的自我的痛苦非常漠不关心。有些人说他们根本不关心 。其他人和我有同样的感觉,那就是我对受苦的自己感到怜悯,但不会比我对一个痛苦的陌生人的感觉更强烈。虽然看起来很奇怪,但我是我的记忆中的自己,而为我生活服务的体验中的自己,对我来说就像一个陌生人。
SPEAKING OF LIFE AS A STORY
说到生活是一个故事
“He is desperately trying to protect the narrative of a life of integrity, which is endangered by the latest episode.”
"他正在拼命地保护正直生活的叙事,而这一叙事在最近的事件中受到了威胁。"
“The length to which he was willing to go for a one-night encounter is a sign of total duration neglect.”
"他愿意为一夜的邂逅付出的代价,是完全忽视时间的表现。"
“You seem to be devoting your entire vacation to the construction of memories. Perhaps you should put away the camera and enjoy the moment, even if it is not very memorable?”
"你似乎把你的整个假期都投入到记忆的构建中。也许你应该收起相机,享受这一刻,即使它不是很令人难忘?"
“She is an Alzheimer’s patient. She no longer maintains a narrative of her life, but her experiencing self is still sensitive to beauty and gentleness.”
"她是一个阿尔茨海默氏症患者。她不再保持对自己生活的叙述,但她的体验自我仍然对美和温柔敏感 "。
37
37
Experienced Well-Being
经历过的幸福生活
When I became interested in the study of well-being about fifteen years ago, I quickly found out that almost everything that was known about the subject drew on the answers of millions of people to minor variations of a survey question, which was generally accepted as a measure of happiness. The question is clearly addressed to your remembering self, which is invited to think about your life:
大约15年前,当我对幸福感的研究产生兴趣时,我很快发现,几乎所有关于这个主题的知识都借鉴了数百万人对一个调查问题的微小变化的回答 ,这个问题被普遍认为是衡量幸福的标准。这个问题显然是针对你记忆中的自我的,它被邀请去思考你的生活。
All things considered, how satisfied are you with your life as a whole these days ?
综上所述,你对最近的 整体生活 的满意度如何?
Having come to the topic of well-being from the study of the mistaken memories of colonoscopies and painfully cold hands, I was naturally suspicious of global satisfaction with life as a valid measure of well-being. As the remembering self had not proved to be a good witness in my experiments, I focused on the well-being of the experiencing self. I proposed that it made sense to say that “Helen was happy in the month of March” if
从对结肠镜检查和痛苦的冷手的错误记忆的研究中来到幸福感这个话题,我 ,自然对全球对生活的满意度作为幸福感的有效衡量标准表示怀疑。由于记忆中的自我在我的实验中没有被证明是一个好的证人,我把注意力集中在体验中的自我的幸福上。我提出,如果说 "海伦在三月很快乐 "是有意义的,如果
she spent most of her time engaged in activities that she would rather continue than stop, little time in situations she wished to escape, and—very important because life is short—not too much time in a neutral state in which she would not care either way.
她的大部分时间都在从事她宁愿继续也不愿停止的活动, ,很少有时间处于她希望逃避的情况,而且--非常重要的是,因为生命短暂--没有太多的时间处于中性状态,在这种状态下,她不会关心任何事情。
There are many different experiences we would rather continue than stop, including both mental and physical pleasures. One of the examples I had in mind for a situation that Helen would wish to continue is total absorption in a task, which Mihaly Csikszentmihalyi calls flow —a state that some artists experience in their creative moments and that many other people achieve when enthralled by a film, a book, or a crossword puzzle: interruptions are not welcome in any of these situations. I also had memories of a happy early childhood in which I always cried when my mother came to tear me away from my toys to take me to the park, and cried again when she took me away from the swings and the slide. The resistance to interruption was a sign I had been having a good time, both with my toys and with the swings.
有许多不同的体验我们宁愿继续也不愿停止,包括精神和身体的愉悦。对于海伦希望继续的情况,我 ,其中一个例子是完全沉浸在一项 ,Mihaly Csikszentmihalyi称之为 流动--这是 一些艺术家在创作时经历的状态,许多其他人在沉醉于一部电影、一本书或一道填字游戏时达到的状态:在任何这些情况下都不欢迎中断。我也有快乐的童年记忆,在那里,当我的母亲把我从玩具中拉出来带我去 ,我总是哭,当她把我从秋千和滑梯上带走时,我又哭了。对中断的抵制表明我一直过得很开心,无论是在玩具还是在秋千上。
I proposed to measure Helen’s objective happiness precisely as we assessed the experience of the two colonoscopy patients, by evaluating a profile of the well-being she experienced over successive moments of her life. In this I was following Edgeworth’s hedonimeter method of a century earlier. In my initial enthusiasm for this approach, I was inclined to dismiss Helen’s remembering self as an error-prone witness to the actual well-being of her experiencing self. I suspected this position was too extreme, which it turned out to be, but it was a good start.
我提议衡量海伦的客观幸福感,就像我们评估两位结肠镜检查患者的经历一样,通过评估她在 她的生活的连续时刻所经历的幸福感的概况。在这一点上,我沿用了一个世纪前埃奇沃斯的享乐指数法。在我最初对这一方法的热情中,我倾向于将海伦的记忆中的自我作为她经历中的自我的实际幸福感的一个容易出错的证人。我怀疑这一立场过于极端,事实证明也是如此,但这是一个好的开始。
EXPERIENCED WELL-BEING
经历过的幸福感
I assembled “a dream team” that included three other psychologists of different specialties and one economist, and we set out together to develop a measure of the well-being of the experiencing self. A continuous record of experience was unfortunately impossible—a person cannot live normally while constantly reporting her experiences. The closest alternative was experience sampling, a method that Csikszentmihalyi had invented. Technology has advanced since its first uses. Experience sampling is now implemented by programming an individual’s cell phone to beep or vibrate at random intervals during the day. The phone then presents a brief menu of questions about what the respondent was doing and who was with her when she was interrupted. The participant is also shown rating scales to report the intensity of various feelings : happiness, tension, anger, worry, engagement, physical pain, and others.
我组建了 一个 "梦之队 " ,其中包括其他三位不同专业的心理学家和一位经济学家,我们一起着手开发一个衡量体验自我的幸福感。不幸的是,连续的经验记录是不可能的--一个人不可能在不断报告她的经验时正常生活。最接近的替代方法是经验抽样,一种Csikszentmihalyi发明的方法, 。自其首次使用以来,技术得到了发展。现在,经验抽样是通过对个人的手机进行编程,使其在一天中以随机的时间间隔发出提示音或振动来实现的。然后手机会显示一个简短的问题菜单,内容是被调查者在做什么,以及她被打断时谁和她在一起。被访者也会看到评级表来报告 各种感觉的强度 :快乐、紧张、愤怒、担心、参与、身体疼痛和其他。
Experience sampling is expensive and burdensome (although less disturbing than most people initially expect; answering the questions takes very little time). A more practical alternative was needed, so we developed a method that we called the Day Reconstruction Method (DRM). We hoped it would approximate the results of experience sampling and provide additional information about the way people spend their time . Participants (all women, in the early studies) were invited to a two-hour session. We first asked them to relive the previous day in detail, breaking it up into episodes like scenes in a film. Later, they answered menus of questions about each episode, based on the experience-sampling method. They selected activities in which they were engaged from a list and indicated the one to which they paid most attention. They also listed the individuals they had been with, and rated the intensity of several feelings on separate 0–6 scales (0 = the absence of the feeling; 6 = most intense feeling). Our method drew on evidence that people who are able to retrieve a past situation in detail are also able to relive the feelings that accompanied it, even experiencing their earlier physiological indications of emotion .
经验取样是昂贵和繁琐的(尽管比大多数人最初预期的要少一些干扰;回答问题只需要很少的时间)。我们需要一个更实用的替代方法,所以我们开发了一种方法,我们称之为日重建法(DRM)。我们希望它能接近 经验抽样的结果,并提供关于人们 花费时间 的方式的额外 信息。参与者(在早期的研究中,都是女性)被邀请参加一个两小时的会议。我们首先要求他们详细回顾前一天的情况,像电影中的场景一样把它分成若干个情节。随后,他们根据经验抽样法回答了关于每个情节的问题菜单。他们从列表中选择了他们所参与的 活动,并指出他们最关注的活动。他们还列出了与他们在一起的人,并对几种感觉的强度进行了0-6的评分(0=没有这种感觉;6=最强烈的感觉)。我们的方法借鉴了一些证据,即能够详细检索过去情况的人也能够 ,重温伴随它的感觉,甚至体验到他们早期的 生理情感迹象 。
We assumed that our participants would fairly accurately recover the feeling of a prototypical moment of the episode. Several comparisons with experience sampling confirmed the validity of the DRM. Because the participants also reported the times at which episodes began and ended, we were able to compute a duration-weighted measure of their feeling during the entire waking day. Longer episodes counted more than short episodes in our summary measure of daily affect. Our questionnaire also included measures of life satisfaction, which we interpreted as the satisfaction of the remembering self. We used the DRM to study the determinants of both emotional well-being and life satisfaction in several thousand women in the United States, France, and Denmark.
我们假定,我们的参与者会相当准确地恢复剧情原型时刻的感觉。与经验抽样的几次比较证实了DRM的有效性。由于参与者也报告了发作开始和结束的时间,我们 ,能够计算出他们在整个清醒的一天中的感觉的持续时间加权测量。在我们对日常情绪的汇总测量中,较长的发作比短的发作算得多。我们的问卷还包括对生活满意度的测量,我们将其解释为记忆中的自我的满意度。我们使用DRM来研究美国、法国和丹麦的几千名妇女的情绪幸福和生活满意度的决定因素 。
The experience of a moment or an episode is not easily represented by a single happiness value. There are many variants of positive feelings, including love, joy, engagement, hope, amusement, and many others. Negative emotions also come in many varieties, including anger, shame, depression, and loneliness. Although positive and negative emotions exist at the same time, it is possible to classify most moments of life as ultimately positive or negative. We could identify unpleasant episodes by comparing the ratings of positive and negative adjectives. We called an episode unpleasant if a negative feeling was assigned a higher rating than all the positive feelings. We found that American women spent about 19% of the time in an unpleasant state, somewhat higher than French women (16%) or Danish women (14%).
一个时刻或一个情节的体验不容易用一个单一的幸福值来表示。积极情绪有很多变种,包括爱、快乐、参与、希望、娱乐,以及其他许多变种。负面情绪也有很多种类,包括愤怒、羞耻、抑郁和孤独。尽管积极的 和消极的情绪同时存在,但有可能将生活中的大多数时刻归类为最终的积极或消极。我们可以通过比较积极和消极形容词的评分来确定不愉快的情节。如果一种消极的感觉被赋予比所有积极的感觉更高的评价,我们就称其为不愉快的事件。我们发现,美国妇女花了大约19%的时间 ,处于不愉快的状态,比法国妇女(16%)或丹麦妇女(14%)高一些。
We called the percentage of time that an individual spends in an unpleasant state the U-index . For example, an individual who spent 4 hours of a 16-hour waking day in an unpleasant state would have a U-index of 25%. The appeal of the U-index is that it is based not on a rating scale but on an objective measurement of time. If the U-index for a population drops from 20% to 18%, you can infer that the total time that the population spent in emotional discomfort or pain has diminished by a tenth.
我们把一个人花在不愉快状态下的时间百分比 称为U指数 。例如,一个人在一天16个小时的清醒状态中,有4个小时处于不愉快的状态,那么他的U-指数为25%。U-指数的吸引力在于它不是基于一个评级表,而是基于一个 客观测量 的时间。如果一个群体的U指数从20%下降到18%,你可以推断出这个群体在情绪不适或痛苦中度过的总时间减少了十分之一。
A striking observation was the extent of inequality in the distribution of emotional pain . About half our participants reported going through an entire day without experiencing an unpleasant episode. On the other hand, a significant minority of the population experienced considerable emotional distress for much of the day. It appears that a small fraction of the population does most of the suffering—whether because of physical or mental illness, an unhappy temperament, or the misfortunes and personal tragedies in their life.
一个引人注目的观察是 情绪痛苦分布 的不平等程度。我们的参与者中大约有一半人报告说一整天都没有经历过不愉快的事情。另一方面, ,有相当一部分人在一天的大部分时间里经历了相当大的情绪痛苦。看来,一小部分人承受了大部分的痛苦--无论是因为身体或精神疾病、不愉快的性格,还是生活中的不幸和个人悲剧。
A U-index can also be computed for activities. For example, we can measure the proportion of time that people spend in a negative emotional state while commuting, working, or interacting with their parents, spouses, or children. For 1,000 American women in a Midwestern city, the U-index was 29% for the morning commute, 27% for work, 24% for child care, 18% for housework, 12% for socializing, 12% for TV watching, and 5% for sex. The U-index was higher by about 6% on weekdays than it was on weekends, mostly because on weekends people spend less time in activities they dislike and do not suffer the tension and stress associated with work. The biggest surprise was the emotional experience of the time spent with one’s children, which for American women was slightly less enjoyable than doing housework. Here we found one of the few contrasts between French and American women: Frenchwomen spend less time with their children but enjoy it more, perhaps because they have more access to child care and spend less of the afternoon driving children to various activities.
U型指数也可以为活动计算出来。例如,我们可以测量人们在通勤、工作或与父母、配偶或子女互动时,在负面情绪状态下所花费的时间比例 。对于中西部城市的1000名美国妇女来说,早上通勤时的U指数为29%,工作时为27%,照顾孩子时为24%,做家务时为18%,社交时为12%,看电视时为12%,做爱时为5%。U-指数在工作日 ,比周末高约6%,主要是因为在周末,人们花在他们不喜欢的活动上的时间较少,没有遭受与工作有关的紧张和压力。最大的惊喜是与孩子在一起的时间的情感体验,对美国妇女来说,这比做家务的乐趣略低。在这里,我们发现了法国和美国妇女之间为数不多的反差之一。 法国妇女花在孩子身上的时间更少,但却更享受,这也许是因为她们有更多的机会获得儿童护理,并且花在下午开车送孩子去参加各种活动的时间更少。
An individual’s mood at any moment depends on her temperament and overall happiness, but emotional well-being also fluctuates considerably over the day and the week. The mood of the moment depends primarily on the current situation. Mood at work, for example, is largely unaffected by the factors that influence general job satisfaction, including benefits and status. More important are situational factors such as an opportunity to socialize with coworkers, exposure to loud noise, time pressure (a significant source of negative affect), and the immediate presence of a boss (in our first study, the only thing that was worse than being alone). Attention is key. Our emotional state is largely determined by what we attend to, and we are normally focused on our current activity and immediate environment. There are exceptions, where the quality of subjective experience is dominated by recurrent thoughts rather than by the events of the moment. When happily in love, we may feel joy even when caught in traffic, and if grieving, we may remain depressed when watching a funny movie. In normal circumstances, however, we draw pleasure and pain from what is happening at the moment, if we attend to it. To get pleasure from eating, for example, you must notice that you are doing it. We found that French and American women spent about the same amount of time eating, but for Frenchwomen, eating was twice as likely to be focal as it was for American women. The Americans were far more prone to combine eating with other activities, and their pleasure from eating was correspondingly diluted.
一个人在任何时候的情绪都取决于她的气质和整体幸福感,但情绪的好坏在一天和一周内也有很大的波动。当下的情绪主要取决于 ,取决于当前的情况。例如,工作中的情绪基本上不受影响一般工作满意度的因素影响,包括福利和地位。更重要的是情景因素,如与同事社交的机会,暴露在嘈杂的噪音中,时间压力(负面情绪的重要来源),以及老板的直接存在(在我们的第一个研究中,唯一的 ,比孤独更糟糕)。注意力是关键。我们的情绪状态在很大程度上是由我们关注的东西决定的,而我们通常会关注我们当前的活动和直接环境。但也有例外,主观体验的质量是由反复出现的想法而不是当下的事件所主导的。当幸福地 ,我们可能会感到快乐,即使在交通中被抓到 ,如果悲伤,我们可能在看一个有趣的电影时仍然感到沮丧。然而,在正常情况下,如果我们关注当下正在发生的事情,我们会从其中汲取快乐和痛苦。例如,要从饮食中获得快乐,你必须注意到你正在做这件事。我们发现,法国妇女和美国妇女花在吃饭上的时间差不多,但是对于法国妇女来说,吃饭的可能性是美国妇女的两倍 。美国人更容易把吃和其他活动结合起来,他们从吃中得到的快乐也相应地被稀释了。
These observations have implications for both individuals and society. The use of time is one of the areas of life over which people have some control. Few individuals can will themselves to have a sunnier disposition, but some may be able to arrange their lives to spend less of their day commuting, and more time doing things they enjoy with people they like. The feelings associated with different activities suggest that another way to improve experience is to switch time from passive leisure, such as TV watching, to more active forms of leisure, including socializing and exercise. From the social perspective, improved transportation for the labor force, availability of child care for working women, and improved socializing opportunities for the elderly may be relatively efficient ways to reduce the U-index of society—even a reduction by 1% would be a significant achievement, amounting to millions of hours of avoided suffering. Combined national surveys of time use and of experienced well-being can inform social policy in multiple ways. The economist on our team, Alan Krueger, took the lead in an effort to introduce elements of this method into national statistics.
这些观察对个人和社会都有影响。时间的使用是人们可以控制的生活领域之一。很少有人能让自己拥有更阳光的性格, ,但有些人可能能够安排自己的生活,减少每天花在通勤上的时间,而有更多的时间与他们喜欢的人一起做他们喜欢的事情。与不同活动相关的感受表明,改善体验的另一个方法是将时间从看电视等被动的休闲方式转向更积极的休闲方式,包括社交和运动。从社会角度来看, ,改善劳动力的交通,为职业妇女提供托儿服务,以及改善老年人的社交机会,可能是降低社会U指数的相对有效的方法--即使减少1%也是一个重大成就,相当于避免了数百万小时的痛苦。对时间利用和经历过的幸福感的综合国家调查 ,可以以多种方式告知社会政策。我们团队中的经济学家艾伦-克鲁格率先努力将这种方法的要素引入国家统计。
Measures of experienced well-being are now routinely used in large-scale national surveys in the United States, Canada, and Europe, and the Gallup World Poll has extended these measurements to millions of respondents in the United States and in more than 150 countries. The polls elicit reports of the emotions experienced during the previous day, though in less detail than the DRM. The gigantic samples allow extremely fine analyses, which have confirmed the importance of situational factors, physical health, and social contact in experienced well-being. Not surprisingly, a headache will make a person miserable, and the second best predictor of the feelings of a day is whether a person did or did not have contacts with friends or relatives. It is only a slight exaggeration to say that happiness is the experience of spending time with people you love and who love you.
在美国、加拿大和欧洲的大规模全国性调查中,现在经常使用对经验幸福感的测量, 盖洛普世界民意调查 已经将这些测量扩展到 美国和150多个国家的数百万受访者。这些民意调查引起了对前一天经历的情绪的报告,尽管没有DRM那么详细。巨大的样本允许进行极其精细的分析,这些分析证实了情境因素、身体健康和社会接触对体验到的幸福的重要性。毫不奇怪,头痛会使人痛苦,而 ,对一天的感受的第二好的预测因素是一个人是否与朋友或亲戚有接触。说幸福是与你所爱的人和爱你的人在一起的经历,这只是稍微夸张的说法。
The Gallup data permit a comparison of two aspects of well-being:
盖洛普数据允许对幸福感的两个方面进行比较。
the well-being that people experience as they live their lives
人们在生活中体验到的幸福感
the judgment they make when they evaluate their life
判断 ,他们在评估自己的生活时做出的判断
Gallup’s life evaluation is measured by a question known as the Cantril Self-Anchoring Striving Scale:
盖洛普的生活评价是由一个被称为坎特利尔自我激励奋斗量表的问题来衡量的。
Please imagine a ladder with steps numbered from zero at the bottom to 10 at the top. The top of the ladder represents the best possible life for you and the bottom of the ladder represents the worst possible life for you.
请想象一个阶梯,阶梯的编号从底部的零到顶部的十。梯子的顶端代表你最好的生活,梯子的底部代表你最差的生活。
On which step of the ladder would you say you personally feel you stand at this time?
,你会说你个人觉得你现在站在哪个阶梯上?
Some aspects of life have more effect on the evaluation of one’s life than on the experience of living. Educational attainment is an example. More education is associated with higher evaluation of one’s life, but not with greater experienced well-being. Indeed, at least in the United States, the more educated tend to report higher stress. On the other hand, ill health has a much stronger adverse effect on experienced well-being than on life evaluation. Living with children also imposes a significant cost in the currency of daily feelings—reports of stress and anger are common among parents, but the adverse effects on life evaluation are smaller. Religious participation also has relatively greater favorable impact on both positive affect and stress reduction than on life evaluation. Surprisingly, however, religion provides no reduction of feelings of depression or worry.
生活的某些方面对一个人的生活评价的影响比对生活经验的影响更大。教育程度就是一个例子。更多的教育与对一个人的生活的更高评价有关,但与更大的体验幸福感无关。事实上,至少在美国,受教育程度高的人往往报告说压力更大。 另一方面,健康状况不佳对经验幸福感的不利影响比对生活评价的影响要大得多。与孩子生活在一起也会在日常感受的货币方面造成重大损失--报告压力和愤怒在父母中很常见,但对生活评价的不利影响较小。与生活评价相比,宗教参与对积极的 情感和减少压力也有相对较大的有利影响。然而,令人惊讶的是,宗教并没有减少抑郁或担忧的感觉。
An analysis of more than 450,000 responses to the Gallup-Healthways Well-Being Index, a daily survey of 1,000 Americans, provides a surprisingly definite answer to the most frequently asked question in well-being research: Can money buy happiness? The conclusion is that being poor makes one miserable, and that being rich may enhance one’s life satisfaction, but does not (on average) improve experienced well-being.
对盖洛普-健康之路幸福指数的 450,000多份答复 的分析,即对1000名美国人的日常调查,为幸福研究中最常问的问题提供了一个令人惊讶的明确答案。钱能买到幸福吗? ,结论是:贫穷使人痛苦,富有可能会提高一个人的生活满意度,但(平均)不会改善经验的幸福感。
Severe poverty amplifies the experienced effects of other misfortunes of life. In particular, illness is much worse for the very poor than for those who are more comfortable. A headache increases the proportion reporting sadness and worry from 19% to 38% for individuals in the top two-thirds of the income distribution. The corresponding numbers for the poorest tenth are 38% and 70%—a higher baseline level and a much larger increase. Significant differences between the very poor and others are also found for the effects of divorce and loneliness. Furthermore, the beneficial effects of the weekend on experienced well-being are significantly smaller for the very poor than for most everyone else.
严重的贫困放大了生活中其他不幸的经验影响。特别是,疾病 对于非常贫穷的人来说 比那些更舒适的人要 糟糕 得多。在收入分布的前三分之二的个人中,头痛使报告悲伤 ,担心的比例从19%增加到38%。最贫穷的十分之一的人的相应数字是38%和70%--较高的基线水平和更大的增长。在离婚和孤独的 影响方面,非常贫穷的人和其他人之间也有明显的差异。此外,周末对有经验的幸福感的有利影响 ,对于非常贫穷的人来说,明显小于其他大多数人。
The satiation level beyond which experienced well-being no longer increases was a household income of about $75,000 in high-cost areas (it could be less in areas where the cost of living is lower). The average increase of experienced well-being associated with incomes beyond that level was precisely zero. This is surprising because higher income undoubtedly permits the purchase of many pleasures, including vacations in interesting places and opera tickets, as well as an improved living environment. Why do these added pleasures not show up in reports of emotional experience? A plausible interpretation is that higher income is associated with a reduced ability to enjoy the small pleasures of life. There is suggestive evidence in favor of this idea: priming students with the idea of wealth reduces the pleasure their face expresses as they eat a bar of chocolate!
在高成本地区, 家庭收入约为 75,000美元 (在生活成本较低的地区可能更低),超过这一水平,经验福利就不再增加。与超过这一水平的收入相关的经验幸福感的平均增长正好是零。这是令人惊讶的 ,因为较高的收入无疑允许购买许多乐趣,包括在有趣的地方度假和歌剧票,以及改善生活环境。为什么这些额外的快乐没有出现在情感体验的报告中?一个合理的解释是,较高的收入与享受生活小乐趣的能力下降有关。有暗示性的 证据支持这一观点:用财富的概念引诱学生,会减少他们在 吃一块巧克力 时脸上所表达的愉悦感 。
There is a clear contrast between the effects of income on experienced well-being and on life satisfaction. Higher income brings with it higher satisfaction, well beyond the point at which it ceases to have any positive effect on experience. The general conclusion is as clear for well-being as it was for colonoscopies: people’s evaluations of their lives and their actual experience may be related, but they are also different. Life satisfaction is not a flawed measure of their experienced well-being, as I thought some years ago. It is something else entirely.
收入对经验幸福感和生活满意度的影响之间存在着明显的对比。更高的收入带来了更高的满意度,远远超过了它对经验不再有任何积极影响的程度。一般 ,对于幸福感的结论和对于结肠镜检查的结论一样明确:人们对自己生活的评价和他们的实际经验可能是相关的,但它们也是不同的。生活满意度并不是像我几年前认为的那样,是对他们体验到的幸福感的一种有缺陷的衡量。它是完全不同的东西。
SPEAKING OF EXPERIENCED WELL-BEING
说到有经验的幸福感
“The objective of policy should be to reduce human suffering. We aim for a lower U-index in society. Dealing with depression and extreme poverty should be a priority.”
"政策的目标应该是减少人类 痛苦。我们的目标是降低社会的U指数。处理抑郁症和极端贫困问题应该是一个优先事项。"
“The easiest way to increase happiness is to control your use of time. Can you find more time to do the things you enjoy doing?”
"增加幸福感的最简单方法是控制你对时间的使用。你能找到更多的时间来做你喜欢做的事情吗?"
“Beyond the satiation level of income, you can buy more pleasurable experiences, but you will lose some of your ability to enjoy the less expensive ones.”
"超过收入的饱合度,你可以购买更多的愉悦体验,但你会失去一些享受不那么昂贵的 "的能力。
38
38
Thinking About Life
对生命的思考
Figure 16 is taken from an analysis by Andrew Clark, Ed Diener, and Yannis Georgellis of the German Socio-Economic Panel , in which the same respondents were asked every year about their satisfaction with their life. Respondents also reported major changes that had occurred in their circumstances during the preceding year. The graph shows the level of satisfaction reported by people around the time they got married.
图16 取自安德鲁-克拉克、埃德-迪纳和扬尼斯-乔治利斯对 德国社会经济小组 的分析,其中每年都会询问同样的受访者对其 生活的满意度。受访者还报告了前一年他们的情况发生的重大变化。该图显示了人们在结婚前后报告的满意程度。

Figure 16
图16
The graph reliably evokes nervous laughter from audiences, and the nervousness is easy to understand: after all, people who decide to get married do so either because they expect it will make them happier or because they hope that making a tie permanent will maintain the present state of bliss. In the useful term introduced by Daniel Gilbert and Timothy Wilson, the decision to get married reflects, for many people, a massive error of affective forecasting . On their wedding day, the bride and the groom know that the rate of divorce is high and that the incidence of marital disappointment is even higher, but they do not believe that these statistics apply to them.
图表可靠地唤起了观众紧张的笑声,而这种紧张很容易理解:毕竟,决定结婚的人要么是因为他们期望这将使他们更幸福,要么是因为他们希望使纽带永久化将保持目前的幸福状态。用丹尼尔-吉尔伯特和蒂莫西-威尔逊提出的有用术语来说,对许多人来说,决定结婚 ,反映了 情感预测 的一个巨大错误。在婚礼当天,新娘和新郎知道离婚率很高,婚姻失望的发生率更高,但他们不相信这些统计数据适用于他们。
The startling news of figure 16 is the steep decline of life satisfaction. The graph is commonly interpreted as tracing a process of adaptation, in which the early joys of marriage quickly disappear as the experiences become routine. However, another approach is possible, which focuses on heuristics of judgment. Here we ask what happens in people’s minds when they are asked to evaluate their life. The questions “How satisfied are you with your life as a whole?” and “How happy are you these days?” are not as simple as “What is your telephone number?” How do survey participants manage to answer such questions in a few seconds, as all do? It will help to think of this as another judgment. As is also the case for other questions, some people may have a ready-made answer, which they had produced on another occasion in which they evaluated their life. Others, probably the majority, do not quickly find a response to the exact question they were asked, and automatically make their task easier by substituting the answer to another question. System 1 is at work. When we look at figure 16 in this light, it takes on a different meaning.
图16 中令人吃惊的消息是生活满意度的急剧下降。该图通常被解释为 ,因为它追踪了一个适应过程,在这个过程中,早期的婚姻乐趣很快就消失了,因为这些经验已经成为常规。然而,另一种方法也是可能的,它侧重于判断的启发式方法。在这里,我们要问的是,当人们被要求评估他们的生活时,他们的头脑中会发生什么。"你对你的生活整体上有多满意?"和 "你最近有多开心?"这些问题并不像 "你的电话号码是多少?"那样简单 。调查参与者是如何做到在几秒钟内回答这样的问题的,就像所有人一样?将此视为另一种判断会有所帮助。就像其他问题一样,有些人可能有一个现成的答案,是他们在另一个评价自己生活的场合中产生的。其他人,可能是大多数人,不能很快找到 ,对他们被问到的确切问题的回答,并自动通过取代另一个问题的答案来使他们的任务更容易。系统1在起作用。当我们从这个角度看 图16 时,它就有了不同的意义。
The answers to many simple questions can be substituted for a global evaluation of life. You remember the study in which students who had just been asked how many dates they had in the previous month reported their “happiness these days” as if dating was the only significant fact in their life . In another well-known experiment in the same vein, Norbert Schwarz and his colleagues invited subjects to the lab to complete a questionnaire on life satisfaction . Before they began that task, however, he asked them to photocopy a sheet of paper for him. Half the respondents found a dime on the copying machine, planted there by the experimenter. The minor lucky incident caused a marked improvement in subjects’ reported satisfaction with their life as a whole! A mood heuristic is one way to answer life-satisfaction questions.
许多简单问题的答案可以代替对生活的全面评价。你还记得一项研究,在这项研究中,刚刚被问到在上个月有多少次 的学生报告了他们 "这些天的幸福",好像约会是 他们生活中唯一重要的事实 。在另一个同样著名的实验中,诺伯特-施瓦茨和他的同事邀请受试者到实验室完成一份 关于生活满意度的调查问卷 。然而,在他们开始这项任务之前,他要求他们为他复印一张纸。 ,一半的受访者在复印机上发现了一角钱,是实验者放在那里的。这个小的幸运事件使受试者报告的对他们整个生活的满意度有了明显的提高!心情启发式是回答生活满意度问题的一种方法。
The dating survey and the coin-on-the-machine experiment demonstrated, as intended, that the responses to global well-being questions should be taken with a grain of salt. But of course your current mood is not the only thing that comes to mind when you are asked to evaluate your life. You are likely to be reminded of significant events in your recent past or near future; of recurrent concerns, such as the health of a spouse or the bad company that your teenager keeps; of important achievements and painful failures. A few ideas that are relevant to the question will occur to you; many others will not. Even when it is not influenced by completely irrelevant accidents such as the coin on the machine, the score that you quickly assign to your life is determined by a small sample of highly available ideas, not by a careful weighting of the domains of your life.
约会调查和机器上的硬币实验表明,正如预期的那样,对全球幸福感问题的回答 ,应该带点盐。但当然,当你被要求评估你的生活时,你目前的心情并不是 ,而是唯一想到的事情。你很可能会想起近期或近期的重大事件;反复出现的担忧,如配偶的健康状况或你的青少年的不良伴侣;重要成就和痛苦的失败。一些与问题相关的想法 ,你会想到;其他许多想法则不会。即使没有受到完全不相关的意外的影响,如机器上的硬币,你迅速给你的生活打的分数也是由一小部分高度可用的想法决定的,而不是由对你生活领域的仔细加权决定的。
People who recently married, or are expecting to marry in the near future, are likely to retrieve that fact when asked a general question about their life. Because marriage is almost always voluntary in the United States, almost everyone who is reminded of his or her recent or forthcoming marriage will be happy with the idea. Attention is the key to the puzzle. Figure 16 can be read as a graph of the likelihood that people will think of their recent or forthcoming marriage when asked about their life. The salience of this thought is bound to diminish with the passage of time, as its novelty wanes.
最近结婚的人,或预计在 近期结婚的人,在被问到有关他们生活的一般问题时,很可能会检索到这个事实。因为在美国,婚姻几乎都是自愿的,几乎所有被提醒他或她最近或即将结婚的人都会对这一想法感到高兴。注意力是难题的关键。 图16 可以理解为人们在被问及自己的生活时,会想到自己最近或即将到来的 婚姻的可能性图。随着时间的推移,这种想法的突出性必然会减少,因为它的新颖性会减弱。
The figure shows an unusually high level of life satisfaction that lasts two or three years around the event of marriage. However, if this apparent surge reflects the time course of a heuristic for answering the question, there is little we can learn from it about either happiness or about the process of adaptation to marriage. We cannot infer from it that a tide of raised happiness lasts for several years and gradually recedes. Even people who are happy to be reminded of their marriage when asked a question about their life are not necessarily happier the rest of the time. Unless they think happy thoughts about their marriage during much of their day, it will not directly influence their happiness. Even newlyweds who are lucky enough to enjoy a state of happy preoccupation with their love will eventually return to earth, and their experienced well-being will again depend, as it does for the rest of us, on the environment and activities of the present moment.
该图显示了一个异常高的生活满意度,在结婚事件前后持续了两三年。然而,如果这种明显的激增反映了回答问题的启发式的时间过程,那么我们从它 ,关于幸福或关于适应婚姻的过程,我们几乎无法了解。我们不能从中推断出提高幸福感的浪潮会持续数年并逐渐退去。即使是那些在被问到有关他们生活的问题时被提醒他们的婚姻而感到高兴的人,在其他时间也不一定更幸福。除非他们在一天的大部分时间里对自己的婚姻产生快乐的想法,否则 ,这不会直接影响他们的幸福。即使是那些幸运地享受到对他们的爱情的幸福预想状态的新婚夫妇,最终也会回到地球,他们的经验幸福将再次取决于,就像对我们其他人一样,取决于当下的环境和活动。
In the DRM studies, there was no overall difference in experienced well-being between women who lived with a mate and women who did not. The details of how the two groups used their time explained the finding. Women who have a mate spend less time alone, but also much less time with friends. They spend more time making love, which is wonderful, but also more time doing housework, preparing food, and caring for children, all relatively unpopular activities. And of course, the large amount of time married women spend with their husband is much more pleasant for some than for others. Experienced well-being is on average unaffected by marriage, not because marriage makes no difference to happiness but because it changes some aspects of life for the better and others for the worse.
在DRM研究中, ,与配偶生活在一起的妇女和不与配偶生活在一起的妇女在经历的幸福感方面没有总体差异。两组人如何使用他们的时间的细节解释了这一发现。有配偶的妇女花在独处上的时间较少,但与朋友相处的时间也少得多。她们花更多时间做爱,这很好,但也有更多时间做家务,准备食物,照顾孩子,这些都是相对不受欢迎的活动。当然,已婚妇女花在丈夫身上的大量 ,对一些人来说,比其他人要愉快得多。经验上的幸福感平均来说不受婚姻的影响,这并不是因为 ,婚姻对幸福没有任何影响,而是因为它使生活的某些方面变得更好,而另一些方面则变得更糟。
One reason for the low correlations between individuals’ circumstances and their satisfaction with life is that both experienced happiness and life satisfaction are largely determined by the genetics of temperament. A disposition for well-being is as heritable as height or intelligence, as demonstrated by studies of twins separated at birth. People who appear equally fortunate vary greatly in how happy they are. In some instances, as in the case of marriage, the correlations with well-being are low because of balancing effects. The same situation may be good for some people and bad for others, and new circumstances have both benefits and costs. In other cases, such as high income, the effects on life satisfaction are generally positive, but the picture is complicated by the fact that some people care much more about money than others do.
个人的情况和他们对生活的满意度之间的相关性很低, ,原因之一是经历过的幸福和生活满意度在很大程度上是由气质的遗传决定的。正如对出生时分离的双胞胎的研究所证明的那样,幸福感的倾向与身高或智力一样可以遗传。看起来同样幸运的人在他们的幸福程度上差别很大。在某些情况下,如在婚姻的情况下,由于平衡效应,与幸福感的相关性 是很低的。同样的情况可能对一些人来说是好的,对另一些人来说是坏的,新的环境既有好处也有代价。在其他情况下,如高收入,对生活满意度的影响通常是积极的,但由于有些人比其他人更关心钱,情况就复杂了。
A large-scale study of the impact of higher education, which was conducted for another purpose, revealed striking evidence of the lifelong effects of the goals that young people set for themselves. The relevant data were drawn from questionnaires collected in 1995–1997 from approximately 12,000 people who had started their higher education in elite schools in 1976. When they were 17 or 18, the participants had filled out a questionnaire in which they rated the goal of “being very well-off financially” on a 4-point scale ranging from “not important” to “essential.” The questionnaire they completed twenty years later included measures of their income in 1995, as well as a global measure of life satisfaction.
一项关于高等 教育影响的大规模研究,是为了另一个目的而进行的,揭示了 年轻人 为自己 设定的目标 所产生的终生影响的惊人的证据。相关数据来自于1995-1997年从大约12000名1976年在精英学校开始接受高等教育的人那里收集的问卷。当他们17或18岁时,参与者填写了一份问卷 ,其中他们对 "经济上非常富裕 " 这一目标进行了4分制的评分,从 "不重要 "到 "必须"。二十年后,他们填写的问卷包括对他们在1995年的收入的测量,以及对生活满意度的全面测量。
Goals make a large difference. Nineteen years after they stated their financial aspirations, many of the people who wanted a high income had achieved it. Among the 597 physicians and other medical professionals in the sample, for example, each additional point on the money-importance scale was associated with an increment of over $14,000 of job income in 1995 dollars! Nonworking married women were also likely to have satisfied their financial ambitions. Each point on the scale translated into more than $12,000 of added household income for these women, evidently through the earnings of their spouse.
目标会产生很大的不同。在他们提出财务愿望的19年后,许多希望 高收入的人已经实现了。例如,在样本中的597名医生和其他医疗专业人员中,金钱重要性量表上的每一个附加点都与超过14,000美元的工作收入(以1995年的美元计算)相关联不工作的已婚妇女也有可能满足她们的经济野心。对这些妇女来说,量表上的每一分都转化为超过12,000美元的额外家庭收入 ,显然是通过其配偶的收入。
The importance that people attached to income at age 18 also anticipated their satisfaction with their income as adults. We compared life satisfaction in a high-income group (more than $200,000 household income) to a low-to moderate-income group (less than $50,000). The effect of income on life satisfaction was larger for those who had listed being well-off financially as an essential goal: .57 point on a 5-point scale. The corresponding difference for those who had indicated that money was not important was only .12. The people who wanted money and got it were significantly more satisfied than average; those who wanted money and didn’t get it were significantly more dissatisfied. The same principle applies to other goals—one recipe for a dissatisfied adulthood is setting goals that are especially difficult to attain. Measured by life satisfaction 20 years later, the least promising goal that a young person could have was “becoming accomplished in a performing art.” Teenagers’ goals influence what happens to them, where they end up, and how satisfied they are.
人们在18岁时对收入的重视程度也预示着他们成年后对收入的满意度。我们比较了高收入群体(家庭收入超过20万美元)和中低收入群体(低于5万美元)的生活满意度。 收入对生活满意度的影响,对于那些把经济上的小康作为基本目标的人来说, :在5分制中占0.57分。而对于那些表示 金钱不重要的 人来说,相应的差异只有0.12。那些想要钱并得到钱的人的满意度明显高于平均水平;那些想要钱但没有得到钱的人明显更不满意。同样的原则 ,适用于其他目标--不满意的成年生活的秘诀是设定特别难以实现的目标。以20年后的生活满意度来衡量,一个年轻人最没有希望的目标是 "在表演艺术方面取得成就"。青少年的目标影响着他们的遭遇、他们的结局以及他们的满意程度。
In part because of these findings I have changed my mind about the definition of well-being. The goals that people set for themselves are so important to what they do and how they feel about it that an exclusive focus on experienced well-being is not tenable. We cannot hold a concept of well-being that ignores what people want. On the other hand, it is also true that a concept of well-being that ignores how people feel as they live and focuses only on how they feel when they think about their life is also untenable. We must accept the complexities of a hybrid view, in which the well-being of both selves is considered.
部分原因是由于这些发现 ,我改变了对幸福的定义的看法。人们为自己设定的目标对于他们所做的事情以及他们的感受是如此的重要,以至于只关注经验上的幸福感是站不住脚的。我们不能持有一个忽视人们需求的幸福感的概念。另一方面,忽视人们在 生活时的感受而只关注他们在思考自己的生活时的感受的幸福感概念也是不成立的。我们必须接受混合观点的复杂性,其中考虑到两个自我的福祉。
THE FOCUSING ILLUSION
聚焦幻觉
We can infer from the speed with which people respond to questions about their life, and from the effects of current mood on their responses, that they do not engage in a careful examination when they evaluate their life. They must be using heuristics, which are examples of both substitution and WYSIATI. Although their view of their life was influenced by a question about dating or by a coin on the copying machine, the participants in these studies did not forget that there is more to life than dating or feeling lucky. The concept of happiness is not suddenly changed by finding a dime, but System 1 readily substitutes a small part of it for the whole of it. Any aspect of life to which attention is directed will loom large in a global evaluation. This is the essence of the focusing illusion , which can be described in a single sentence:
我们可以从人们回答有关他们生活的问题的速度,以及从当前情绪对他们回答的影响推断出,他们在评价自己的生活时并没有进行仔细的 。他们一定是在使用启发式方法,这既是替代的例子,也是所见即所得。尽管他们对自己生活的看法受到了关于约会的问题或复印机上的硬币的影响,但这些研究中的参与者并没有忘记,生活中还有比约会或感觉幸运更多的东西。幸福的概念并不会因为找到 一枚硬币而突然改变,但系统1很容易将其中的一小部分取代全部。生活中被关注的任何方面都会在整体评价中占据重要位置。这就是 聚焦幻觉 的本质,可以用一句话来描述。
Nothing in life is as important as you think it is when you are thinking about it.
生活中没有什么是像你在思考时认为的那样重要。
The origin of this idea was a family debate about moving from California to Princeton, in which my wife claimed that people are happier in California than on the East Coast. I argued that climate is demonstrably not an important determinant of well-being—the Scandinavian countries are probably the happiest in the world. I observed that permanent life circumstances have little effect on well-being and tried in vain to convince my wife that her intuitions about the happiness of Californians were an error of affective forecasting.
这个想法的起源是一场关于 ,从加利福尼亚搬到普林斯顿的家庭辩论,其中我的妻子声称,加利福尼亚的人们 ,比东海岸的人们更幸福。我争辩说,气候显然不是决定幸福的重要因素--斯堪的纳维亚国家可能是世界上最幸福的国家。我观察到,永久的生活环境对幸福感没有什么影响,并试图说服我的妻子,她对 加州人幸福感 的 直觉是情感预测的错误,但这是徒劳的。
A short time later, with this debate still on my mind, I participated in a workshop about the social science of global warming. A colleague made an argument that was based on his view of the well-being of the population of planet Earth in the next century. I argued that it was preposterous to forecast what it would be like to live on a warmer planet when we did not even know what it is like to live in California. Soon after that exchange, my colleague David Schkade and I were granted research funds to study two questions: Are people who live in California happier than others? and What are the popular beliefs about the relative happiness of Californians?
不久之后,我带着这场辩论的情绪,参加了一个关于全球变暖的社会科学的研讨会。一位同事提出了一个论点,这个论点是基于他对下个世纪地球上人口的福祉的看法。我认为,当我们甚至不知道生活在加利福尼亚是什么样子的时候,就预测 ,生活在一个更温暖的星球上会是什么样子,这是荒谬的。在那次交流之后不久,我和我的同事大卫-施卡德获得了研究基金,以研究两个问题。生活在加利福尼亚的人比其他人更幸福吗? 以及关于加利福尼亚人的相对幸福的流行信念是什么?
We recruited large samples of students at major state universities in California, Ohio, and Michigan. From some of them we obtained a detailed report of their satisfaction with various aspects of their lives . From others we obtained a prediction of how someone “with your interests and values” who lived elsewhere would complete the same questionnaire.
我们在加州、俄亥俄州和密歇根州的 主要州立大学招募了大量学生样本。我们从他们中的一些人那里得到了他们对 自己生活 各 方面 满意度的详细报告。从其他学生那里,我们获得了对居住在其他地方的 "具有你的兴趣和价值观 "的人将如何完成同样的调查问卷的预测。
As we analyzed the data, it became obvious that I had won the family argument . As expected, the students in the two regions differed greatly in their attitude to their climate: the Californians enjoyed their climate and the Midwesterners despised theirs. But climate was not an important determinant of well-being. Indeed, there was no difference whatsoever between the life satisfaction of students in California and in the Midwest . We also found that my wife was not alone in her belief that Californians enjoy greater well-being than others. The students in both regions shared the same mistaken view, and we were able to trace their error to an exaggerated belief in the importance of climate. We described the error as a focusing illusion .
当我们分析数据时,很明显, 我赢得了家庭争论 。正如预期的那样, ,两个地区的学生在对他们的气候的态度上有很大的不同:加利福尼亚人喜欢他们的气候,而中西部人则鄙视他们的。但气候并不是决定幸福感的重要因素。事实上, 加利福尼亚和中西部的学生 的生活满意度没有任何区别。我们还发现,我妻子并不是唯一相信 ,加州人比其他人享有更多的幸福感。两个地区的学生都有同样的错误观点,我们能够追溯到他们的错误是对气候的重要性的夸大的信念。我们把这个错误描述为 聚焦错觉 。
The essence of the focusing illusion is WYSIATI, giving too much weight to the climate, too little to all the other determinants of well-being. To appreciate how strong this illusion is, take a few seconds to consider the question:
焦点错觉的本质是所见即所得,对气候给予过多的重视,而对福祉的所有其他决定因素则重视不足。 为了了解这种错觉有多强烈,请花几秒钟考虑这个问题。
How much pleasure do you get from your car?
你从你的汽车中得到多少乐趣?
An answer came to your mind immediately; you know how much you like and enjoy your car. Now examine a different question: “ When do you get pleasure from your car?” The answer to this question may surprise you, but it is straightforward: you get pleasure (or displeasure) from your car when you think about your car, which is probably not very often. Under normal circumstances, you do not spend much time thinking about your car when you are driving it. You think of other things as you drive, and your mood is determined by whatever you think about. Here again, when you tried to rate how much you enjoyed your car, you actually answered a much narrower question: “How much pleasure do you get from your car when you think about it ?” The substitution caused you to ignore the fact that you rarely think about your car, a form of duration neglect. The upshot is a focusing illusion. If you like your car, you are likely to exaggerate the pleasure you derive from it, which will mislead you when you think of the virtues of your current vehicle as well as when you contemplate buying a new one.
一个答案立即出现在你的脑海中;你知道你有多喜欢和享受你的车。现在考察一个不同的问题。"你 什么时候 从你的汽车中获得快乐?"这个问题的答案可能会让你吃惊,但它是直截了当的:当 你想到你的车时,你会从你的车中得到快乐(或不快乐) ,这可能不是很频繁。在正常情况下,你在开车时不会花很多时间去想你的车。你在开车时想的是其他事情,而你的心情是由你所想的任何事情决定的。在这里,当你试图评价你有多喜欢你的车时,你实际上回答了一个更狭窄的问题。" 当你想到 你的 车时, 你从它身上得到多少快乐 ?"这种替换使你忽略了你很少想到你的车的事实,这是一种持续时间的忽视。其结果是一种聚焦的错觉。如果你喜欢你的车,你很可能会夸大你从它那里得到的快乐,这将误导你,当你想到你现在的车的优点时,以及当你考虑 买一辆新车时。
A similar bias distorts judgments of the happiness of Californians. When asked about the happiness of Californians, you probably conjure an image of someone attending to a distinctive aspect of the California experience, such as hiking in the summer or admiring the mild winter weather. The focusing illusion arises because Californians actually spend little time attending to these aspects of their life. Moreover, long-term Californians are unlikely to be reminded of the climate when asked for a global evaluation of their life. If you have been there all your life and do not travel much, living in California is like having ten toes: nice, but not something one thinks much about. Thoughts of any aspect of life are more likely to be salient if a contrasting alternative is highly available.
一种类似的偏见扭曲了对加州人幸福的判断。当被问及加州人的幸福感时,你可能会联想到有人在关注加州经验的某个独特方面,例如在夏天徒步旅行或欣赏冬季的温和天气。这种聚焦的错觉之所以出现,是因为加州人实际上很少花时间去关注他们生活的这些方面, 。此外,当被要求对他们的生活进行全面评估时,长期的加利福尼亚人不太可能被提醒气候的问题。如果你一辈子都在那里,而且不经常旅行,生活在加州就像有十个脚趾头一样:很好,但不是人们所想的东西。如果有一个对比强烈的替代方案 ,那么对生活的任何方面的想法都更有可能被突出出来。
People who recently moved to California will respond differently. Consider an enterprising soul who moved from Ohio to seek happiness in a better climate. For a few years following the move, a question about his satisfaction with life will probably remind him of the move and also evoke thoughts of the contrasting climates in the two states. The comparison will surely favor California, and the attention to that aspect of life may distort its true weight in experience. However, the focusing illusion can also bring comfort. Whether or not the individual is actually happier after the move, he will report himself happier, because thoughts of the climate will make him believe that he is. The focusing illusion can cause people to be wrong about their present state of well-being as well as about the happiness of others, and about their own happiness in the future.
最近搬到加州的人将有不同的反应。考虑到一个从俄亥俄州搬来的有进取心的人,在一个更好的气候中寻求幸福。在搬家后的几年里,关于他对生活的满意度的问题可能会让他想起搬家的事,也会唤起他对两个州的气候对比的思考。这种比较肯定会偏向于加利福尼亚, ,对生活的这一方面的关注可能会扭曲它在经验中的真实分量。然而,聚焦的幻觉也能带来安慰。无论个人在搬家后是否真的更快乐,他都会报告自己更快乐,因为对气候的想法会使他相信自己更快乐。聚焦幻觉会使人们对自己目前的幸福状态 ,也会对他人的幸福和自己未来的幸福产生错误。
What proportion of the day do paraplegics spend in a bad mood?
截瘫患者 一天中有多大比例的时间是在 坏情绪中度过的?
This question almost certainly made you think of a paraplegic who is currently thinking about some aspect of his condition. Your guess about a paraplegic’s mood is therefore likely to be accurate in the early days after a crippling accident; for some time after the event, accident victims think of little else. But over time, with few exceptions, attention is withdrawn from a new situation as it becomes more familiar. The main exceptions are chronic pain, constant exposure to loud noise, and severe depression. Pain and noise are biologically set to be signals that attract attention, and depression involves a self-reinforcing cycle of miserable thoughts. There is therefore no adaptation to these conditions. Paraplegia, however, is not one of the exceptions: detailed observations show that paraplegics are in a fairly good mood more than half of the time as early as one month following their accident—though their mood is certainly somber when they think about their situation . Most of the time, however, paraplegics work, read, enjoy jokes and friends, and get angry when they read about politics in the newspaper. When they are involved in any of these activities, they are not much different from anyone else, and we can expect the experienced well-being of paraplegics to be near normal much of the time. Adaptation to a new situation, whether good or bad, consists in large part of thinking less and less about it. In that sense, most long-term circumstances of life, including paraplegia and marriage, are part-time states that one inhabits only when one attends to them.
这个问题几乎肯定让你想到了一个截瘫者,他目前正在思考自己病情的某些方面。因此,你对截瘫者的 ,在残废事故发生后的早期,你的猜测很可能是准确的; ,在事件发生后的一段时间内,事故受害者几乎不考虑其他问题。但随着时间的推移,除了少数例外,注意力会从新的情况中抽离出来,因为它变得更加熟悉。主要的例外是慢性疼痛、持续暴露于巨大的噪音和严重的抑郁症。疼痛和噪音在生物学上被设定为吸引注意力的信号,而抑郁症涉及一个自我强化的悲惨循环 。因此,对这些情况没有任何适应。然而,截瘫并不是一个例外:详细的观察显示,截瘫者在事故发生后的一个月内,有一半以上的时间心情是相当好的--尽管当他们 想到自己的处境 时,心情肯定是阴郁的。然而,大多数时候,截瘫者会工作、阅读、享受笑话 和朋友,当他们在报纸上读到政治时就会生气。当他们参与任何这些活动时,他们与其他人没有什么不同,我们可以预期截瘫者的经验福祉在大部分时间内都接近正常。适应一个新的情况,无论是好的还是坏的,在很大程度上包括越来越少地思考它。从这个意义上说,大多数 长期的生活环境,包括截瘫和婚姻,都是兼职状态,只有当一个人关注它们的时候才会居住在其中。
One of the privileges of teaching at Princeton is the opportunity to guide bright undergraduates through a research thesis. And one of my favorite experiences in this vein was a project in which Beruria Cohn collected and analyzed data from a survey firm that asked respondents to estimate the proportion of time that paraplegics spend in a bad mood. She split her respondents into two groups: some were told that the crippling accident had occurred a month earlier, some a year earlier. In addition, each respondent indicated whether he or she knew a paraplegic personally. The two groups agreed closely in their judgment about the recent paraplegics: those who knew a paraplegic estimated 75% bad mood; those who had to imagine a paraplegic said 70%. In contrast, the two groups differed sharply in their estimates of the mood of paraplegics a year after the accidents: those who knew a paraplegic offered 41% as their estimate of the time in that bad mood. The estimates of those who were not personally acquainted with a paraplegic averaged 68%. Evidently, those who knew a paraplegic had observed the gradual withdrawal of attention from the condition, but others did not forecast that this adaptation would occur. Judgments about the mood of lottery winners one month and one year after the event showed exactly the same pattern.
在普林斯顿大学任教的特权之一是有机会指导聪明的本科生完成研究论文。而我在这方面最喜欢的经历之一是Beruria Cohn收集和分析了一家调查公司的数据, ,要求受访者估计截瘫者花在坏心情上的时间比例。她把受访者分成两组:有些人被告知截瘫事故发生在一个月前,有些人则是一年前。此外,每个受访者都表示他或她本人是否认识一个截瘫者。两组人在对最近的截瘫者的判断上非常一致:那些 认识截瘫者的人估计有75%的坏情绪;那些不得不想象截瘫者的人说有70%。相比之下,两组人对事故发生一年后截瘫者情绪的估计有很大的不同:那些认识截瘫者的人提供了41%作为他们对这种坏情绪的估计时间。而那些与截瘫者不相识的人的估计平均为68%。显而易见, ,那些认识截瘫者的人已经观察到他们的注意力逐渐从病情中撤出,但其他人并没有预测到这种适应会发生。在事件发生一个月和一年后,对彩票中奖者情绪的判断显示出完全相同的模式。
We can expect the life satisfaction of paraplegics and those afflicted by other chronic and burdensome conditions to be low relative to their experienced well-being, because the request to evaluate their lives will inevitably remind them of the life of others and of the life they used to lead. Consistent with this idea, recent studies of colostomy patients have produced dramatic inconsistencies between the patients’ experienced well-being and their evaluations of their lives. Experience sampling shows no difference in experienced happiness between these patients and a healthy population. Yet colostomy patients would be willing to trade away years of their life for a shorter life without the colostomy. Furthermore, patients whose colostomy has been reversed remember their time in this condition as awful, and they would give up even more of their remaining life not to have to return to it. Here it appears that the remembering self is subject to a massive focusing illusion about the life that the experiencing self endures quite comfortably.
我们可以预期截瘫者和那些被 其他慢性和负担性疾病折磨的人的生活满意度相对于他们的经验幸福感来说是 ,因为评价他们生活的要求将不可避免地使他们想起别人的生活和他们曾经的生活。与这个想法一致,最近 对结肠造口术患者的研究 在患者的经验幸福感和他们对生活的评价之间产生了戏剧性的不一致。经验抽样显示,这些病人和健康人群之间在经验幸福感方面没有区别 。然而,结肠造口术患者愿意用他们多年的生命来换取没有结肠造口术的更短的生命。此外,结肠造口术被逆转的病人记得他们在这种情况下的日子是可怕的,他们甚至愿意放弃更多的剩余生命而不必再回到这种情况。在这里,似乎 ,记忆中的自我受到了巨大的聚焦幻觉的影响,而体验中的自我则很舒服地忍受着这种生活。
Daniel Gilbert and Timothy Wilson introduced the word miswanting to describe bad choices that arise from errors of affective forecasting. This word deserves to be in everyday language. The focusing illusion (which Gilbert and Wilson call focalism) is a rich source of miswanting. In particular, it makes us prone to exaggerate the effect of significant purchases or changed circumstances on our future well-being.
丹尼尔-吉尔伯特(Daniel Gilbert)和蒂莫西-威尔逊(Timothy Wilson)引入了 miswanting 这个词来 描述由情感预测错误引起的错误选择。这个词理应出现在日常用语中。焦点错觉(吉尔伯特和威尔逊称之为焦点主义)是错误愿望的一个丰富的 来源。特别是,它使我们容易夸大重大购买或改变环境对我们未来福祉的影响。
Compare two commitments that will change some aspects of your life: buying a comfortable new car and joining a group that meets weekly, perhaps a poker or book club. Both experiences will be novel and exciting at the start. The crucial difference is that you will eventually pay little attention to the car as you drive it, but you will always attend to the social interaction to which you committed yourself. By WYSIATI, you are likely to exaggerate the long-term benefits of the car, but you are not likely to make the same mistake for a social gathering or for inherently attention-demanding activities such as playing tennis or learning to play the cello. The focusing illusion creates a bias in favor of goods and experiences that are initially exciting, even if they will eventually lose their appeal. Time is neglected, causing experiences that will retain their attention value in the long term to be appreciated less than they deserve to be.
比较两个将改变你生活的某些方面的承诺:购买一辆舒适的新车和加入一个每周聚会的团体,也许是扑克或读书俱乐部。这两种经历在开始时都会很新奇和兴奋。关键的区别是 ,你最终会在驾驶汽车时很少注意汽车,但你会一直关注你承诺的社会互动。通过所见即所得,你很可能会夸大汽车的长期利益,但对于社交聚会或打网球或学习 拉大提琴等本质上需要注意力的活动,你不可能犯同样的错误。集中注意力的错觉创造了一种偏向于最初令人兴奋的商品和体验,即使它们最终会失去吸引力。时间被忽视了,导致那些能够长期保持其注意力价值的经验被赞赏的程度低于它们应得的程度。
TIME AND TIME AGAIN
一次又一次
The role of time has been a refrain in this part of the book. It is logical to describe the life of the experiencing self as a series of moments, each with a value. The value of an episode—I have called it a hedonimeter total—is simply the sum of the values of its moments. But this is not how the mind represents episodes. The remembering self, as I have described it, also tells stories and makes choices, and neither the stories nor the choices properly represent time. In storytelling mode, an episode is represented by a few critical moments, especially the beginning, the peak, and the end. Duration is neglected. We saw this focus on singular moments both in the cold-hand situation and in Violetta’s story.
时间的作用在本书的这一部分中一直是一个反问句。 ,将体验自我的生活描述为一系列的时刻,每个时刻都有一个价值,这是合乎逻辑的。一个事件的价值--我称它为享乐主义总和--仅仅是其各个时刻的价值之和。但这并不是心智 ,而是代表了事件。记忆中的自我,正如我所描述的那样,也会讲故事和做选择,而故事和选择都不能恰当地代表 时间。在讲故事的模式中,一个情节由几个关键时刻来代表,特别是开始、高峰和结束。持续时间被忽略了。我们在冷手的情况和维奥莱塔的故事中都看到了这种对单一时刻的关注。
We saw a different form of duration neglect in prospect theory, in which a state is represented by the transition to it. Winning a lottery yields a new state of wealth that will endure for some time, but decision utility corresponds to the anticipated intensity of the reaction to the news that one has won. The withdrawal of attention and other adaptations to the new state are neglected, as only that thin slice of time is considered. The same focus on the transition to the new state and the same neglect of time and adaptation are found in forecasts of the reaction to chronic diseases, and of course in the focusing illusion. The mistake that people make in the focusing illusion involves attention to selected moments and neglect of what happens at other times. The mind is good with stories, but it does not appear to be well designed for the processing of time.
我们在前景理论中看到了一种不同形式的持续时间忽视,在这种理论中,一种状态是由向它的过渡所代表的。赢得彩票会产生一种 ,这种新的财富状态会持续一段时间,但决策效用与对中奖消息的预期反应强度相对应。注意力的撤回和其他对新状态的适应被忽略了,因为只考虑了那一小段时间。在对慢性病反应的预测 ,当然还有聚焦幻觉中,也发现了对过渡到新状态的同样关注和对时间和适应的同样忽视。人们在聚焦幻觉中所犯的错误涉及到对选定时刻的关注和对其他时间发生的事情的忽视。心灵善于处理故事,但它似乎并不是为处理时间而精心设计的。
During the last ten years we have learned many new facts about happiness. But we have also learned that the word happiness does not have a simple meaning and should not be used as if it does. Sometimes scientific progress leaves us more puzzled than we were before.
在过去十年中,我们学到了许多关于幸福的新事实。但我们 ,也了解到 幸福 这个词并没有简单的含义,也不应该把它当作是简单的含义来使用。有时,科学进步让我们比以前更困惑。
SPEAKING OF THINKING ABOUT LIFE
谈到对生活的思考
“She thought that buying a fancy car would make her happier, but it turned out to be an error of affective forecasting.”
"她以为买一辆高级车会让她更快乐,但结果却是情感预测的错误。"
“His car broke down on the way to work this morning and he’s in a foul mood. This is not a good day to ask him about his job satisfaction!”
"他的车今天早上在上班的路上坏了, ,他的心情很糟糕。这不是一个问他工作满意度的好日子!"
“She looks quite cheerful most of the time, but when she is asked she says she is very unhappy. The question must make her think of her recent divorce.”
"她在大多数时候看起来都很开朗,但当她被问及时,她说她非常不开心。这个问题一定让她想到了她最近的离婚。"
“Buying a larger house may not make us happier in the long term. We could be suffering from a focusing illusion.”
"从长远来看,买一个更大的房子可能不会让我们更快乐。我们可能是在遭受一种聚焦的幻觉。"
“He has chosen to split his time between two cities. Probably a serious case of miswanting.”
"他选择在两个 城市之间分配时间。可能是一个严重的欲望错乱情况。"
Conclusions
结论
I began this book by introducing two fictitious characters, spent some time discussing two species, and ended with two selves. The two characters were the intuitive System 1, which does the fast thinking, and the effortful and slower System 2, which does the slow thinking, monitors System 1, and maintains control as best it can within its limited resources. The two species were the fictitious Econs, who live in the land of theory, and the Humans, who act in the real world. The two selves are the experiencing self, which does the living, and the remembering self, which keeps score and makes the choices. In this final chapter I consider some applications of the three distinctions, taking them in reverse order.
我 ,在这本书的开头介绍了两个虚构的人物,花了一些时间讨论了两个物种,最后是两个自我。这两个角色是直觉型的系统1和努力型的系统2,前者负责快速思考,后者负责缓慢思考,监控系统1,并在其有限的资源范围内尽可能地保持控制。这两个物种是 虚构的Econs,他们生活在理论的土地上,而人类则是在现实世界中行动。这两个自我是体验的自我和记忆的自我,前者负责生活,后者负责记账并做出选择。在这最后一章中,我考虑了这三个区别的一些应用,按相反的顺序进行。
TWO SELVES
两个自己
The possibility of conflicts between the remembering self and the interests of the experiencing self turned out to be a harder problem than I initially thought. In an early experiment, the cold-hand study, the combination of duration neglect and the peak-end rule led to choices that were manifestly absurd. Why would people willingly expose themselves to unnecessary pain? Our subjects left the choice to their remembering self, preferring to repeat the trial that left the better memory, although it involved more pain. Choosing by the quality of the memory may be justified in extreme cases, for example when post-traumatic stress is a possibility, but the cold-hand experience was not traumatic. An objective observer making the choice for someone else would undoubtedly choose the short exposure, favoring the sufferer’s experiencing self. The choices that people made on their own behalf are fairly described as mistakes. Duration neglect and the peak-end rule in the evaluation of stories, both at the opera and in judgments of Jen’s life, are equally indefensible. It does not make sense to evaluate an entire life by its last moments, or to give no weight to duration in deciding which life is more desirable.
记忆中的 自我和体验中的自我的利益之间存在冲突的可能性,原来是一个比我最初想象的更难的问题。在一个早期的实验中,即冷手研究中,持续时间的忽略和峰值结束规则的结合导致了明显荒谬的选择。为什么人们会愿意让自己承受不必要的痛苦?我们的受试者把选择权留给了记忆中的自己,他们更愿意重复 ,留下更好的记忆的试验,尽管它涉及更多的痛苦。在极端情况下,根据记忆的质量进行选择可能是合理的,例如,当创伤后压力是一种可能性时,但冷手 ,这种经历并不是创伤性的。一个客观的观察者为别人做选择时,无疑会选择短期暴露,偏向于受难者的体验自我。人们为自己做出的 选择被公平地描述为错误。在对故事的评价中,无论是在歌剧院还是在对甄嬛传的判断中,持续时间的忽视和峰回路转的规则,同样是无法辩解的。以最后时刻来评价整个生命,或者在决定哪种生命更可取时不重视持续时间,都是没有意义的。
The remembering self is a construction of System 2. However, the distinctive features of the way it evaluates episodes and lives are characteristics of our memory. Duration neglect and the peak-end rule originate in System 1 and do not necessarily correspond to the values of System 2. We believe that duration is important, but our memory tells us it is not. The rules that govern the evaluation of the past are poor guides for decision making, because time does matter. The central fact of our existence is that time is the ultimate finite resource, but the remembering self ignores that reality. The neglect of duration combined with the peak-end rule causes a bias that favors a short period of intense joy over a long period of moderate happiness. The mirror image of the same bias makes us fear a short period of intense but tolerable suffering more than we fear a much longer period of moderate pain. Duration neglect also makes us prone to accept a long period of mild unpleasantness because the end will be better, and it favors giving up an opportunity for a long happy period if it is likely to have a poor ending. To drive the same idea to the point of discomfort, consider the common admonition, “Don’t do it, you will regret it.” The advice sounds wise because anticipated regret is the verdict of the remembering self and we are inclined to accept such judgments as final and conclusive. We should not forget, however, that the perspective of the remembering self is not always correct. An objective observer of the hedonimeter profile, with the interests of the experiencing self in mind, might well offer different advice. The remembering self’s neglect of duration, its exaggerated emphasis on peaks and ends, and its susceptibility to hindsight combine to yield distorted reflections of our actual experience.
记忆中的自我是系统2的一个 建设。然而,它评价事件和生活的方式的独特特征是我们记忆的特点。忽视持续时间和高峰期规则源于系统1,不一定与系统2的价值相对应。我们认为持续时间很重要,但我们的记忆告诉我们它并不重要。管理对过去的评价的规则是决策的糟糕指南 ,因为时间确实很重要。我们存在的核心事实是,时间是最终的有限资源,但记忆中的自己却忽略了这个现实。对持续时间的忽视加上峰回路转的规则导致了一种偏向,即倾向于短时间的强烈快乐而不是长时间的适度快乐。同一偏见的镜像使我们害怕短时间的强烈 ,但可以忍受的痛苦,而不是害怕长时间的中等程度的痛苦。对持续时间的忽视也使我们容易接受长时间的轻微不愉快,因为结局会更好,而且如果可能会有一个糟糕的结局,它更倾向于放弃一个长时间的快乐的机会。为了将同样的想法推向不愉快的地步,考虑一下常见的告诫:"不要做,你 ,会后悔的。"这个建议听起来很明智,因为预期的后悔是记忆中的自我的裁决,我们倾向于接受这样的判断,认为是最终的和结论性的。然而,我们不应忘记,记忆中的自己的观点并不总是正确的。一个客观观察享乐主义曲线的人,如果考虑到体验自我的利益,很可能会提供不同的建议 。记忆中的自我对持续时间的忽视,对高峰和终点的夸张强调,以及对事后诸葛亮的易感性,都会对我们的实际经验产生扭曲的反映。
In contrast, the duration-weighted conception of well-being treats all moments of life alike, memorable or not. Some moments end up weighted more than others, either because they are memorable or because they are important. The time that people spend dwelling on a memorable moment should be included in its duration, adding to its weight. A moment can also gain importance by altering the experience of subsequent moments. For example, an hour spent practicing the violin may enhance the experience of many hours of playing or listening to music years later. Similarly, a brief awful event that causes PTSD should be weighted by the total duration of the long-term misery it causes. In the duration-weighted perspective, we can determine only after the fact that a moment is memorable or meaningful. The statements “I will always remember …” or “this is a meaningful moment” should be taken as promises or predictions, which can be false—and often are—even when uttered with complete sincerity. It is a good bet that many of the things we say we will always remember will be long forgotten ten years later.
与此相反,持续时间加权的幸福感概念对生活中的所有时刻都一视同仁,无论是否令人难忘。有些时刻的权重最终会高于其他时刻,要么是因为它们令人难忘,要么是因为 它们很重要。人们花在一个难忘的时刻上的时间应该包括在它的持续时间内,增加它的权重。一个时刻也可以 ,通过改变后续时刻的经验来获得重要性。例如,一个小时的小提琴练习可能会增强多年后许多小时的演奏或听音乐的体验。同样地,一个导致 PTSD的短暂的可怕事件应该被它所造成的长期痛苦的总持续时间所加权。在持续时间加权的观点中,我们只有在事后才能确定某个时刻是令人难忘或有意义的。"我会永远记得...... "或 "这是一个有意义的时刻 "的说法应被视为承诺或预测,这些承诺或预测可能是错误的,而且往往是错误的,即使是在完全真诚的情况下说出来。这是 ,我们说我们将永远记得的许多事情在十年后将被遗忘。
The logic of duration weighting is compelling, but it cannot be considered a complete theory of well-being because individuals identify with their remembering self and care about their story. A theory of well-being that ignores what people want cannot be sustained. On the other hand, a theory that ignores what actually happens in people’s lives and focuses exclusively on what they think about their life is not tenable either. The remembering self and the experiencing self must both be considered, because their interests do not always coincide. Philosophers could struggle with these questions for a long time.
持续时间加权的逻辑是令人信服的,但它不能被认为是一个完整的幸福理论,因为个人认同他们记忆中的自我,关心他们的故事。一个忽视了人们想要什么的幸福理论是不能成立的。另一方面,一个忽视人们生活中实际发生的事情而只关注他们对自己生活的看法的理论 ,也是站不住脚的。记忆中的自我和体验中的自我都必须被考虑,因为他们的利益并不总是一致的。哲学家们可以在这些问题上纠结很久。
The issue of which of the two selves matters more is not a question only for philosophers; it has implications for policies in several domains, notably medicine and welfare. Consider the investment that should be made in the treatment of various medical conditions, including blindness, deafness, or kidney failure. Should the investments be determined by how much people fear these conditions? Should investments be guided by the suffering that patients actually experience? Or should they follow the intensity of the patients’ desire to be relieved from their condition and by the sacrifices that they would be willing to make to achieve that relief? The ranking of blindness and deafness, or of colostomy and dialysis, might well be different depending on which measure of the severity of suffering is used. No easy solution is in sight, but the issue is too important to be ignored .
两个自我中的哪一个更重要,这不是一个仅适用于哲学家的问题 ;它对几个领域的政策都有影响,特别是医学和福利。考虑一下在治疗各种医疗状况方面应该进行的投资,包括失明、失聪或肾衰竭。投资是否应该由人们对这些疾病的恐惧程度来决定?投资是否应该以病人实际经历的痛苦为指导? 还是应该遵循病人对缓解病情的强烈愿望,以及他们为实现这种缓解而愿意做出的牺牲?失明和失聪,或者结肠造口术和透析的排名可能会有所不同,这取决于使用哪种衡量痛苦严重程度的标准。目前看不到任何简单的解决方案,但这个问题 太重要了,不能忽视 。
The possibility of using measures of well-being as indicators to guide government policies has attracted considerable recent interest, both among academics and in several governments in Europe. It is now conceivable, as it was not even a few years ago, that an index of the amount of suffering in society will someday be included in national statistics, along with measures of unemployment, physical disability, and income. This project has come a long way.
最近,在学术界和欧洲的一些政府中,使用幸福指数作为指标来 指导政府政策 的可能性已经引起了相当大的兴趣。现在可以想象,就像几年前一样,社会中的痛苦数量的指数有一天会和失业、身体 残疾和收入的措施一起被纳入国家统计。这个项目已经取得了很大的进展。
ECONS AND HUMANS
生态环境 和人类
In everyday speech, we call people reasonable if it is possible to reason with them, if their beliefs are generally in tune with reality, and if their preferences are in line with their interests and their values. The word rational conveys an image of greater deliberation, more calculation, and less warmth, but in common language a rational person is certainly reasonable. For economists and decision theorists, the adjective has an altogether different meaning. The only test of rationality is not whether a person’s beliefs and preferences are reasonable, but whether they are internally consistent. A rational person can believe in ghosts so long as all her other beliefs are consistent with the existence of ghosts. A rational person can prefer being hated over being loved, so long as his preferences are consistent. Rationality is logical coherence—reasonable or not. Econs are rational by this definition, but there is overwhelming evidence that Humans cannot be. An Econ would not be susceptible to priming, WYSIATI, narrow framing, the inside view, or preference reversals, which Humans cannot consistently avoid.
在日常讲话中,如果有可能与人讲道理,如果他们的信仰总体上与现实相符,如果他们的偏好与他们的利益和价值观相符,我们就称他们为合理的人。 理性 这个词给人的印象是更多的深思熟虑,更多的计算,更少的温情,但在常见的 ,一个理性的人肯定是合理的。对于经济学家和决策理论家来说,这个形容词有着完全不同的含义。检验理性的唯一标准不是一个人的信念和偏好是否合理,而是它们是否内部一致。一个理性的人可以相信有鬼,只要她所有的其他信仰都与鬼的存在相一致。 一个理性的人可以选择被人讨厌而不是被人爱,只要他的偏好是一致的。理性是逻辑上的一致性--无论是否合理。根据这个定义,经济学人是理性的,但有大量的证据表明人类不可能是理性的。生态体不会受到引诱、所见即所得、狭隘的框架、内部观点或偏好逆转的影响,而人类无法持续避免这些。
The definition of rationality as coherence is impossibly restrictive; it demands adherence to rules of logic that a finite mind is not able to implement. Reasonable people cannot be rational by that definition, but they should not be branded as irrational for that reason. Irrational is a strong word , which connotes impulsivity, emotionality, and a stubborn resistance to reasonable argument. I often cringe when my work with Amos is credited with demonstrating that human choices are irrational, when in fact our research only showed that Humans are not well described by the rational-agent model.
将理性定义为连贯性是不可能的限制性的;它要求遵守一个有限的头脑无法实施的逻辑规则。根据这个定义,合理的人不可能是理性的,但他们不应该因此而被打上非理性的烙印。 非理性 是一个强烈的词 ,它意味着冲动、情绪化,以及对合理论证的顽固抵抗。我 ,当我和阿莫斯的工作被认为证明了人类的选择是非理性的时候,我经常感到害怕,而事实上我们的研究只是表明人类并没有被理性代理模型很好地描述。
Although Humans are not irrational, they often need help to make more accurate judgments and better decisions, and in some cases policies and institutions can provide that help. These claims may seem innocuous, but they are in fact quite controversial. As interpreted by the important Chicago school of economics, faith in human rationality is closely linked to an ideology in which it is unnecessary and even immoral to protect people against their choices. Rational people should be free, and they should be responsible for taking care of themselves. Milton Friedman, the leading figure in that school, expressed this view in the title of one of his popular books: Free to Choose .
尽管人类并非不理性,但他们往往需要帮助来做出更准确的判断和更好的决定,而在某些情况下,政策和机构可以提供这种帮助。这些主张 ,似乎无伤大雅,但事实上却颇有争议。按照重要的芝加哥经济学派的解释,对人类理性的信仰与一种意识形态密切相关,在这种意识形态中,保护人们的选择是不必要的,甚至是不道德的。理性的人应该是自由的,他们应该负责照顾自己。米尔顿-弗里德曼(Milton Friedman)是 该学派的领军人物,他在他的一本流行书的标题中表达了这种观点。 自由选择 。
The assumption that agents are rational provides the intellectual foundation for the libertarian approach to public policy: do not interfere with the individual’s right to choose, unless the choices harm others. Libertarian policies are further bolstered by admiration for the efficiency of markets in allocating goods to the people who are willing to pay the most for them. A famous example of the Chicago approach is titled A Theory of Rational Addiction ; it explains how a rational agent with a strong preference for intense and immediate gratification may make the rational decision to accept future addiction as a consequence. I once heard Gary Becker, one of the authors of that article, who is also a Nobel laureate of the Chicago school, argue in a lighter vein, but not entirely as a joke, that we should consider the possibility of explaining the so-called obesity epidemic by people’s belief that a cure for diabetes will soon become available. He was making a valuable point: when we observe people acting in ways that seem odd, we should first examine the possibility that they have a good reason to do what they do. Psychological interpretations should only be invoked when the reasons become implausible—which Becker’s explanation of obesity probably is.
代理人是理性的这一假设为公共政策的自由主义方法提供了知识基础:不干涉个人的选择权,除非这些选择损害了他人。对市场将 ,分配给愿意为之付出最多的人的效率的推崇,进一步加强了自由主义 政策。芝加哥方法的一个著名例子是名为 《理性成瘾理论 》;它解释了一个对强烈的即时满足有强烈偏好的理性代理人如何做出理性决定, 接受未来成瘾 的后果。我曾经听过加里-贝克尔,那篇文章的作者之一,也是芝加哥学派的诺贝尔 获得者,以轻松的口吻(但不完全是玩笑)认为,我们应该考虑用人们相信很快就能治愈糖尿病来解释所谓肥胖症流行的可能性。他提出了一个有价值的观点:当我们观察到人们的行为似乎很奇怪时,我们应该首先审查他们是否有一个很好的理由 ,来做他们的事情。只有当理由变得不可信时,才应援引心理学的解释--贝克尔对肥胖症的解释可能就是如此。
In a nation of Econs, government should keep out of the way, allowing the Econs to act as they choose, so long as they do not harm others. If a motorcycle rider chooses to ride without a helmet, a libertarian will support his right to do so. Citizens know what they are doing, even when they choose not to save for their old age, or when they expose themselves to addictive substances. There is sometimes a hard edge to this position: elderly people who did not save enough for retirement get little more sympathy than someone who complains about the bill after consuming a large meal at a restaurant. Much is therefore at stake in the debate between the Chicago school and the behavioral economists, who reject the extreme form of the rational-agent model. Freedom is not a contested value; all the participants in the debate are in favor of it. But life is more complex for behavioral economists than for true believers in human rationality. No behavioral economist favors a state that will force its citizens to eat a balanced diet and to watch only television programs that are good for the soul. For behavioral economists, however, freedom has a cost, which is borne by individuals who make bad choices, and by a society that feels obligated to help them. The decision of whether or not to protect individuals against their mistakes therefore presents a dilemma for behavioral economists. The economists of the Chicago school do not face that problem, because rational agents do not make mistakes. For adherents of this school, freedom is free of charge.
在一个由生态人组成的国家里,政府应该置身事外,允许生态人按照他们的选择行事,只要他们不伤害他人。如果一个骑摩托车的人选择不戴头盔骑车,自由主义者会支持他这样做的权利, 。公民知道他们在做什么,即使他们选择不为自己的老年生活储蓄,或者当他们将自己暴露在成瘾物质中时。这一立场有时会有一个硬伤:没有为退休储蓄足够资金的老人比在餐馆吃完大餐后抱怨账单的人得到更多的同情。因此,在芝加哥学派和行为经济学家之间的 辩论中,有很多东西是利害攸关的,因为他们拒绝理性代理模式的极端形式。自由不是一个有争议的价值;辩论中的所有参与者都赞成自由。但是,对于行为经济学家来说,生活比人类理性的真正信仰者更加复杂。没有哪个行为经济学家赞成一个国家强迫其公民吃均衡的饮食 ,只看对灵魂有益的电视节目。然而,对于行为经济学家来说,自由是有成本的,这个成本由做出错误选择的个人和认为有义务帮助他们的社会来承担。因此,是否保护个人不犯错误的决定,给行为经济学家带来了两难的选择。芝加哥学派的经济学家并不 ,因为理性的代理人不会犯错。对于这个学派的信徒来说,自由是免费的。
In 2008 the economist Richard Thaler and the jurist Cass Sunstein teamed up to write a book, Nudge , which quickly became an international bestseller and the bible of behavioral economics. Their book introduced several new words into the language, including Econs and Humans. It also presented a set of solutions to the dilemma of how to help people make good decisions without curtailing their freedom. Thaler and Sunstein advocate a position of libertarian paternalism, in which the state and other institutions are allowed to nudge people to make decisions that serve their own long-term interests. The designation of joining a pension plan as the default option is an example of a nudge. It is difficult to argue that anyone’s freedom is diminished by being automatically enrolled in the plan, when they merely have to check a box to opt out. As we saw earlier, the framing of the individual’s decision—Thaler and Sunstein call it choice architecture—has a huge effect on the outcome. The nudge is based on sound psychology, which I described earlier. The default option is naturally perceived as the normal choice. Deviating from the normal choice is an act of commission, which requires more effortful deliberation, takes on more responsibility, and is more likely to evoke regret than doing nothing. These are powerful forces that may guide the decision of someone who is otherwise unsure of what to do.
2008年,经济学家理查德-塞勒(Richard Thaler)和法学家卡斯-桑斯坦(Cass Sunstein)合作写了一本书《 Nudge 》,这本书迅速成为国际畅销书和行为经济学的圣经。他们的书在语言中引入了几个新词,包括Econs和Humans。 它还 ,提出了一套解决方案,以解决如何帮助人们做出好的决定而又不限制他们的自由这一难题。Thaler和Sunstein主张采取自由主义的家长制立场,即允许国家和其他机构 鼓励 人们做出有利于他们自身长期利益的决定。将加入养老金计划指定为默认选项是一个 ,这是一个鼓励的例子。很难说任何人的自由会因为自动加入该计划而被削弱,因为他们只需要勾选一个选项就可以退出了。正如我们前面所看到的,个人决定的框架--哈利和桑斯坦称之为选择架构--对结果有巨大影响。催促是基于合理的心理学,我在前面描述过。默认选项 ,自然被认为是正常的选择。偏离正常选择是一种委托行为,它需要更多的努力思考,承担更多的责任,并且比什么都不做更有可能引起后悔。这些都是强大的力量,可能会指导一个在其他方面不确定该怎么做的人的决定。
Humans, more than Econs, also need protection from others who deliberately exploit their weaknesses—and especially the quirks of System 1 and the laziness of System 2. Rational agents are assumed to make important decisions carefully, and to use all the information that is provided to them. An Econ will read and understand the fine print of a contract before signing it, but Humans usually do not. An unscrupulous firm that designs contracts that customers will routinely sign without reading has considerable legal leeway in hiding important information in plain sight. A pernicious implication of the rational-agent model in its extreme form is that customers are assumed to need no protection beyond ensuring that the relevant information is disclosed. The size of the print and the complexity of the language in the disclosure are not considered relevant—an Econ knows how to deal with small print when it matters. In contrast, the recommendations of Nudge require firms to offer contracts that are sufficiently simple to be read and understood by Human customers. It is a good sign that some of these recommendations have encountered significant opposition from firms whose profits might suffer if their customers were better informed. A world in which firms compete by offering better products is preferable to one in which the winner is the firm that is best at obfuscation.
人类比生态人更需要保护,以免被其他人 ,故意利用他们的弱点--尤其是系统1的怪癖和系统2的懒惰。理性的代理人被认为会谨慎地做出重要决定,并使用提供给他们的所有信息。一个经济学人在签署合同之前会阅读并理解合同的细部内容,但人类通常不会这样做。一家无良公司设计的合同,客户 ,通常会在不阅读的情况下签署,在法律上有相当大的余地将重要信息隐藏在众目睽睽之下。极端形式的理性代理模式的一个有害影响是,除了确保相关信息被披露外,客户被认为不需要任何保护。印刷品的大小和披露的语言的复杂性被认为是无关紧要的-- Econ知道如何在重要的时候处理小字。相反, Nudge 的建议要求公司提供足够简单的合同,以便人类客户阅读和理解。这是一个好兆头,其中一些建议遇到了来自公司的重大反对,如果他们的客户得到更好的信息,他们的利润可能会受到影响。一个由 公司通过提供更好的产品进行竞争的世界,比一个赢家是最擅长混淆视听的公司的世界要好。
A remarkable feature of libertarian paternalism is its appeal across a broad political spectrum. The flagship example of behavioral policy, called Save More Tomorrow, was sponsored in Congress by an unusual coalition that included extreme conservatives as well as liberals. Save More Tomorrow is a financial plan that firms can offer their employees. Those who sign on allow the employer to increase their contribution to their saving plan by a fixed proportion whenever they receive a raise. The increased saving rate is implemented automatically until the employee gives notice that she wants to opt out of it. This brilliant innovation, proposed by Richard Thaler and Shlomo Benartzi in 2003, has now improved the savings rate and brightened the future prospects of millions of workers. It is soundly based in the psychological principles that readers of this book will recognize. It avoids the resistance to an immediate loss by requiring no immediate change; by tying increased saving to pay raises, it turns losses into foregone gains, which are much easier to bear; and the feature of automaticity aligns the laziness of System 2 with the long-term interests of the workers. All this, of course, without compelling anyone to do anything he does not wish to do and without any misdirection or artifice.
自由主义家长制的一个显著特点是它在广泛的政治光谱中具有吸引力。行为政策的旗舰案例,名为 "拯救更多的明天",是由一个不寻常的联盟在国会发起的,该联盟包括极端保守派和自由派。 Save More Tomorrow 是一个企业可以为其员工提供的财务计划。签署该计划的人允许雇主在他们获得加薪时按固定比例增加他们对储蓄计划的贡献。增加的储蓄率会自动执行,直到雇员通知她想退出。这项由理查德-塞勒和 Shlomo Benartzi在2003年提出的杰出创新,现在已经提高了储蓄率,并照亮了数百万工人的未来前景。它健全地建立在本书读者会认可的心理学原理上。它通过不要求立即改变,避免了对直接损失的抵制;通过将增加储蓄与加薪挂钩,它将损失变成了放弃的收益,这就更容易承受; ,自动性的特点使系统2的懒惰与工人的长期利益相一致。当然,所有这些都没有强迫任何人去做他不愿意做的事情,也没有任何误导或诡计。
The appeal of libertarian paternalism has been recognized in many countries, including the UK and South Korea, and by politicians of many stripes, including Tories and the Democratic administration of President Obama. Indeed, Britain’s government has created a new small unit whose mission is to apply the principles of behavioral science to help the government better accomplish its goals. The official name for this group is the Behavioural Insight Team, but it is known both in and out of government simply as the Nudge Unit. Thaler is an adviser to this team.
自由主义家长制的吸引力已经在许多国家得到认可,包括英国和韩国,以及许多政治家,包括保守党 和奥巴马总统的民主党政府。事实上,英国政府已经建立了一个新的小单位,其任务是应用行为科学的原则来帮助政府更好地实现其目标。这个小组的正式名称是 "行为洞察小组",但它在政府内外都被称为 "推动小组"。塞勒是这个小组的顾问。
In a storybook sequel to the writing of Nudge , Sunstein was invited by President Obama to serve as administrator of the Office of Information and Regulatory Affairs, a position that gave him considerable opportunity to encourage the application of the lessons of psychology and behavioral economics in government agencies. The mission is described in the 2010 Report of the Office of Management and Budget. Readers of this book will appreciate the logic behind specific recommendations, including encouraging “clear, simple, salient, and meaningful disclosures.” They will also recognize background statements such as “presentation greatly matters; if, for example, a potential outcome is framed as a loss, it may have more impact than if it is presented as a gain.”
在《 Nudge 》写作的故事书续集中,孙斯坦被奥巴马总统邀请担任信息和监管事务办公室的管理员,这个职位给了他相当多的机会来鼓励在政府机构中应用心理学和行为经济学的经验教训。该任务在2010年管理和 预算办公室的报告中有所描述。本书的读者会欣赏具体建议背后的逻辑,包括鼓励 "清晰、简单、突出和有意义的披露"。他们也会认识到诸如 "表述非常重要;例如,如果一个潜在的结果被表述为损失,它可能比被表述为收益更有影响力 "等背景声明。
The example of a regulation about the framing of disclosures concerning fuel consumption was mentioned earlier. Additional applications that have been implemented include automatic enrollment in health insurance, a new version of the dietary guidelines that replaces the incomprehensible Food Pyramid with the powerful image of a Food Plate loaded with a balanced diet, and a rule formulated by the USDA that permits the inclusion of messages such as “90% fat-free” on the label of meat products, provided that the statement “10% fat” is also displayed “contiguous to, in lettering of the same color, size, and type as, and on the same color background as, the statement of lean percentage.” Humans, unlike Econs, need help to make good decisions, and there are informed and unintrusive ways to provide that help.
前面提到了一个法规 ,关于燃料消耗的信息披露的框架。其他已经实施的应用包括医疗保险的自动登记,新版的饮食指南,用充满平衡饮食的 "食物盘 "的强大形象取代了难以理解的 "食物金字塔",以及美国农业部制定的规则,允许在肉类产品的标签上包含 "90%无脂肪 "等信息,只要 "10%脂肪 "的声明也显示在 "与瘦肉百分比声明相邻,颜色、大小和字体相同,且背景颜色相同"。人类与生态环境不同,需要帮助才能做出正确的决定,而有一些知情的、不受干扰的方法可以提供这种帮助。
TWO SYSTEMS
两套系统
This book has described the workings of the mind as an uneasy interaction between two fictitious characters: the automatic System 1 and the effortful System 2. You are now quite familiar with the personalities of the two systems and able to anticipate how they might respond in different situations. And of course you also remember that the two systems do not really exist in the brain or anywhere else. “System 1 does X” is a shortcut for “X occurs automatically.” And “System 2 is mobilized to do Y” is a shortcut for “arousal increases, pupils dilate, attention is focused, and activity Y is performed.” I hope you find the language of systems as helpful as I do, and that you have acquired an intuitive sense of how they work without getting confused by the question of whether they exist. Having delivered this necessary warning, I will continue to use the language to the end.
本书将思维的运作描述为两个虚构人物之间的不安互动:自动系统1和努力系统2。你现在对这两个系统的个性相当熟悉,能够预测它们在不同情况下可能做出的反应。当然,你也记得这两个系统并不真正存在于大脑或其他地方。 "系统1做X "是 "X自动发生 "的一个捷径。而 "系统2被动员起来做Y "则是 "唤醒增加,瞳孔放大,注意力集中,并进行Y活动 "的捷径。我希望你和我一样发现系统的语言是有帮助的,而且你已经获得了对它们如何工作的直观感觉,而不会被它们是否存在的问题所迷惑。在 这个必要的警告之后,我将继续使用这种语言,直到最后。
The attentive System 2 is who we think we are. System 2 articulates judgments and makes choices, but it often endorses or rationalizes ideas and feelings that were generated by System 1. You may not know that you are optimistic about a project because something about its leader reminds you of your beloved sister, or that you dislike a person who looks vaguely like your dentist. If asked for an explanation, however, you will search your memory for presentable reasons and will certainly find some. Moreover, you will believe the story you make up. But System 2 is not merely an apologist for System 1; it also prevents many foolish thoughts and inappropriate impulses from overt expression. The investment of attention improves performance in numerous activities—think of the risks of driving through a narrow space while your mind is wandering—and is essential to some tasks, including comparison, choice, and ordered reasoning. However, System 2 is not a paragon of rationality. Its abilities are limited and so is the knowledge to which it has access. We do not always think straight when we reason, and the errors are not always due to intrusive and incorrect intuitions. Often we make mistakes because we (our System 2) do not know any better.
殷勤的系统2是我们认为自己是谁。系统2阐述判断并做出选择,但它经常认可或合理化由系统1产生的想法和感觉。你可能不知道你对一个项目持乐观态度是因为这个项目的领导者让你想起了你心爱的姐姐, ,或者你不喜欢一个长得有点像你的牙医的人。然而,如果要求你解释,你会在记忆中寻找可供展示的理由,并且一定会找到一些。此外,你会相信自己编造的故事。但是,系统2不仅仅是系统1的辩护人;它还能防止许多愚蠢的想法和不适当的冲动公开表达出来。注意力的投入 ,提高了许多活动的表现--想想看,当你的思想在徘徊时,开车通过一个狭窄的空间的风险--对一些任务来说是必不可少的,包括比较、选择和有序推理。然而,系统2不是理性的典范。它的能力是有限的,它所获得的知识也是有限的。我们在推理时并不总是直截了当地思考,而 ,错误并不总是由于侵入性的、不正确的直觉造成的。我们之所以犯错,往往是因为我们(我们的系统2)并不了解情况。
I have spent more time describing System 1, and have devoted many pages to errors of intuitive judgment and choice that I attribute to it. However, the relative number of pages is a poor indicator of the balance between the marvels and the flaws of intuitive thinking. System 1 is indeed the origin of much that we do wrong, but it is also the origin of most of what we do right—which is most of what we do. Our thoughts and actions are routinely guided by System 1 and generally are on the mark. One of the marvels is the rich and detailed model of our world that is maintained in associative memory: it distinguishes surprising from normal events in a fraction of a second, immediately generates an idea of what was expected instead of a surprise, and automatically searches for some causal interpretation of surprises and of events as they take place.
我花了更多的时间来描述系统1,并花了许多 ,来说明我归因于它的直觉判断和选择的错误。然而,相对的页数并不能说明直觉 思维的奇迹和缺陷之间的平衡。系统1确实是我们做错事的源头,但它也是我们做对事的源头--也就是我们所做的大部分事。我们的思想和行动经常受到系统1的指导,一般来说都是正确的。其中一个奇迹是联想记忆中保存的丰富而详细的世界模型:它能在几分之一秒内将令人惊讶的事件与正常事件区分开来 ,立即产生一个预期的想法而不是惊讶,并自动寻找对惊讶和事件发生时的一些因果解释。
Memory also holds the vast repertory of skills we have acquired in a lifetime of practice, which automatically produce adequate solutions to challenges as they arise, from walking around a large stone on the path to averting the incipient outburst of a customer. The acquisition of skills requires a regular environment, an adequate opportunity to practice, and rapid and unequivocal feedback about the correctness of thoughts and actions. When these conditions are fulfilled, skill eventually develops, and the intuitive judgments and choices that quickly come to mind will mostly be accurate. All this is the work of System 1, which means it occurs automatically and fast. A marker of skilled performance is the ability to deal with vast amounts of information swiftly and efficiently.
记忆中还保存着我们在一生的实践中所获得的大量技能,这些技能会在挑战出现时自动产生适当的解决方案,从绕着大石头走 ,到避免顾客刚开始的爆发。技能的获得需要一个有规律的环境,一个充分的练习机会,以及关于思想和行动正确性的快速和明确的反馈。当这些条件得到满足时,技能最终会得到发展,而迅速出现在脑海中的直觉判断和选择大多会是准确的。所有这些 ,都是系统1的工作,这意味着它自动而快速地发生。熟练表现的一个标志是迅速有效地处理大量信息的能力。
When a challenge is encountered to which a skilled response is available, that response is evoked. What happens in the absence of skill? Sometimes, as in the problem 17 × 24 = ?, which calls for a specific answer, it is immediately apparent that System 2 must be called in. But it is rare for System 1 to be dumbfounded. System 1 is not constrained by capacity limits and is profligate in its computations. When engaged in searching for an answer to one question, it simultaneously generates the answers to related questions, and it may substitute a response that more easily comes to mind for the one that was requested. In this conception of heuristics, the heuristic answer is not necessarily simpler or more frugal than the original question—it is only more accessible, computed more quickly and easily. The heuristic answers are not random, and they are often approximately correct. And sometimes they are quite wrong.
当遇到有技巧的反应的挑战时,就会唤起这种反应。在没有技巧的情况下会发生什么?有时,如17×24=?的问题,需要一个特定的答案, ,很明显,系统2必须被调用。但是,系统1被吓呆的情况很少。系统1不受容量限制的约束,在计算中很放肆。当参与寻找一个问题的答案时,它同时产生相关问题的答案,它可能会用一个更容易想到的答案来替代 。在这种启发式的概念中,启发式的答案不一定比原来的问题更简单或更节俭--它只是更容易获得,更快速和更容易计算。启发式答案并不是随机的,而且它们往往是近似正确的。有时它们也是完全错误的。
System 1 registers the cognitive ease with which it processes information, but it does not generate a warning signal when it becomes unreliable. Intuitive answers come to mind quickly and confidently, whether they originate from skills or from heuristics. There is no simple way for System 2 to distinguish between a skilled and a heuristic response. Its only recourse is to slow down and attempt to construct an answer on its own, which it is reluctant to do because it is indolent. Many suggestions of System 1 are casually endorsed with minimal checking, as in the bat-and-ball problem. This is how System 1 acquires its bad reputation as the source of errors and biases. Its operative features, which include WYSIATI, intensity matching, and associative coherence, among others, give rise to predictable biases and to cognitive illusions such as anchoring, nonregressive predictions, overconfidence, and numerous others.
系统1记录了它处理信息的认知便利性,但它 ,当它变得不可靠时,不会产生警告信号。直观的答案迅速而自信地出现在脑海中,无论它们是来自于技能还是启发式方法。系统2没有简单的方法来 ,以区分熟练的和启发式的反应。它唯一的办法是放慢速度,试图自己构建一个答案,但它不愿意这么做,因为它是懒惰的。 系统1的许多建议都是随意认可的,只做了最少的检查,如在球棒和球的问题上。这就是系统1获得其作为错误和偏见来源的坏名声的方式。它的操作特点,包括所见即所得、强度匹配和联想一致性等,引起了可预测的偏见和认知错觉,如锚定、非回归性预测、 过度自信,以及其他许多问题。
What can be done about biases? How can we improve judgments and decisions, both our own and those of the institutions that we serve and that serve us? The short answer is that little can be achieved without a considerable investment of effort. As I know from experience, System 1 is not readily educable. Except for some effects that I attribute mostly to age, my intuitive thinking is just as prone to overconfidence, extreme predictions, and the planning fallacy as it was before I made a study of these issues. I have improved only in my ability to recognize situations in which errors are likely: “This number will be an anchor …,” “The decision could change if the problem is reframed …” And I have made much more progress in recognizing the errors of others than my own.
对于偏见可以做些什么?我们如何才能改善我们自己的判断和决定,以及我们所服务的机构和为我们服务的机构的判断和决定?简短的回答是,如果不投入大量的精力,几乎不可能实现。根据我的经验,系统1是不容易被教育的。除了一些我主要归因于年龄的影响外, ,我的直觉思维和我对这些问题进行研究之前一样容易出现过度自信、极端预测和计划谬误。我只是在识别可能出现错误的情况的能力方面有所提高。"这个数字将是一个锚......","如果对问题进行重新规划,决定可能会改变......"而且我在识别别人的错误方面取得了更大的进步 ,而不是我自己的错误。
The way to block errors that originate in System 1 is simple in principle: recognize the signs that you are in a cognitive minefield, slow down, and ask for reinforcement from System 2. This is how you will proceed when you next encounter the Müller-Lyer illusion. When you see lines with fins pointing in different directions, you will recognize the situation as one in which you should not trust your impressions of length. Unfortunately, this sensible procedure is least likely to be applied when it is needed most. We would all like to have a warning bell that rings loudly whenever we are about to make a serious error, but no such bell is available, and cognitive illusions are generally more difficult to recognize than perceptual illusions. The voice of reason may be much fainter than the loud and clear voice of an erroneous intuition, and questioning your intuitions is unpleasant when you face the stress of a big decision. More doubt is the last thing you want when you are in trouble. The upshot is that it is much easier to identify a minefield when you observe others wandering into it than when you are about to do so. Observers are less cognitively busy and more open to information than actors. That was my reason for writing a book that is oriented to critics and gossipers rather than to decision makers.
阻止源自系统1的错误的方法原则上很简单:认识到你处于认知雷区的迹象,放慢速度,并要求系统2进行强化。当你下次遇到穆勒-莱尔错觉时,你将这样做。当你看到鳍状物指向不同方向的线条时,你会意识到这种情况下,你应该 ,不要相信你对长度的印象。不幸的是,这个明智的程序在最需要的时候却最不可能被应用。我们都希望有一个警告铃,在我们即将犯下严重错误的时候大声响起,但没有这样的铃,而且认知上的错觉通常比感知上的错觉更难识别。理性的声音可能比错误的直觉的响亮声音要微弱得多 ,当你面临重大决定的压力时,质疑你的直觉是令人不快的。当你陷入困境时,更多的怀疑是你最不希望看到的。结果是,当你观察到别人误入雷区时,要比你准备这样做时更容易识别雷区。与行动者相比,观察者在认知上没有那么忙,对信息的态度也更开放 。这就是我写一本面向批评家和八卦者而不是决策者的书的原因。
Organizations are better than individuals when it comes to avoiding errors, because they naturally think more slowly and have the power to impose orderly procedures. Organizations can institute and enforce the application of useful checklists, as well as more elaborate exercises, such as reference-class forecasting and the premortem. At least in part by providing a distinctive vocabulary , organizations can also encourage a culture in which people watch out for one another as they approach minefields. Whatever else it produces, an organization is a factory that manufactures judgments and decisions. Every factory must have ways to ensure the quality of its products in the initial design, in fabrication, and in final inspections. The corresponding stages in the production of decisions are the framing of the problem that is to be solved, the collection of relevant information leading to a decision, and reflection and review. An organization that seeks to improve its decision product should routinely look for efficiency improvements at each of these stages. The operative concept is routine. Constant quality control is an alternative to the wholesale reviews of processes that organizations commonly undertake in the wake of disasters. There is much to be done to improve decision making. One example out of many is the remarkable absence of systematic training for the essential skill of conducting efficient meetings.
在避免 错误方面,组织比个人要好,因为他们自然会思考得更慢,并有能力强加有序的程序。组织 可以建立并强制 应用有用的检查表,以及 更为精细的练习,如参考类预测和预验收。至少在某种程度上,通过提供 独特的词汇 ,组织还可以鼓励一种文化,使人们在接近雷区时相互照应。无论组织生产什么,它都是一个制造判断和决策的工厂。每个工厂都必须有办法在最初的设计、制造和最后的检查中确保其产品的质量 。决策生产的相应阶段是对要解决的问题进行构思,收集导致决策的相关信息,以及思考和审查。一个组织如果想改善其决策产品,就应该在这些阶段的每一个 ,寻求效率的提高。操作性的概念是常规。持续的质量控制是对组织在灾难发生后通常进行的全盘审查的一种替代。在改善决策方面还有很多工作要做。许多例子中的一个就是明显缺乏对主持高效会议这一基本技能的系统培训。
Ultimately, a richer language is essential to the skill of constructive criticism. Much like medicine, the identification of judgment errors is a diagnostic task, which requires a precise vocabulary. The name of a disease is a hook to which all that is known about the disease is attached, including vulnerabilities, environmental factors, symptoms, prognosis, and care. Similarly, labels such as “anchoring effects,” “narrow framing,” or “excessive coherence” bring together in memory everything we know about a bias, its causes, its effects, and what can be done about it.
归根结底,更丰富的 语言对建设性批评的技能至关重要。和医学一样,识别判断错误是一项诊断任务,需要一个精确的词汇。一种疾病的名称是一个钩子,所有关于该疾病的已知信息都被附加在上面,包括脆弱性、环境因素、症状、预后和护理。同样,诸如 "锚定效应"、 "狭义框架 "或 "过度连贯性 "等标签将我们所知道的关于偏见、其原因、其影响以及可以采取的措施的一切都集中在记忆中。
There is a direct link from more precise gossip at the watercooler to better decisions. Decision makers are sometimes better able to imagine the voices of present gossipers and future critics than to hear the hesitant voice of their own doubts. They will make better choices when they trust their critics to be sophisticated and fair, and when they expect their decision to be judged by how it was made, not only by how it turned out.
更精确的饮水机上的八卦与更好的决策有直接联系。决策者有时能更好地想象现在的流言蜚语者和未来的批评者的声音,而不是听到他们自己怀疑的犹豫不决的声音。 当他们相信他们的批评者是成熟和公正的,当他们期望他们的决定被评判为是如何做出的,而不仅仅是结果如何,他们会做出更好的选择。
Appendix A: Judgment Under Uncertainty: Heuristics and Biases fn1
附录A: 不确定情况下的判断:启发式和偏见fn1
Amos Tversky and Daniel Kahneman
Amos Tversky和Daniel Kahneman
Many decisions are based on beliefs concerning the likelihood of uncertain events such as the outcome of an election, the guilt of a defendant, or the future value of the dollar. These beliefs are usually expressed in statements such as “I think that …,” “chances are …,” “it is unlikely that …,” and so forth. Occasionally, beliefs concerning uncertain events are expressed in numerical form as odds or subjective probabilities. What determines such beliefs? How do people assess the probability of an uncertain event or the value of an uncertain quantity? This article shows that people rely on a limited number of heuristic principles which reduce the complex tasks of assessing probabilities and predicting values to simpler judgmental operations. In general, these heuristics are quite useful, but sometimes they lead to severe and systematic errors.
许多决定都是基于对不确定事件的可能性的信念,如选举结果、被告的罪责或美元的未来价值。这些信念通常用 "我认为......"、"机会是......"、"不太可能...... "等语句来表达, 等。偶尔,关于不确定事件的信念也会以数字形式表达为赔率或主观概率。是什么决定了这种信念?人们是如何评估一个不确定事件的概率或一个不确定数量的价值的?本文表明,人们依赖有限的启发式原则,这些原则将评估概率和 预测价值的复杂任务简化为较简单的判断操作。一般来说,这些启发式原则是相当有用的,但有时它们会导致严重的系统性错误。
The subjective assessment of probability resembles the subjective assessment of physical quantities such as distance or size. These judgments are all based on data of limited validity, which are processed according to heuristic rules. For example, the apparent distance of an object is determined in part by its clarity. The more sharply the object is seen, the closer it appears to be. This rule has some validity, because in any given scene the more distant objects are seen less sharply than nearer objects. However, the reliance on this rule leads to systematic errors in the estimation of distance. Specifically, distances are often overestimated when visibility is poor because the contours of objects are blurred. On the other hand, distances are often underestimated when visibility is good because the objects are seen sharply. Thus, the reliance on clarity as an indication of distance leads to common biases. Such biases are also found in the intuitive judgment of probability. This article describes three heuristics that are employed to assess probabilities and to predict values. Biases to which these heuristics lead are enumerated, and the applied and theoretical implications of these observations are discussed.
对概率的主观评估类似于对距离或大小等物理量的主观评估。这些判断都是基于有效性有限的数据,根据启发式规则进行处理。对于 ,一个物体的表面距离部分由其清晰度决定。物体被看得越清晰,它看起来就越近。这个规则有一定的合理性,因为在任何给定的场景中,较远的物体比较近的物体看得不清楚。然而,对这一规则的依赖导致了对距离估计的系统性错误。具体来说,当能见度低时,距离往往被高估, ,因为物体的轮廓是模糊的。另一方面,当能见度良好时,距离往往被低估,因为物体被看得很清楚。因此,依赖清晰度作为距离的指示会导致常见的偏差。在概率的直觉判断中也会出现这样的偏差。本文介绍了三种启发式方法,它们被用于 ,以评估概率和预测数值。列举了这些启发式方法 导致的偏差,并讨论了这些观察结果的应用和理论意义。
REPRESENTATIVENESS
代表性
Many of the probabilistic questions with which people are concerned belong to one of the following types: What is the probability that object A belongs to class B? What is the probability that event A originates from process B? What is the probability that process B will generate event A? In answering such questions, people typically rely on the representativeness heuristic, in which probabilities are evaluated by the degree to which A is representative of B, that is, by the degree to which A resembles B. For example, when A is highly representative of B, the probability that A originates from B is judged to be high. On the other hand, if A is not similar to B, the probability that A originates from B is judged to be low.
人们关心的许多概率问题都属于下列类型之一。物体A属于B类的概率是多少? 事件A起源于过程B的概率是多少?过程B产生事件A的概率是多少?在回答这类问题时,人们通常依靠代表性启发式,即通过A代表B的程度来评估概率,也就是说,通过A与B的相似程度来评估概率。例如,当A高度代表B时,判断A源自 B的概率就很高。另一方面,如果A与B不相似,则判断A源自B的概率就低。
For an illustration of judgment by representativeness, consider an individual who has been described by a former neighbor as follows: “Steve is very shy and withdrawn, invariably helpful, but with little interest in people, or in the world of reality. A meek and tidy soul, he has a need for order and structure, and a passion for detail.” How do people assess the probability that Steve is engaged in a particular occupation from a list of possibilities (for example, farmer, salesman, airline pilot, librarian, or physician)? How do people order these occupations from most to least likely? In the representativeness heuristic, the probability that Steve is a librarian, for example, is assessed by the degree to which he is representative of, or similar to, the stereotype of a librarian. Indeed, research with problems of this type has shown that people order the occupations by probability and by similarity in exactly the same way. 1 This approach to the judgment of probability leads to serious errors, because similarity, or representativeness, is not influenced by several factors that should affect judgments of probability.
为了说明代表性的判断,请考虑一个人,他被一个以前的邻居描述如下。"史蒂夫非常害羞和孤僻,总是乐于助人,但对人或对现实世界没有什么兴趣。一个温顺的 ,整洁的灵魂,他对秩序和结构有需求,对细节有热情"。人们如何从一连串的可能性中评估史蒂夫从事某种职业的概率(例如,农民、推销员、航空公司飞行员、图书管理员或医生)?人们是如何将这些职业从最有可能到最无可能排序的?在代表性启发式中,例如,史蒂夫是 图书管理员的概率,是通过他在多大程度上代表或类似于图书管理员的刻板印象来评估的。事实上,对这种类型问题的研究表明,人们以完全相同的方式按概率和相似性来排列职业。 1 这种判断概率的方法导致了严重的错误,因为相似性,或者说代表性,并没有受到几个应该影响概率判断的因素的影响, 。
Insensitivity to prior probability of outcomes . One of the factors that have no effect on representativeness but should have a major effect on probability is the prior probability, or base-rate frequency, of the outcomes. In the case of Steve, for example, the fact that there are many more farmers than librarians in the population should enter into any reasonable estimate of the probability that Steve is a librarian rather than a farmer. Considerations of base-rate frequency, however, do not affect the similarity of Steve to the stereotypes of librarians and farmers. If people evaluate probability by representativeness, therefore, prior probabilities will be neglected. This hypothesis was tested in an experiment where prior probabilities were manipulated. 2 Subjects were shown brief personality descriptions of several individuals, allegedly sampled at random from a group of 100 professionals—engineers and lawyers. The subjects were asked to assess, for each description, the probability that it belonged to an engineer rather than to a lawyer. In one experimental condition, subjects were told that the group from which the descriptions had been drawn consisted of 70 engineers and 30 lawyers. In another condition, subjects were told that the group consisted of 30 engineers and 70 lawyers. The odds that any particular description belongs to an engineer rather than to a lawyer should be higher in the first condition, where there is a majority of engineers, than in the second condition, where there is a majority of lawyers. Specifically, it can be shown by applying Bayes’ rule that the ratio of these odds should be (.7/.3) 2 , or 5.44, for each description. In a sharp violation of Bayes’ rule, the subjects in the two conditions produced essentially the same probability judgments. Apparently, subjects evaluated the likelihood that a particular description belonged to an engineer rather than to a lawyer by the degree to which this description was representative of the two stereotypes, with little or no regard for the prior probabilities of the categories.
对结果的先前概率不敏感 。对代表性没有影响,但对概率有重大影响的因素之一是结果的先验概率,或基准频率。以史蒂夫为例,在 的人口中,农民比图书管理员多得多,这一事实应该进入对史蒂夫是图书管理员而不是农民的概率的任何合理估计中。然而,对基数频率的考虑并不影响史蒂夫与图书馆员和农民的刻板印象的相似性。因此,如果人们通过代表性来评估概率,那么先前的概率就会被忽略掉。这一假设在一个实验中得到了验证 ,在这个实验中,先验概率被操纵。 2 受试者看到了几个人的简短个性描述,据说是从100名专业人员--工程师和律师--中随机抽取的。受试者被要求评估每个描述属于工程师而不是律师的概率。在一个实验条件下,受试者被告知,从描述中抽取的群体 ,其中包括70名工程师和30名律师。在另一个条件下,受试者被告知,该组由30名工程师和70名律师组成。在第一个条件下,任何特定的描述属于工程师而不是律师的几率应该比在第二个条件下更高,因为在第二个条件下, ,律师占大多数。具体来说,通过应用贝叶斯法则可以证明,这些几率的比率应该是(.7/.3 )2 ,或者说是5.44,对于每一个描述来说。在严重违反贝叶斯规则的情况下,两种条件下的受试者产生了基本相同的概率判断。 显然,受试者评估某一描述属于工程师的可能性,而不是属于 律师的可能性,其依据是该描述在多大程度上代表了这两种定型,很少或根本不考虑类别的先验概率。
The subjects used prior probabilities correctly when they had no other information. In the absence of a personality sketch, they judged the probability that an unknown individual is an engineer to be .7 and .3, respectively, in the two base-rate conditions. However, prior probabilities were effectively ignored when a description was introduced, even when this description was totally uninformative. The responses to the following description illustrate this phenomenon:
当被试没有其他信息时,他们正确地使用了先验概率。在没有人格素描的情况下,他们判断一个未知的人是工程师的概率分别是0.7和0.3,在 两个基数的条件下。然而,当引入描述时,先前的概率被有效地忽略了,即使这个描述完全没有信息。对以下描述的反应说明了这种现象。
Dick is a 30-year-old man. He is married with no children. A man of high ability and high motivation, he promises to be quite successful in his field. He is well liked by his colleagues.
迪克是一个30岁的男人。他已婚,没有孩子。他是一个能力强、积极性高的人,有望在他的领域取得相当的成功。他 ,很受同事的欢迎。
This description was intended to convey no information relevant to the question of whether Dick is an engineer or a lawyer. Consequently, the probability that Dick is an engineer should equal the proportion of engineers in the group, as if no description had been given. The subjects, however, judged the probability of Dick being an engineer to be .5 regardless of whether the stated proportion of engineers in the group was .7 or .3. Evidently, people respond differently when given no evidence and when given worthless evidence. When no specific evidence is given, prior probabilities are properly utilized; when worthless evidence is given, prior probabilities are ignored. 3
这一描述的目的是为了传达与迪克是工程师还是律师这一问题无关的信息。因此,迪克是工程师的概率应该等于该组中工程师的比例,就像没有给出描述一样。然而,受试者判断迪克是工程师的概率是0.5,而不考虑 ,无论所陈述的工程师在该组中的比例是0.7还是0.3。显而易见,人们在没有证据和没有价值的证据时的反应是不同的。当没有给出具体的证据时,先验概率被适当利用;当给出无价值的证据时,先验概率被忽略。 3
Insensitivity to sample size . To evaluate the probability of obtaining a particular result in a sample drawn from a specified population, people typically apply the representativeness heuristic. That is, they assess the likelihood of a sample result, for example, that the average height in a random sample of ten men will be 6 feet, by the similarity of this result to the corresponding parameter (that is, to the average height in the population of men). The similarity of a sample statistic to a population parameter does not depend on the size of the sample. Consequently, if probabilities are assessed by representativeness, then the judged probability of a sample statistic will be essentially independent of sample size. Indeed, when subjects assessed the distributions of average height for samples of various sizes, they produced identical distributions. For example, the probability of obtaining an average height greater than 6 feet was assigned the same value for samples of 1,000, 100, and 10 men. 4 Moreover, subjects failed to appreciate the role of sample size even when it was emphasized in the formulation of the problem. Consider the following question:
对样本大小不敏感 。为了评估从特定人群中抽取的样本获得特定 结果的概率,人们通常采用代表性启发式的方法。也就是说,他们评估一个样本结果的可能性,例如,随机抽取的10个男人的平均身高是6英尺,通过这个结果与相应的参数(即与男人群体的平均身高)的相似性来评估。样本 统计量与人口参数的相似性并不取决于样本的大小。因此,如果概率是通过代表性来评估的,那么一个样本统计量的判断概率将基本上与样本大小无关。事实上,当受试者评估各种规模的样本的平均身高分布时,他们产生了相同的分布。例如, ,对于1000个、100个和10个男人的样本,获得平均身高超过6英尺的概率被赋予相同的数值。 4 此外,即使在问题的表述中强调了样本大小的作用,受试者也没有意识到这一点。考虑一下下面的问题。
A certain town is served by two hospitals. In the larger hospital about 45 babies are born each day, and in the smaller hospital about 15 babies are born each day. As you know, about 50% of all babies are boys. However, the exact percentage varies from day to day. Sometimes it may be higher than 50%, sometimes lower.
某镇有两家医院。在大医院,每天大约有45个婴儿出生,而在小医院, ,每天大约有15个婴儿出生。如你所知,所有婴儿中约有50%是男孩。然而,确切的比例每天都在变化。有时它可能高于50%,有时则更低。
For a period of 1 year, each hospital recorded the days on which more than 60% of the babies born were boys. Which hospital do you think recorded more such days?
在一年的时间里,每家医院都记录了超过60%的新生婴儿是男孩的日子。你认为哪家医院记录了更多这样的日子?
The larger hospital (21)
大医院 (21)
The smaller hospital (21)
小型医院 (21)
About the same (that is, within 5% of each other) (53)
差不多(即彼此在5%以内)(53)
The values in parentheses are the number of undergraduate students who chose each answer.
括号内的数值是选择每个答案的本科生人数。
Most subjects judged the probability of obtaining more than 60% boys to be the same in the small and in the large hospital, presumably because these events are described by the same statistic and are therefore equally representative of the general population. In contrast, sampling theory entails that the expected number of days on which more than 60% of the babies are boys is much greater in the small hospital than in the large one, because a large sample is less likely to stray from 50%. This fundamental notion of statistics is evidently not part of people’s repertoire of intuitions.
大多数受试者判断在小医院和大医院获得超过60%的男孩的概率是一样的,大概是因为这些事件是由相同的统计学 ,因此同样是 ,代表了一般人群。相反,抽样理论要求小医院中超过60%的婴儿是男孩的预期天数要比大医院多得多,因为大的样本不太可能偏离50%。这一统计学的基本概念显然不属于人们的直觉范畴。
A similar insensitivity to sample size has been reported in judgments of posterior probability, that is, of the probability that a sample has been drawn from one population rather than from another. Consider the following example:
在后验概率的判断中也有类似的不敏感性 ,即一个样本从一个群体而不是从另一个群体抽取的概率。请看下面的例子。
Imagine an urn filled with balls, of which ⅔ are of one color and ⅓ of another. One individual has drawn 5 balls from the urn, and found that 4 were red and 1 was white. Another individual has drawn 20 balls and found that 12 were red and 8 were white. Which of the two individuals should feel more confident that the urn contains ⅔ red balls and ⅓ white balls, rather than the opposite? What odds should each individual give?
设想一个装满球的瓮,其中⅔是一种颜色,⅓是另一种颜色。一个人从瓮中抽出5个球,发现4个是红色,1个是白色。另一个人 ,他抽了20个球,发现12个是红色的,8个是白色的。这两个人中哪一个更有信心瓮里有⅔红球和⅓白球,而不是相反?每个人的赔率应该是多少?
In this problem, the correct posterior odds are 8 to 1 for the 4:1 sample and 16 to 1 for the 12:8 sample, assuming equal prior probabilities. However, most people feel that the first sample provides much stronger evidence for the hypothesis that the urn is predominantly red, because the proportion of red balls is larger in the first than in the second sample. Here again, intuitive judgments are dominated by the sample proportion and are essentially unaffected by the size of the sample, which plays a crucial role in the determination of the actual posterior odds. 5 In addition, intuitive estimates of posterior odds are far less extreme than the correct values. The underestimation of the impact of evidence has been observed repeatedly in problems of this type. 6 It has been labeled “conservatism.”
在这个问题上,假设先验概率相同,4:1样本的正确后验概率为8比1,12:8样本的正确后验概率为16比1。然而,大多数 ,认为第一个样本为骨灰盒以红色为主的假设提供了更有力的证据,因为第一个样本中红球的比例大于第二个样本。在这里,直觉判断又是由样本比例主导的,基本上不受样本大小的影响,而样本大小在决定实际的后验 赔率方面起着关键作用。 5 此外,对后验几率的直觉估计远没有正确的数值那么极端。在这种类型的问题中,已经反复观察到对证据影响的低估。 6 它被称为 "保守主义"。
Misconceptions of chance . People expect that a sequence of events generated by a random process will represent the essential characteristics of that process even when the sequence is short. In considering tosses of a coin for heads or tails, for example, people regard the sequence H-T-H-T-T-H to be more likely than the sequence H-H-H-T-T-T, which does not appear random, and also more likely than the sequence H-H-H-H-T-H, which does not represent the fairness of the coin. 7 Thus, people expect that the essential characteristics of the process will be represented, not only globally in the entire sequence, but also locally in each of its parts. A locally representative sequence, however, deviates systematically from chance expectation: it contains too many alternations and too few runs. Another consequence of the belief in local representativeness is the well-known gambler’s fallacy. After observing a long run of red on the roulette wheel, for example, most people erroneously believe that black is now due, presumably because the occurrence of black will result in a more representative sequence than the occurrence of an additional red. Chance is commonly viewed as a self-correcting process in which a deviation in one direction induces a deviation in the opposite direction to restore the equilibrium. In fact, deviations are not “corrected” as a chance process unfolds, they are merely diluted.
对机会的误解 。人们期望由随机过程产生的事件序列将代表该过程的基本特征 ,即使该序列很短。例如,在考虑抛硬币是正面还是反面时,人们认为序列H-T-H-T-T-H比序列H-H-T-T-T更有可能,后者看起来并不随机,也比序列H-H-H-T-H更有可能,后者并不代表硬币的公正性。 7 因此,人们期望过程的基本特征将被 ,不仅在整个序列中全局地被代表,而且在其每个部分的局部被代表。然而,一个局部代表的序列会系统地偏离机会预期:它包含太多的交替和太少的运行。相信局部代表性的另一个结果是著名的赌徒谬误。例如,在观察了轮盘上长时间的红色运行后, ,大多数人错误地认为现在应该是黑色,大概是因为黑色的出现将导致比另外一个红色的出现更具代表性的序列。机会通常被看作是一个自我纠正的过程,在这个过程中,一个方向的偏差会诱发一个相反方向的偏差来恢复平衡。事实上,随着机会过程的展开,偏差并没有被 "纠正 " ,它们只是被稀释了。
Misconceptions of chance are not limited to naive subjects. A study of the statistical intuitions of experienced research psychologists 8 revealed a lingering belief in what may be called the “law of small numbers,” according to which even small samples are highly representative of the populations from which they are drawn. The responses of these investigators reflected the expectation that a valid hypothesis about a population will be represented by a statistically significant result in a sample with little regard for its size. As a consequence, the researchers put too much faith in the results of small samples and grossly overestimated the replicability of such results. In the actual conduct of research, this bias leads to the selection of samples of inadequate size and to overinterpretation of findings.
对机会的误解并不限于天真的受试者。一项对有经验的研究心理学家的统计直觉的研究8显示,他们对所谓的 "小数法则 "的信念挥之不去,根据这一法则,即使是小样本也能高度代表它们所抽取的人群。 ,这些调查员的回答反映了这样一种期望,即关于一个群体的有效假设将由一个样本的统计学意义上的结果来代表,而很少考虑其大小。因此,研究者们过于相信小样本的结果,并严重高估了这种 结果的可复制性。在实际研究过程中,这种偏见导致 ,选择规模不足的样本,并对研究结果进行过度解释。
Insensitivity to predictability . People are sometimes called upon to make such numerical predictions as the future value of a stock, the demand for a commodity, or the outcome of a football game. Such predictions are often made by representativeness. For example, suppose one is given a description of a company and is asked to predict its future profit. If the description of the company is very favorable, a very high profit will appear most representative of that description; if the description is mediocre, a mediocre performance will appear most representative. The degree to which the description is favorable is unaffected by the reliability of that description or by the degree to which it permits accurate prediction. Hence, if people predict solely in terms of the favorableness of the description, their predictions will be insensitive to the reliability of the evidence and to the expected accuracy of the prediction.
对可预测性不敏感 。人们有时被要求进行诸如股票的未来价值、商品的需求或足球比赛的结果等数字预测。这种预测往往是通过代表性来进行的。例如,假设人们得到了一个公司的描述, ,被要求预测其未来的利润。如果对该公司的描述是非常有利的,那么很高的利润将显得最能代表该描述;如果该描述是平庸的,那么平庸的业绩将显得最有代表性。描述的有利程度不受该描述的可靠性影响,也不受其允许准确 预测的程度影响。因此,如果人们只根据描述的有利程度进行预测,他们的预测将对证据的可靠性和预测的预期准确性不敏感。
This mode of judgment violates the normative statistical theory in which the extremeness and the range of predictions are controlled by considerations of predictability. When predictability is nil, the same prediction should be made in all cases. For example, if the descriptions of companies provide no information relevant to profit, then the same value (such as average profit) should be predicted for all companies. If predictability is perfect, of course, the values predicted will match the actual values and the range of predictions will equal the range of outcomes. In general, the higher the predictability, the wider the range of predicted values.
这种判断方式违反了规范的统计理论,在这种理论中,预测的极端性和范围是由可预测性的考虑来控制的。当可预测性 是零时,在所有情况下都应该做出相同的预测。例如,如果对公司的描述没有提供与利润相关的信息,那么就应该对所有公司进行相同的数值(如平均利润)的预测。当然,如果可预测性是完美的,那么预测的价值将与实际价值相匹配,预测的范围将等于结果的范围。一般来说, ,可预测性越高,预测值的范围就越大。
Several studies of numerical prediction have demonstrated that intuitive predictions violate this rule, and that subjects show little or no regard for considerations of predictability. 9 In one of these studies, subjects were presented with several paragraphs, each describing the performance of a student teacher during a particular practice lesson. Some subjects were asked to evaluate the quality of the lesson described in the paragraph in percentile scores, relative to a specified population. Other subjects were asked to predict, also in percentile scores, the standing of each student teacher 5 years after the practice lesson. The judgments made under the two conditions were identical. That is, the prediction of a remote criterion (success of a teacher after 5 years) was identical to the evaluation of the information on which the prediction was based (the quality of the practice lesson). The students who made these predictions were undoubtedly aware of the limited predictability of teaching competence on the basis of a single trial lesson 5 years earlier; nevertheless, their predictions were as extreme as their evaluations.
一些关于数字预测的研究表明,直觉预测违反了这一规则,而且受试者很少或根本不考虑可预测性的因素。 9 在其中一项研究中,受试者看到了几个段落,每个段落都描述了一个学生教师在某个特定的 练习课中的表现。一些受试者被要求以百分位数的分数来评估这段话中所描述的课程的质量,相对于一个特定的人群。其他受试者则被要求同样以百分制的分数来预测每个学生教师在练习课5年后的地位。这两种情况下的判断是相同的。也就是说,对一个 远程标准(5年后教师的成功)的预测与对预测所依据的信息(实践课的质量)的评价是相同的。作出这些预测的学生无疑意识到,根据5年前的一次试教,教学能力的可预测性是有限的;然而,他们的预测与 他们的评价一样极端。
The illusion of validity . As we have seen, people often predict by selecting the outcome (for example, an occupation) that is most representative of the input (for example, the description of a person). The confidence they have in their prediction depends primarily on the degree of representativeness (that is, on the quality of the match between the selected outcome and the input) with little or no regard for the factors that limit predictive accuracy. Thus, people express great confidence in the prediction that a person is a librarian when given a description of his personality which matches the stereotype of librarians, even if the description is scanty, unreliable, or outdated. The unwarranted confidence which is produced by a good fit between the predicted outcome and the input information may be called the illusion of validity. This illusion persists even when the judge is aware of the factors that limit the accuracy of his predictions. It is a common observation that psychologists who conduct selection interviews often experience considerable confidence in their predictions, even when they know of the vast literature that shows selection interviews to be highly fallible. The continued reliance on the clinical interview for selection, despite repeated demonstrations of its inadequacy, amply attests to the strength of this effect.
有效性的幻觉 。正如我们所看到的,人们经常通过选择最能代表输入(例如,对一个人的描述)的结果(例如,一个职业)来预测。他们对自己预测的信心主要取决于代表性的程度(即所选结果与 输入之间的匹配质量),而很少或根本不考虑限制预测准确性的因素。因此,当一个人的个性描述与图书馆员的刻板印象相吻合时,人们对预测他是一个图书馆员表示出极大的信心,即使这个描述是稀少的、不可靠的或过时的。预测结果 和输入信息之间的良好契合所产生的毫无根据的信心,可以称为有效性的错觉。即使法官意识到限制其预测准确性的因素,这种错觉仍然存在。一个常见的现象是,进行选拔面试的心理学家经常对他们的预测有相当大的信心,即使他们知道大量的文献显示选拔面试 ,是非常容易出错的。尽管临床访谈的不足之处一再被证明,但人们仍然依赖它来进行选择,这充分证明了这种效应的强度。
The internal consistency of a pattern of inputs is a major determinant of one’s confidence in predictions based on these inputs. For example, people express more confidence in predicting the final grade point average of a student whose first-year record consists entirely of B’s than in predicting the grade point average of a student whose first-year record includes many A’s and C’s. Highly consistent patterns are most often observed when the input variables are highly redundant or correlated. Hence, people tend to have great confidence in predictions based on redundant input variables. However, an elementary result in the statistics of correlation asserts that, given input variables of stated validity, a prediction based on several such inputs can achieve higher accuracy when they are independent of each other than when they are redundant or correlated. Thus, redundancy among inputs decreases accuracy even as it increases confidence, and people are often confident in predictions that are quite likely to be off the mark. 10
输入模式的内部一致性是决定人们对基于这些输入的预测的信心的主要因素 。例如,人们对预测一个第一年记录完全由B组成的学生的最终平均分 ,比预测一个第一年记录包括许多A和C的学生的平均分更有信心。当输入变量高度冗余或相关时,最常观察到高度一致的模式。因此,人们往往对基于冗余输入变量的预测有很大信心。然而,相关统计学中的一个基本 结果断言,给定具有陈述有效性的输入变量,当它们相互独立时,基于几个这样的输入的预测可以达到比它们冗余或相关时更高的准确性。因此,输入中的冗余会降低准确率,即使它增加了信心,人们往往对预测充满信心,而这些预测很有可能 ,从而偏离目标。 10
Misconceptions of regression . Suppose a large group of children has been examined on two equivalent versions of an aptitude test. If one selects ten children from among those who did best on one of the two versions, he will usually find their performance on the second version to be somewhat disappointing. Conversely, if one selects ten children from among those who did worst on one version, they will be found, on the average, to do somewhat better on the other version. More generally, consider two variables X and Y which have the same distribution. If one selects individuals whose average X score deviates from the mean of X by k units, then the average of their Y scores will usually deviate from the mean of Y by less than k units. These observations illustrate a general phenomenon known as regression toward the mean, which was first documented by Galton more than 100 years ago.
回归的错误概念 。假设一大群儿童接受了两个同等版本的能力测试的检查。如果从这两个版本中的一个版本中选择10个表现最好的儿童,他通常会发现他们在第二个版本中的表现有些令人失望。相反,如果我们从那些在一个版本中表现最差的孩子中选出10个 ,就会发现他们在另一个版本中的表现平均来说会好一些。更一般地说,考虑两个具有相同分布的变量 X 和 Y 。如果我们选择那些 X 的平均分数与 X 的平均值相差 k 个单位的人,那么他们的 Y 的平均分数通常会与 Y 的平均值相差小于 k 个单位。这些观察结果说明了一个普遍的 现象,即向平均数回归,这是由高尔顿在100多年前首次记录的。
In the normal course of life, one encounters many instances of regression toward the mean, in the comparison of the height of fathers and sons, of the intelligence of husbands and wives, or of the performance of individuals on consecutive examinations. Nevertheless, people do not develop correct intuitions about this phenomenon. First, they do not expect regression in many contexts where it is bound to occur. Second, when they recognize the occurrence of regression, they often invent spurious causal explanations for it. 11 We suggest that the phenomenon of regression remains elusive because it is incompatible with the belief that the predicted outcome should be maximally representative of the input, and, hence, that the value of the outcome variable should be as extreme as the value of the input variable.
在正常的生活过程中,人们在比较父亲和儿子的身高、丈夫和妻子的智力或个人在连续考试中的表现时,会遇到许多向平均值回归的情况。然而,人们并没有对这种现象形成 正确的直觉。首先,在很多情况下,他们并不期待回归,而回归是一定会发生的。其次,当他们认识到回归的发生时,他们经常为它编造虚假的因果解释。 11 我们认为,回归现象仍然难以捉摸,因为它与预测的结果应该是输入的最大代表 ,因此,结果变量的值应该与输入变量的值一样极端的信念不相容。
The failure to recognize the import of regression can have pernicious consequences, as illustrated by the following observation. 12 In a discussion of flight training, experienced instructors noted that praise for an exceptionally smooth landing is typically followed by a poorer landing on the next try, while harsh criticism after a rough landing is usually followed by an improvement on the next try. The instructors concluded that verbal rewards are detrimental to learning, while verbal punishments are beneficial, contrary to accepted psychological doctrine. This conclusion is unwarranted because of the presence of regression toward the mean. As in other cases of repeated examination, an improvement will usually follow a poor performance and a deterioration will usually follow an outstanding performance, even if the instructor does not respond to the trainee’s achievement on the first attempt. Because the instructors had praised their trainees after good landings and admonished them after poor ones, they reached the erroneous and potentially harmful conclusion that punishment is more effective than reward.
未能认识到退步的重要性可能会产生有害的后果,下面的观察说明了这一点。 12 在一次关于飞行训练的讨论中,有经验的教员指出,对一次特别平稳的着陆进行表扬后,通常会在下一次尝试中出现较差的 ,而在一次粗暴的着陆后进行严厉的批评后,通常会在下一次尝试中出现改进。教官们的结论是,口头奖励不利于学习,而口头惩罚则是有益的,这与公认的心理学理论相反。这个结论是没有根据的,因为存在着向平均值的回归。正如其他重复 考试的情况一样,即使教官没有对受训者在第一次尝试时的成绩作出反应,通常在差的表现之后会有改善,在出色的表现之后会有恶化。因为教官在良好的着陆后对学员进行了表扬,在糟糕的着陆后对学员进行了训诫,他们得出了错误的、可能有害的结论 ,即惩罚比奖励更有效。
Thus, the failure to understand the effect of regression leads one to overestimate the effectiveness of punishment and to underestimate the effectiveness of reward. In social interaction, as well as in training, rewards are typically administered when performance is good, and punishments are typically administered when performance is poor. By regression alone, therefore, behavior is most likely to improve after punishment and most likely to deteriorate after reward. Consequently, the human condition is such that, by chance alone, one is most often rewarded for punishing others and most often punished for rewarding them. People are generally not aware of this contingency. In fact, the elusive role of regression in determining the apparent consequences of reward and punishment seems to have escaped the notice of students of this area.
因此,不了解退步的效果会导致人们高估惩罚的效果,低估奖励的效果。在社会互动中,以及在训练中,奖励通常是在表现好的时候进行的,而惩罚通常是在表现不好的时候进行的。因此,仅通过回归 ,行为最有可能在惩罚后得到改善,最有可能在奖励后变坏。 因此,人类的状况是这样的,仅从机会上看,一个人最经常因为惩罚别人而得到奖励,最经常因为奖励别人而受到惩罚。人们一般不会意识到这种偶然性。事实上,回归在决定奖励和惩罚的明显后果方面的难以捉摸的作用 ,似乎没有引起这一领域的学生的注意。
AVAILABILITY
可用性
There are situations in which people assess the frequency of a class or the probability of an event by the ease with which instances or occurrences can be brought to mind. For example, one may assess the risk of heart attack among middle-aged people by recalling such occurrences among one’s acquaintances. Similarly, one may evaluate the probability that a given business venture will fail by imagining various difficulties it could encounter. This judgmental heuristic is called availability. Availability is a useful clue for assessing frequency or probability, because instances of large classes are usually recalled better and faster than instances of less frequent classes. However, availability is affected by factors other than frequency and probability. Consequently, the reliance on availability leads to predictable biases, some of which are illustrated below.
在有些情况下,人们通过能够想起的实例或事件的难易程度来评估一类的频率或事件的概率。例如,人们可以通过回忆自己的熟人中发生的此类事件来评估中年人心脏病发作的风险。 同样地,人们可以通过想象一个特定的商业风险可能遇到的各种困难来评估其失败的可能性。这种判断性的启发式方法被称为可用性。可用性是评估频率或概率的有用线索,因为大类的实例通常比频率较低的类的实例记得更好、更快。然而,可用性 ,受频率和概率以外的因素影响。因此,对可用性的依赖导致了可预测的偏差,其中一些偏差在下文中得到了说明。
Biases due to the retrievability of instances . When the size of a class is judged by the availability of its instances, a class whose instances are easily retrieved will appear more numerous than a class of equal frequency whose instances are less retrievable. In an elementary demonstration of this effect, subjects heard a list of well-known personalities of both sexes and were subsequently asked to judge whether the list contained more names of men than of women. Different lists were presented to different groups of subjects. In some of the lists the men were relatively more famous than the women, and in others the women were relatively more famous than the men. In each of the lists, the subjects erroneously judged that the class (sex) that had the more famous personalities was the more numerous. 13
由于 实例的可检索性而产生的偏差 。当一个类别的大小由其实例的可用性来判断时,一个实例容易被检索到的类别会比一个频率相同但实例较难被检索到的类别 ,显得更多。在这一效应的初级演示中,受试者听到了一份男女知名人士的名单,随后被要求判断这份名单中男性的名字是否多于女性。不同的名单被呈现给不同的受试者群体。在一些名单中,男性相对比女性更有名,而在其他名单中,女性 ,相对比男性更有名。在每份名单中,受试者都错误地判断,拥有更多著名人物的阶层(性别)是更多的。 13
In addition to familiarity, there are other factors, such as salience, which affect the retrievability of instances. For example, the impact of seeing a house burning on the subjective probability of such accidents is probably greater than the impact of reading about a fire in the local paper. Furthermore, recent occurrences are likely to be relatively more available than earlier occurrences. It is a common experience that the subjective probability of traffic accidents rises temporarily when one sees a car overturned by the side of the road.
除了熟悉程度之外,还有其他因素,如突出性,也会影响实例的可检索性。例如,看到房子燃烧对此类事故的主观概率的影响 ,可能比在当地报纸上看到火灾的影响要大。此外,最近发生的事件可能比早期发生的事件相对更容易获得。一个常见的经验是,当人们看到一辆汽车在路边翻倒时,交通事故的主观概率会暂时上升。
Biases due to the effectiveness of a search set . Suppose one samples a word (of three letters or more) at random from an English text. Is it more likely that the word starts with r or that r is the third letter? People approach this problem by recalling words that begin with r ( road ) and words that have r in the third position ( car ) and assess the relative frequency by the ease with which words of the two types come to mind. Because it is much easier to search for words by their first letter than by their third letter, most people judge words that begin with a given consonant to be more numerous than words in which the same consonant appears in the third position. They do so even for consonants, such as r or k , that are more frequent in the third position than in the first. 14
由于搜索集的有效性而产生的偏差 。假设一个 ,从一个英文文本中随机抽取一个单词(三个字母或更多)。这个词是以 r 开头的可能性更大,还是 r 是第三个字母?人们通过回忆以 r 开头的单词 (道路 )和将 r 放在第三位的单词 (汽车 )来解决这个问题,并通过这两种类型的单词在脑海中出现的难易程度来评估相对频率。 因为通过第一个字母搜索单词比通过第三个字母搜索单词要容易得多,所以大多数人判断以某一辅音开头的单词比同一辅音出现在第三位置的单词更多。他们甚至对那些在第三位置比第一位置更频繁的辅音,如 r 或 k ,也是如此。 14
Different tasks elicit different search sets. For example, suppose you are asked to rate the frequency with which abstract words ( thought , love ) and concrete words ( door , water ) appear in written English. A natural way to answer this question is to search for contexts in which the word could appear. It seems easier to think of contexts in which an abstract concept is mentioned (love in love stories) than to think of contexts in which a concrete word (such as door ) is mentioned. If the frequency of words is judged by the availability of the contexts in which they appear, abstract words will be judged as relatively more numerous than concrete words. This bias has been observed in a recent study 15 which showed that the judged frequency of occurrence of abstract words was much higher than that of concrete words, equated in objective frequency. Abstract words were also judged to appear in a much greater variety of contexts than concrete words.
不同的任务会引起不同的搜索集。例如,假设 ,你被要求评定抽象词 (思想 、 爱 )和具体词 (门 、 水 )在书面英语中出现的频率。回答这个问题的一个自然方法是搜索该词可能出现的语境。想到一个抽象概念被提及的语境(爱情故事中的爱情)似乎比想到一个具体词(如 door )被提及的语境更容易。如果单词的频率是根据它们出现的语境的可用性来判断的 ,那么抽象的单词就会被判断为比具体的单词相对更多。在最近的一项研究中已经观察到了这种偏见15,该研究显示,抽象词的判断出现频率要比具体词的判断出现频率高得多,在客观频率上是相等的。抽象的 词也被判断为比具体词出现在更多的语境中。
Biases of imaginability . Sometimes one has to assess the frequency of a class whose instances are not stored in memory but can be generated according to a given rule. In such situations, one typically generates several instances and evaluates frequency or probability by the ease with which the relevant instances can be constructed. However, the ease of constructing instances does not always reflect their actual frequency, and this mode of evaluation is prone to biases. To illustrate, consider a group of 10 people who form committees of k members, 2 ≤ k ≤ 8. How many different committees of k members can be formed? The correct answer to this problem is given by the binomial coefficient (10/ k ) which reaches a maximum of 252 for k = 5 . Clearly, the number of committees of k members equals the number of committees of (10 − k ) members, because any committee of k members defines a unique group of (10 − k ) nonmembers.
可想象性的偏差 。有时,我们必须评估一个类别的频率,这个类别的实例没有储存在记忆中,但可以根据给定的规则生成。在这种情况下,人们通常会生成几个实例,通过构建相关 实例的难易程度来评估频率或概率。然而,构建实例的难易程度并不总是反映它们的实际频率,而且这种评估模式容易产生偏差。为了说明这一点,考虑一个由10个人组成的、由 k 个成员组成的委员会,2≤ k ≤8。可以组成多少个由 k 个成员组成的不同委员会?这个问题的正确答案是由二项式系数(10/k )给出的,在 k = 5 的情况下达到最大252。显然,由 k 个成员组成的委员会的数量等于由(10- k )个成员组成的委员会的数量,因为任何由 k 个成员组成的委员会都定义了一个由(10- k )个非成员组成的唯一的小组。
One way to answer this question without computation is to mentally construct committees of k members and to evaluate their number by the ease with which they come to mind. Committees of few members, say 2, are more available than committees of many members, say 8. The simplest scheme for the construction of committees is a partition of the group into disjoint sets. One readily sees that it is easy to construct five disjoint committees of 2 members, while it is impossible to generate even two disjoint committees of 8 members. Consequently, if frequency is assessed by imaginability, or by availability for construction, the small committees will appear more numerous than larger committees, in contrast to the correct bell-shaped function. Indeed, when naive subjects were asked to estimate the number of distinct committees of various sizes, their estimates were a decreasing monotonic function of committee size. 16 For example, the median estimate of the number of committees of 2 members was 70, while the estimate for committees of 8 members was 20 (the correct answer is 45 in both cases).
回答这个问题的一个方法是,无需计算,在头脑中构建由 K 个成员组成的委员会,并通过想到它们的难易程度来评估其数量。 少数成员的委员会,例如2个,比许多成员的委员会,例如8个,更容易想到。构建委员会的最简单方案是将小组划分为互不相连的集合。人们很容易看到,构建5个由2个成员组成的互不相干的委员会是很容易的,而要产生甚至两个由8个成员组成的互不相干的委员会是不可能的。因此,如果频率是通过可想象性或可构建性来评估 ,那么小的委员会将比大的委员会显得更多,这与正确的钟形函数相反。事实上,当天真的受试者被要求估计各种规模的不同委员会的数量时,他们的估计是委员会规模的单调性递减函数。 16 例如,2名成员的委员会的数量 ,估计的中位数是70,而8名成员的委员会的估计是20(在这两种情况下,正确的答案是45)。
Imaginability plays an important role in the evaluation of probabilities in real-life situations. The risk involved in an adventurous expedition, for example, is evaluated by imagining contingencies with which the expedition is not equipped to cope. If many such difficulties are vividly portrayed, the expedition can be made to appear exceedingly dangerous, although the ease with which disasters are imagined need not reflect their actual likelihood. Conversely, the risk involved in an undertaking may be grossly underestimated if some possible dangers are either difficult to conceive of, or simply do not come to mind.
可想象性在评估现实生活中的概率方面发挥着重要作用。例如,通过想象探险队没有能力应对的突发事件,来评估冒险探险中的风险。如果生动地描述了许多这样的困难 ,那么探险就会显得非常危险,尽管想象中的灾难的容易程度并不反映其实际的可能性。相反,如果一些可能的危险难以想象,或者根本没有想到,那么一项事业所涉及的风险可能被严重低估。
Illusory correlation . Chapman and Chapman 17 have described an interesting bias in the judgment of the frequency with which two events co-occur. They presented naive judges with information concerning several hypothetical mental patients. The data for each patient consisted of a clinical diagnosis and a drawing of a person made by the patient. Later the judges estimated the frequency with which each diagnosis (such as paranoia or suspiciousness) had been accompanied by various features of the drawing (such as peculiar eyes). The subjects markedly overestimated the frequency of co-occurrence of natural associates, such as suspiciousness and peculiar eyes. This effect was labeled illusory correlation. In their erroneous judgments of the data to which they had been exposed, naive subjects “rediscovered” much of the common, but unfounded, clinical lore concerning the interpretation of the draw-a-person test. The illusory correlation effect was extremely resistant to contradictory data. It persisted even when the correlation between symptom and diagnosis was actually negative, and it prevented the judges from detecting relationships that were in fact present.
虚幻的关联性 。查普曼和查普曼17曾 ,描述了对两个事件共同发生的频率的判断中一个有趣的偏见。他们向天真的法官提供了有关几个假想的精神病人的信息。每个病人的数据包括临床诊断和病人画的人像。后来,法官估计了每种诊断(如偏执狂或多疑) ,并伴随着绘画的各种特征(如奇特的眼睛)的频率。受试者明显地高估了自然关联的出现频率,如多疑和怪异的眼睛。这种效应被称为虚幻的关联。在他们对所接触到的数据的错误判断中,天真的受试者 "重新发现 "了许多常见的、但毫无根据的、关于抽签测试解释的临床 。虚幻的相关效应对矛盾的数据有极大的抵抗力。即使在症状和诊断之间的相关性实际上是负面的情况下,它仍然存在 ,而且它使法官无法检测到事实上存在的关系。
Availability provides a natural account for the illusory-correlation effect. The judgment of how frequently two events co-occur could be based on the strength of the associative bond between them. When the association is strong, one is likely to conclude that the events have been frequently paired. Consequently, strong associates will be judged to have occurred together frequently. According to this view, the illusory correlation between suspiciousness and peculiar drawing of the eyes, for example, is due to the fact that suspiciousness is more readily associated with the eyes than with any other part of the body.
可用性为虚幻-相关效应提供了一个自然的解释。 ,对两个事件共同出现的频率的判断可以基于它们之间的关联性的强度。当关联性很强时,人们很可能会得出结论,这些事件经常被配对。因此,强关联者将被判断为经常一起发生。 根据这一观点,怀疑和眼睛的特殊画法之间的虚幻关联,例如,是由于怀疑比身体的任何其他部分更容易与眼睛联系起来。
Lifelong experience has taught us that, in general, instances of large classes are recalled better and faster than instances of less frequent classes; that likely occurrences are easier to imagine than unlikely ones; and that the associative connections between events are strengthened when the events frequently co-occur. As a result, man has at his disposal a procedure (the availability heuristic) for estimating the numerosity of a class, the likelihood of an event, or the frequency of co-occurrences, by the ease with which the relevant mental operations of retrieval, construction, or association can be performed. However, as the preceding examples have demonstrated, this valuable estimation procedure results in systematic errors.
终身的经验告诉我们,一般来说,大类的实例比频率较低的类的实例记得更好、更快;可能发生的事件比不可能发生的事件更容易想象;当事件经常共同发生时,事件之间的关联联系 。因此,人类有一个程序(可用性启发式),可以通过检索、构建或联想等相关心理操作的难易程度来估计一个类别的数量、一个事件的可能性或共同出现的频率。然而,正如前面的例子 ,这种有价值的估计程序会导致系统性错误。
ADJUSTMENT AND ANCHORING
调整和锚定
In many situations, people make estimates by starting from an initial value that is adjusted to yield the final answer. The initial value, or starting point, may be suggested by the formulation of the problem, or it may be the result of a partial computation. In either case, adjustments are typically insufficient. 18 That is, different starting points yield different estimates, which are biased toward the initial values. We call this phenomenon anchoring.
在许多情况下,人们通过从一个初始值开始进行估计,然后再进行调整以得出最终答案。初始值,或起点,可能是由问题的表述提出的,也可能是部分计算的结果。在这两种情况下,调整 ,通常是不够的。 18 也就是说,不同的起点会产生不同的估计值,而这些估计值是偏向于初始值的。我们把这种现象称为锚定。
Insufficient adjustment . In a demonstration of the anchoring effect, subjects were asked to estimate various quantities, stated in percentages (for example, the percentage of African countries in the United Nations). For each quantity, a number between 0 and 100 was determined by spinning a wheel of fortune in the subjects’ presence. The subjects were instructed to indicate first whether that number was higher or lower than the value of the quantity, and then to estimate the value of the quantity by moving upward or downward from the given number. Different groups were given different numbers for each quantity, and these arbitrary numbers had a marked effect on estimates. For example, the median estimates of the percentage of African countries in the United Nations were 25 and 45 for groups that received 10 and 65, respectively, as starting points. Payoffs for accuracy did not reduce the anchoring effect.
不充分的调整 。在锚定效应的演示中,被试被要求估计各种数量,以百分比表示(例如,非洲国家在联合国的百分比)。对于每个 ,通过在受试者面前转动幸运轮来确定一个0到100之间的数字。受试者被要求首先指出该数字是高于还是低于该数量的价值,然后通过从给定的数字向上或向下移动来估计该数量的价值。不同的组别为每个数量提供了不同的数字,这些 任意的数字对估计有明显的影响。例如,对联合国中非洲国家百分比的估计中值是25,对得到10和65作为起点的小组来说是45。对准确性的回报并没有减少锚定效应。
Anchoring occurs not only when the starting point is given to the subject, but also when the subject bases his estimate on the result of some incomplete computation. A study of intuitive numerical estimation illustrates this effect. Two groups of high school students estimated, within 5 seconds, a numerical expression that was written on the blackboard. One group estimated the product
锚定不仅发生在给了被试者起点的时候,也发生在被试者把 他的估计建立在一些不完整的计算结果的基础上。一项关于直觉数字估计的研究说明了这种效应。两组高中生在5秒钟内估计了写在黑板上的一个数字表达式。一组学生估计的是乘积
8 × 7 × 6 × 5 × 4 × 3 × 2 × 1
8 × 7 × 6 × 5 × 4 × 3 × 2 × 1
while another group estimated the product
而另一组人则估计产品
1 × 2 × 3 × 4 × 5 × 6 × 7 × 8
1 × 2 × 3 × 4 × 5 × 6 × 7 × 8
To rapidly answer such questions, people may perform a few steps of computation and estimate the product by extrapolation or adjustment. Because adjustments are typically insufficient, this procedure should lead to underestimation. Furthermore, because the result of the first few steps of multiplication (performed from left to right) is higher in the descending sequence than in the ascending sequence, the former expression should be judged larger than the latter. Both predictions were confirmed. The median estimate for the ascending sequence was 512, while the median estimate for the descending sequence was 2,250. The correct answer is 40,320.
为了 迅速 回答这类问题,人们可能会进行几步计算,并通过外推或调整来估计产品。由于调整通常是不充分的,这种程序应该会导致低估。此外,由于乘法的前几步(从左到右执行)的结果在降序中比在升序中高,所以 ,前者的表达应该被判断为比后者大。这两个预测都得到了证实。升序的估计中值为512,而降序的估计中值为2250。正确答案是40,320。
Biases in the evaluation of conjunctive and disjunctive events . In a recent study by Bar-Hillel 19 subjects were given the opportunity to bet on one of two events. Three types of events were used: (i) simple events, such as drawing a red marble from a bag containing 50% red marbles and 50% white marbles; (ii) conjunctive events, such as drawing a red marble seven times in succession, with replacement, from a bag containing 90% red marbles and 10% white marbles; and (iii) disjunctive events, such as drawing a red marble at least once in seven successive tries, with replacement, from a bag containing 10% red marbles and 90% white marbles. In this problem, a significant majority of subjects preferred to bet on the conjunctive event (the probability of which is .48) rather than on the simple event (the probability of which is .50). Subjects also preferred to bet on the simple event rather than on the disjunctive event, which has a probability of .52. Thus, most subjects bet on the less likely event in both comparisons. This pattern of choices illustrates a general finding. Studies of choice among gambles and of judgments of probability indicate that people tend to overestimate the probability of conjunctive events 20 and to underestimate the probability of disjunctive events. These biases are readily explained as effects of anchoring. The stated probability of the elementary event (success at any one stage) provides a natural starting point for the estimation of the probabilities of both conjunctive and disjunctive events. Since adjustment from the starting point is typically insufficient, the final estimates remain too close to the probabilities of the elementary events in both cases. Note that the overall probability of a conjunctive event is lower than the probability of each elementary event, whereas the overall probability of a disjunctive event is higher than the probability of each elementary event. As a consequence of anchoring, the overall probability will be overestimated in conjunctive problems and underestimated in disjunctive problems.
对连带事件和非连带事件的评价存在偏差 。在Bar-Hillel最近的一项研究中,19受试者有机会在两个 事件中的一个进行投注。使用了三种类型的事件。 (i)简单事件,例如从一个装有50%红色弹珠和50%白色弹珠的袋子中抽出一个红色弹珠;(ii)共轭事件,例如从一个装有90%红色弹珠和10%白色弹珠的袋子中连续七次抽出一个红色弹珠,并进行替换;以及(iii)不共轭事件,例如从一个装有10%红色弹珠和90%白色弹珠的袋子中连续七次至少抽出一个红色弹珠,并进行替换。在这个问题上,相当多的受试者倾向于投注于共轭事件(其概率为0.48)而不是简单事件(其概率为0.50)。受试者还倾向于在简单事件上下注,而不是在不连贯事件上下注,后者的概率 ,为0.52。因此,在两次比较中,大多数受试者都押在可能性较小的事件上。这种选择模式说明了一个普遍的发现。对赌博的选择和概率判断的研究表明,人们倾向于高估共轭事件的概率20,低估不共轭事件的概率。这些偏见很容易被解释为锚定的影响。 基本事件(在任何一个阶段的成功)的陈述概率为估计连带事件和非连带事件的概率提供了一个自然的起点。由于从起点开始的调整通常是不充分的,所以最终的估计仍然过于接近两种情况下的基本事件的概率。请注意, 共轭事件的总体概率低于每个基本事件的概率,而非共轭事件的总体概率则高于每个基本事件的概率。作为锚定的结果,总体概率在共轭问题中会被高估,在不共轭问题中会被低估。
Biases in the evaluation of compound events are particularly significant in the context of planning. The successful completion of an undertaking, such as the development of a new product, typically has a conjunctive character: for the undertaking to succeed, each of a series of events must occur. Even when each of these events is very likely, the overall probability of success can be quite low if the number of events is large. The general tendency to overestimate the probability of conjunctive events leads to unwarranted optimism in the evaluation of the likelihood that a plan will succeed or that a project will be completed on time. Conversely, disjunctive structures are typically encountered in the evaluation of risks. A complex system, such as a nuclear reactor or a human body, will malfunction if any of its essential components fails. Even when the likelihood of failure in each component is slight, the probability of an overall failure can be high if many components are involved. Because of anchoring, people will tend to underestimate the probabilities of failure in complex systems. Thus, the direction of the anchoring bias can sometimes be inferred from the structure of the event. The chain-like structure of conjunctions leads to overestimation, the funnel-like structure of disjunctions leads to underestimation.
在规划的背景下,对复合事件的评价偏差尤其重要 。一项事业的成功完成,如新产品的开发,通常具有连带性的特点:为了使事业成功,一系列事件中的每一个都必须发生。即使这些事件中的每一个都很有可能发生,但如果事件的数量很多,总体的成功概率也会相当低。一般来说,高估 共轭事件的概率的趋势,导致在评估一个计划成功的可能性或一个项目将按时完成的可能性时出现不必要的乐观。相反,在评估风险时,通常会遇到不连贯的结构。一个复杂的系统,如核反应堆或人体,如果它的任何一个基本部件发生故障,就会出现故障。即使 ,每个部件发生故障的可能性很小,但如果涉及许多部件,整体故障的概率就会很高。由于锚定,人们会倾向于低估复杂系统的故障概率。因此,锚定偏差的方向有时可以从事件的结构中推断出来。链状结构的连接点会导致高估, ,漏斗状结构的断开连接点会导致低估。
Anchoring in the assessment of subjective probability distributions . In decision analysis, experts are often required to express their beliefs about a quantity, such as the value of the Dow Jones average on a particular day, in the form of a probability distribution. Such a distribution is usually constructed by asking the person to select values of the quantity that correspond to specified percentiles of his subjective probability distribution. For example, the judge may be asked to select a number, X 90 , such that his subjective probability that this number will be higher than the value of the Dow Jones average is .90. That is, he should select the value X 90 so that he is just willing to accept 9 to 1 odds that the Dow Jones average will not exceed it. A subjective probability distribution for the value of the Dow Jones average can be constructed from several such judgments corresponding to different percentiles.
主观概率分布评估中的锚定 。在决策分析中,经常要求专家 ,以概率分布的形式表达他们对某一数量的信念,如道琼斯平均指数在某一天的价值。这种分布通常是通过要求人 ,选择与他的主观概率分布的特定百分比相对应的数量值来构建的。例如,法官可能被要求选择一个数字, X90 ,使他的主观概率,这个数字将高于道琼斯平均指数的值为0.90。也就是说,他应该选择 X90 这个数值,这样他才愿意接受9比1的概率,即道 琼斯平均数不会超过它。道琼斯平均指数值的主观概率分布可以由对应于不同百分位数的几个这样的判断来构建。
By collecting subjective probability distributions for many different quantities, it is possible to test the judge for proper calibration. A judge is properly (or externally) calibrated in a set of problems if exactly Π% of the true values of the assessed quantities falls below his stated values of X Π . For example, the true values should fall below X 01 for 1% of the quantities and above X 99 for 1% of the quantities. Thus, the true values should fall in the confidence interval between X 01 and X 99 on 98% of the problems.
通过收集许多不同数量的主观概率分布,有可能测试法官是否正确校准。在一组 的问题中,如果被评估数量的真实值正好有Π%落在他所说的 XΠ 值以下,那么法官就是正确的(或外部)校准。例如,1%的数量的真实值应低于 X01 ,1%的数量应高于 X99 。因此,98%的问题的真实值应该落在 X01 和 X99 之间的置信区间。
Several investigators 21 have obtained probability distributions for many quantities from a large number of judges. These distributions indicated large and systematic departures from proper calibration. In most studies, the actual values of the assessed quantities are either smaller than X 0l or greater than X 99 for about 30% of the problems. That is, the subjects state overly narrow confidence intervals which reflect more certainty than is justified by their knowledge about the assessed quantities. This bias is common to naive and to sophisticated subjects, and it is not eliminated by introducing proper scoring rules, which provide incentives for external calibration. This effect is attributable, in part at least, to anchoring.
一些研究人员21从大量的法官那里获得了 许多数量的概率分布。这些分布表明与适当的校准有很大的系统性偏差。在大多数研究中,在大约30%的问题中,评估量的实际值要么小于 X0l ,要么大于 X99 。也就是说,受试者陈述的置信区间过窄,反映出比他们对评估量的知识更有把握的 。这种偏见对天真和成熟的受试者来说都是常见的,它不会因为引入适当的评分规则而被消除,因为评分规则提供了外部校准的激励。这种影响至少可以部分归因于锚定。
To select X 90 for the value of the Dow Jones average, for example, it is natural to begin by thinking about one’s best estimate of the Dow Jones and to adjust this value upward. If this adjustment—like most others—is insufficient, then X 90 will not be sufficiently extreme. A similar anchoring effect will occur in the selection of X 10 , which is presumably obtained by adjusting one’s best estimate downward. Consequently, the confidence interval between X 10 and X 90 will be too narrow, and the assessed probability distribution will be too tight. In support of this interpretation it can be shown that subjective probabilities are systematically altered by a procedure in which one’s best estimate does not serve as an anchor.
例如,要为道琼斯平均数的价值选择 X90 ,很自然地开始考虑自己对 道琼斯的最佳估计,并向上调整这一价值。如果这种调整和其他大多数调整一样,是不充分的,那么 X90 就不会足够极端。在选择 X10 时也会出现类似的锚定效应, X10 大概是通过向下调整一个人的最佳估计值得到的。因此, X10 和 X90 之间的置信区间会太窄,而评估的概率分布 ,会太紧。为了支持这种解释,可以证明主观概率被一个程序系统地改变了,在这个程序中,一个人的最佳估计并不作为一个锚。
Subjective probability distributions for a given quantity (the Dow Jones average) can be obtained in two different ways: (i) by asking the subject to select values of the Dow Jones that correspond to specified percentiles of his probability distribution and (ii) by asking the subject to assess the probabilities that the true value of the Dow Jones will exceed some specified values. The two procedures are formally equivalent and should yield identical distributions. However, they suggest different modes of adjustment from different anchors. In procedure (i), the natural starting point is one’s best estimate of the quantity. In procedure (ii), on the other hand, the subject may be anchored on the value stated in the question. Alternatively, he may be anchored on even odds, or a 50–50 chance, which is a natural starting point in the estimation of likelihood. In either case, procedure (ii) should yield less extreme odds than procedure (i).
可以通过两种不同的方式获得一个给定数量(道琼斯平均数)的主观概率分布:(i)要求受试者选择与他的概率分布的指定百分比相对应的道琼斯值 ;(ii)要求受试者评估道琼斯的真实值将超过一些指定值的概率。这两个程序在形式上是等同的,应该产生相同的分布。然而,它们暗示了来自不同锚点的不同调整模式。在程序(一)中,自然的起点是一个人对数量的最佳估计 。另一方面,在程序(ii)中,受试者可能锚定在问题中所述的数值上。或者,他可能被固定在偶数,或50-50的机会上,这是估计可能性的一个自然起点。在任何一种情况下,程序(二)应该比程序(一)产生更少的极端几率。
To contrast the two procedures, a set of 24 quantities (such as the air distance from New Delhi to Peking) was presented to a group of subjects who assessed either X 10 or X 90 for each problem. Another group of subjects received the median judgment of the first group for each of the 24 quantities. They were asked to assess the odds that each of the given values exceeded the true value of the relevant quantity. In the absence of any bias, the second group should retrieve the odds specified to the first group, that is, 9:1. However, if even odds or the stated value serve as anchors, the odds of the second group should be less extreme, that is, closer to 1:1. Indeed, the median odds stated by this group, across all problems, were 3:1. When the judgments of the two groups were tested for external calibration, it was found that subjects in the first group were too extreme, in accord with earlier studies. The events that they defined as having a probability of .10 actually obtained in 24% of the cases. In contrast, subjects in the second group were too conservative. Events to which they assigned an average probability of .34 actually obtained in 26% of the cases. These results illustrate the manner in which the degree of calibration depends on the procedure of elicitation.
为了对比这两种程序,一组被试将24个数量 (如从新德里到北京的航空距离),他们对每个问题的评估是 X10 或 X90 。另一组受试者收到第一组对24个数量中每个数量的判断中值。他们被要求评估每个给定值超过相关数量的 真值的几率。在没有任何偏见的情况下,第二组 ,应该检索到对第一组规定的几率,即9:1。然而,如果偶数赔率或所述值作为锚,第二组的赔率应该不那么极端,即更接近于1:1。事实上,在所有的问题中,这一组所陈述的赔率中值是3:1。当对两组的判断进行外部校准测试时,发现第一组 的受试者过于极端,与早期的研究一致。他们定义为概率为0.10的事件实际上在24%的情况下发生了。相反,第二组的受试者则过于保守。他们认为平均概率为0.34的事件实际上在26%的情况下发生了。这些结果说明了校准的程度取决于 诱发程序的方式。
DISCUSSION
讨论
This article has been concerned with cognitive biases that stem from the reliance on judgmental heuristics. These biases are not attributable to motivational effects such as wishful thinking or the distortion of judgments by payoffs and penalties. Indeed, several of the severe errors of judgment reported earlier occurred despite the fact that subjects were encouraged to be accurate and were rewarded for the correct answers. 22
本文一直关注的是源于对判断启发式的依赖的认知偏差。这些偏差并不归因于动机效应,如一厢情愿的想法或报酬和惩罚对判断的扭曲。事实上,尽管受试者 ,鼓励他们做到准确无误,并对正确的答案给予奖励,但前面报告的几个严重的判断错误还是发生了。 22
The reliance on heuristics and the prevalence of biases are not restricted to laymen. Experienced researchers are also prone to the same biases—when they think intuitively. For example, the tendency to predict the outcome that best represents the data, with insufficient regard for prior probability, has been observed in the intuitive judgments of individuals who have had extensive training in statistics. 23 Although the statistically sophisticated avoid elementary errors, such as the gambler’s fallacy, their intuitive judgments are liable to similar fallacies in more intricate and less transparent problems.
对启发式方法的依赖和偏见的普遍存在并不限于门外汉。有经验的研究人员也容易出现同样的偏见--当他们凭直觉思考的时候。例如,在受过大量统计学训练的人的直觉判断中, ,他们倾向于预测最能代表数据的结果,而对先验概率考虑不足。 23 尽管统计学上的专家避免了一些基本的错误,如赌徒的谬误,但他们的直觉判断在更复杂和更不透明的问题上容易出现类似的谬误。
It is not surprising that useful heuristics such as representativeness and availability are retained, even though they occasionally lead to errors in prediction or estimation. What is perhaps surprising is the failure of people to infer from lifelong experience such fundamental statistical rules as regression toward the mean, or the effect of sample size on sampling variability. Although everyone is exposed, in the normal course of life, to numerous examples from which these rules could have been induced, very few people discover the principles of sampling and regression on their own. Statistical principles are not learned from everyday experience because the relevant instances are not coded appropriately. For example, people do not discover that successive lines in a text differ more in average word length than do successive pages, because they simply do not attend to the average word length of individual lines or pages. Thus, people do not learn the relation between sample size and sampling variability, although the data for such learning are abundant.
诸如代表性和可用性等有用的启发式方法被保留下来并不奇怪, ,即使它们偶尔会导致预测或估计的错误。也许令人惊讶的是,人们无法从一生的经验中推断出基本的统计规则,如向平均值回归,或样本大小对抽样变异性的影响。尽管每个人在正常的生活过程中都会接触到许多例子,这些规则可以从 ,但很少有人会自己发现抽样和回归的原则。统计学原理不是从日常经验中学习的,因为相关的实例没有被适当地编码。例如,人们不会发现文本中连续的行与连续的页在平均字长上有更大的差异,因为他们根本没有注意到个别行或页的平均字长 。因此,人们并没有学习到样本大小和抽样可变性之间的关系,尽管这种学习的数据很丰富。
The lack of an appropriate code also explains why people usually do not detect the biases in their judgments of probability. A person could conceivably learn whether his judgments are externally calibrated by keeping a tally of the proportion of events that actually occur among those to which he assigns the same probability. However, it is not natural to group events by their judged probability. In the absence of such grouping it is impossible for an individual to discover, for example, that only 50% of the predictions to which he has assigned a probability of .9 or higher actually came true.
缺乏适当的准则也解释了为什么人们通常不能发现他们对概率判断的偏差。可以想象,一个人可以通过对他赋予相同概率的事件中实际发生的 比例进行统计,来了解他的判断是否是外部校准的。然而,按判断的概率对事件进行分组是不自然的。在没有这种分组的情况下,一个人不可能发现,例如,在他赋予0.9或更高概率的预测中,只有50%的预测实际发生了。
The empirical analysis of cognitive biases has implications for the theoretical and applied role of judged probabilities. Modern decision theory 24 regards subjective probability as the quantified opinion of an idealized person. Specifically, the subjective probability of a given event is defined by the set of bets about this event that such a person is willing to accept. An internally consistent, or coherent, subjective probability measure can be derived for an individual if his choices among bets satisfy certain principles, that is, the axioms of the theory. The derived probability is subjective in the sense that different individuals are allowed to have different probabilities for the same event. The major contribution of this approach is that it provides a rigorous subjective interpretation of probability that is applicable to unique events and is embedded in a general theory of rational decision.
对认知 偏见的经验分析对判断概率的理论和应用作用有影响。现代决策理论24认为主观概率是一个理想化的人的 量化的意见。具体来说,一个特定事件的主观概率是由这个人愿意接受的关于这个事件的一组赌注来定义的。如果一个人在赌注中的选择满足某些原则,即理论的公理,就可以为他推导出一个内部一致的,或连贯的主观概率 措施。所得出的概率是主观的,即允许不同的个人对同一事件有不同的概率。这种方法的主要贡献在于它提供了对概率的严格的主观解释,适用于 ,并嵌入到理性决策的一般理论中。
It should perhaps be noted that, while subjective probabilities can sometimes be inferred from preferences among bets, they are normally not formed in this fashion. A person bets on team A rather than on team B because he believes that team A is more likely to win; he does not infer this belief from his betting preferences. Thus, in reality, subjective probabilities determine preferences among bets and are not derived from them, as in the axiomatic theory of rational decision. 25
也许应该指出的是,虽然主观概率有时可以从投注的偏好中推断出来,但它们通常不是以这种方式形成的。一个人在A队而不是B队上下注,是因为他相信A队更有可能获胜;他并没有从他的投注偏好中推断出这种信念。 因此,在现实中,主观概率决定了投注的偏好,而不是像理性决策的公理理论那样,从投注中得出。 25
The inherently subjective nature of probability has led many students to the belief that coherence, or internal consistency, is the only valid criterion by which judged probabilities should be evaluated. From the standpoint of the formal theory of subjective probability, any set of internally consistent probability judgments is as good as any other. This criterion is not entirely satisfactory, because an internally consistent set of subjective probabilities can be incompatible with other beliefs held by the individual. Consider a person whose subjective probabilities for all possible outcomes of a coin-tossing game reflect the gambler’s fallacy. That is, his estimate of the probability of tails on a particular toss increases with the number of consecutive heads that preceded that toss. The judgments of such a person could be internally consistent and therefore acceptable as adequate subjective probabilities according to the criterion of the formal theory. These probabilities, however, are incompatible with the generally held belief that a coin has no memory and is therefore incapable of generating sequential dependencies. For judged probabilities to be considered adequate, or rational, internal consistency is not enough. The judgments must be compatible with the entire web of beliefs held by the individual. Unfortunately, there can be no simple formal procedure for assessing the compatibility of a set of probability judgments with the judge’s total system of beliefs. The rational judge will nevertheless strive for compatibility, even though internal consistency is more easily achieved and assessed. In particular, he will attempt to make his probability judgments compatible with his knowledge about the subject matter, the laws of probability, and his own judgmental heuristics and biases.
概率的内在主观性使许多学生相信,一致性或内部一致性是评价判断概率的唯一有效标准。从主观概率的正式 理论的角度来看,任何一组内部一致的概率判断都和其他任何概率判断一样好。这个标准并不完全令人满意,因为一套内部一致的主观概率可能与个人持有的其他信念不一致。考虑一个人,他对掷硬币游戏的所有可能结果的主观概率反映了赌徒的 谬误。也就是说,他对某次抛掷中出现尾巴的概率的估计会随着该次抛掷前连续出现的头像的数量而增加。这样一个人的判断可以是内部一致的,因此根据形式理论的标准,可以接受为充分的主观概率。然而,这些概率与普遍持有的 ,即硬币没有记忆,因此不能产生连续的依赖关系的信念不相容。要使判断的概率被认为是充分的,或者说是合理的,仅有内部一致性是不够的。判断必须与个人持有的整个信念网络相一致。不幸的是,不可能有简单的正式程序来评估一组概率 判断与法官的全部信仰系统的兼容性。不过,理性的法官还是会努力争取兼容性,尽管内部一致性更容易实现和评估。特别是,他将试图使他的概率判断与他关于主题事项的知识、概率规律以及他自己的判断启发式和偏见相兼容。
SUMMARY
摘要
This article described three heuristics that are employed in making judgments under uncertainty: (i) representativeness, which is usually employed when people are asked to judge the probability that an object or event A belongs to class or process B; (ii) availability of instances or scenarios, which is often employed when people are asked to assess the frequency of a class or the plausibility of a particular development; and (iii) adjustment from an anchor, which is usually employed in numerical prediction when a relevant value is available. These heuristics are highly economical and usually effective, but they lead to systematic and predictable errors. A better understanding of these heuristics and of the biases to which they lead could improve judgments and decisions in situations of uncertainty.
这篇文章 ,描述了在不确定情况下做出判断时采用的三种启发式方法:(i)代表性,当人们被要求判断一个物体或事件A属于类别或过程B的概率时,通常采用这种方法;(ii)实例或情景的可用性,当人们被要求评估一个类别的频率或一个特定 发展的合理性时,通常采用这种方法;(iii)从锚点调整,当有相关数值时,通常在数字预测中采用这种方法。这些启发式方法非常经济,通常也很有效,但它们会导致系统性和可预测的错误。更好地了解这些启发式方法和它们导致的偏见,可以改善在不确定情况下的判断和决策。
NOTES
注意事项
1 . D. Kahneman and A. Tversky, “On the Psychology of Prediction,” Psychological Review 80 (1973): 237–51.
1 .D. Kahneman和A. Tversky, "On the Psychology of Prediction," Psychological Review 80 (1973):237-51.
2 . Ibid.
2 .同上。
3 . Ibid.
3 .同上。
4 . D. Kahneman and A. Tversky, “Subjective Probability: A Judgment of Representativeness,” Cognitive Psychology 3 (1972): 430–54.
4 .D. Kahneman和A. Tversky,"主观概率。A Judgment of Representativeness," Cognitive Psychology 3 (1972):430-54.
5 . Ibid.
5 .同上。
6 . W. Edwards, “Conservatism in Human Information Processing,” in Formal Representation of Human Judgment , ed. B. Kleinmuntz (New York: Wiley, 1968), 17–52.
W. Edwards,"人类信息处理中的保守主义",载于《 人类判断的正式表述 》,B. Kleinmuntz (纽约:Wiley,1968),17-52。
7 . Kahneman and Tversky, “Subjective Probability.”
7 .卡尼曼和特维斯基,"主观概率"。
8 . A. Tversky and D. Kahneman, “Belief in the Law of Small Numbers,” Psychological Bulletin 76 (1971): 105–10.
8 .A. Tversky和D. Kahneman,"相信小数法则",《 心理学通报》 76(1971)。105-10.
9 . Kahneman and Tversky, “On the Psychology of Prediction.”
9 .卡尼曼和特维斯基,"论预测的心理学"。
10 . Ibid.
10 .同上。
11 . Ibid.
11 .同上。
12 . Ibid.
12 .同上。
13 . A. Tversky and D. Kahneman, “Availability: A Heuristic for Judging Frequency and Probability,” Cognitive Psychology 5 (1973): 207–32.
13 .A. Tversky和D. Kahneman,"可用性。A Heuristic for Judging Frequency and Probability," Cognitive Psychology 5 (1973):207-32.
14 . Ibid.
14 .同上。
15 . R. C. Galbraith and B. J. Underwood, “Perceived Frequency of Concrete and Abstract Words,” Memory & Cognition 1 (1973): 56–60.
15 .R. C. Galbraith 和 B. J. Underwood, "Perceived Frequency of Concrete and Abstract Words, " Memory & Cognition 1 (1973):56-60.
16 . Tversky and Kahneman, “Availability.”
16 .Tversky和Kahneman,"可用性"。
17 . L. J. Chapman and J. P. Chapman, “Genesis of Popular but Erroneous Psychodiagnostic Observations,” Journal of Abnormal Psychology 73 (1967): 193–204; L. J. Chapman and J. P. Chapman, “Illusory Correlation as an Obstacle to the Use of Valid Psychodiagnostic Signs,” Journal of Abnormal Psychology 74 (1969): 271–80.
17 .L.J.查普曼和J.P.查普曼,《流行但错误的心理诊断观察的成因》,《 异常心理学杂志 》73(1967)。193-204;L.J.查普曼和J.P.查普曼, "虚幻的相关性是使用有效心理诊断标志的障碍",《 异常心理学杂志 》74(1969)。271-80.
18 . P. Slovic and S. Lichtenstein, “Comparison of Bayesian and Regression Approaches to the Study of Information Processing in Judgment,” Organizational Behavior & Human Performance 6 (1971): 649–744.
18 .P. Slovic和S. Lichtenstein,"比较贝叶斯和回归方法研究判断中的信息处理",《 组织行为与人类绩效 》6(1971)。649-744.
19 . M. Bar-Hillel, “On the Subjective Probability of Compound Events,” Organizational Behavior & Human Performance 9 (1973): 396–406.
19 .M. Bar-Hillel, "On the Subjective Probability of Compound Events," Organizational Behavior & Human Performance 9 (1973):396-406.
20 . J. Cohen, E. I. Chesnick, and D. Haran, “A Confirmation of the Inertial-Ψ Effect in Sequential Choice and Decision,” British Journal of Psychology 63 (1972): 41–46.
20 .J. Cohen, E. I. Chesnick, and D. Haran, "A Confirmation of the Inertial-Ψ Effect in Sequential Choice and Decision, " British Journal of Psychology 63 (1972):41-46.
21 . M. Alpert and H. Raiffa, unpublished manuscript; C. A. Stael von Holstein, “Two Techniques for Assessment of Subjective Probability Distributions: An Experimental Study,” Acta Psychologica 35 (1971): 478–94; R. L. Winkler, “The Assessment of Prior Distributions in Bayesian Analysis,” Journal of the American Statistical Association 62 (1967): 776–800.
21 .M. Alpert和H. Raiffa,未发表的手稿;C. A. Stael von Holstein,《评估主观概率分布的两种技术。An Experimental Study," Acta Psychologica 35 (1971):478-94;R. L. Winkler,"贝叶斯分析中的先验分布评估",《 美国统计学会杂志》 62(1967)。776-800.
22 . Kahneman and Tversky, “Subjective Probability”; Tversky and Kahneman, “Availability.”
22 .Kahneman和Tversky,"主观概率";Tversky和Kahneman,"可用性"。
23 . Kahneman and Tversky, “On the Psychology of Prediction”; Tversky and Kahneman, “Belief in the Law of Small Numbers.”
23 .卡尼曼和特维斯基,"论预测的心理学";特维斯基和卡尼曼,"相信小数的法则 。"
24 . L. J. Savage, The Foundations of Statistics (New York: Wiley, 1954).
24 .L. J. Savage, The Foundations of Statistics (New York: Wiley, 1954).
25 . Ibid.; B. de Finetti, “Probability: Interpretations,” in International Encyclopedia of the Social Sciences , ed. D. E. Sills, vol. 12 (New York: Macmillan, 1968), 496–505.
25 .同上;B. de Finetti,"概率。解释,"在 国际社会科学百科全书 ,编辑。D. E. Sills,第12卷(纽约:Macmillan,1968),496-505。
Appendix B: Choices, Values, and Frames fn1
附录B: 选择、价值和框架fn1
Daniel Kahneman and Amos Tversky
丹尼尔-卡尼曼和阿莫斯-特维斯基
ABSTRACT: We discuss the cognitive and the psychophysical determinants of choice in risky and riskless contexts. The psychophysics of value induce risk aversion in the domain of gains and risk seeking in the domain of losses. The psychophysics of chance induce overweighting of sure things and of improbable events, relative to events of moderate probability. Decision problems can be described or framed in multiple ways that give rise to different preferences, contrary to the invariance criterion of rational choice. The process of mental accounting, in which people organize the outcomes of transactions, explains some anomalies of consumer behavior. In particular, the acceptability of an option can depend on whether a negative outcome is evaluated as a cost or as an uncompensated loss. The relation between decision values and experience values is discussed .
ABSTRACT: 我们讨论了在有风险和无风险背景下选择的认知和心理物理学决定因素。价值的心理物理学诱发了在收益领域的风险厌恶和在损失领域的风险寻求。机会的心理物理学引起了对确定的事情和不可能的事件的过度重视,相对于 中等概率的事件。决策问题可以用多种方式描述或构建,从而产生不同的偏好,这与理性选择的不变性标准相反。人们组织交易结果的心理核算过程,解释了消费者行为的一些反常现象。特别是,一个选项的可接受性可能取决于 ,一个负面的结果是被评估为成本还是被评估为一个未补偿的损失。我们讨论了决策价值和经验价值之间的关系 。
Making decisions is like speaking prose—people do it all the time, knowingly or unknowingly. It is hardly surprising, then, that the topic of decision making is shared by many disciplines, from mathematics and statistics, through economics and political science, to sociology and psychology. The study of decisions addresses both normative and descriptive questions. The normative analysis is concerned with the nature of rationality and the logic of decision making. The descriptive analysis, in contrast, is concerned with people’s beliefs and preferences as they are, not as they should be. The tension between normative and descriptive considerations characterizes much of the study of judgment and choice.
做出决定就像说散文一样,人们总是有意无意地这样做。因此,从数学和统计学,到经济学和政治学 ,再到社会学和心理学,许多学科都有决策这一主题,这一点并不奇怪。对决策的研究既涉及规范性问题,也涉及描述性问题。规范性分析关注的是理性的本质和决策的逻辑。与此相反,描述性分析关注的是人们的信仰和偏好,因为它们是真实的,而不是应该是这样的。规范性考虑和描述性考虑之间的紧张关系 ,这是对判断和选择的大部分研究的特点。
Analyses of decision making commonly distinguish risky and riskless choices. The paradigmatic example of decision under risk is the acceptability of a gamble that yields monetary outcomes with specified probabilities. A typical riskless decision concerns the acceptability of a transaction in which a good or a service is exchanged for money or labor. In the first part of this article we present an analysis of the cognitive and psychophysical factors that determine the value of risky prospects. In the second part we extend this analysis to transactions and trades.
对决策的分析通常会区分有风险和无风险的选择。风险决策的典型例子是赌博的可接受性,赌博产生的货币结果具有特定的概率。一个典型的无风险决策涉及到一项交易的可接受性,在这项交易中,货物或服务被用来交换金钱 或劳动。在本文的第一部分,我们介绍了对决定风险前景价值的认知和心理物理因素的分析。在第二部分,我们将这一分析扩展到交易和贸易。
RISKY CHOICE
冒险的选择
Risky choices, such as whether or not to take an umbrella and whether or not to go to war, are made without advance knowledge of their consequences. Because the consequences of such actions depend on uncertain events such as the weather or the opponent’s resolve, the choice of an act may be construed as the acceptance of a gamble that can yield various outcomes with different probabilities. It is therefore natural that the study of decision making under risk has focused on choices between simple gambles with monetary outcomes and specified probabilities, in the hope that these simple problems will reveal basic attitudes toward risk and value.
有风险的 ,比如是否打伞,是否参战,都是在没有事先了解其后果的情况下做出的选择。因为 ,这种行为的后果取决于不确定的事件,如天气或对手的决心,对行为的选择可以被理解为接受一场赌博,可以产生不同概率的结果。因此,对风险下决策的研究很自然地集中在具有货币结果和特定概率的简单赌博之间的选择, ,希望这些简单的问题能够揭示对风险和价值的基本态度。
We shall sketch an approach to risky choice that derives many of its hypotheses from a psychophysical analysis of responses to money and to probability. The psychophysical approach to decision making can be traced to a remarkable essay that Daniel Bernoulli published in 1738 (Bernoulli 1954) in which he attempted to explain why people are generally averse to risk and why risk aversion decreases with increasing wealth. To illustrate risk aversion and Bernoulli’s analysis, consider the choice between a prospect that offers an 85% chance to win $1,000 (with a 15% chance to win nothing) and the alternative of receiving $800 for sure. A large majority of people prefer the sure thing over the gamble, although the gamble has higher (mathematical) expectation. The expectation of a monetary gamble is a weighted average, where each possible outcome is weighted by its probability of occurrence. The expectation of the gamble in this example is .85 × $1,000 + .15 × $0 = $850, which exceeds the expectation of $800 associated with the sure thing. The preference for the sure gain is an instance of risk aversion. In general, a preference for a sure outcome over a gamble that has higher or equal expectation is called risk averse, and the rejection of a sure thing in favor of a gamble of lower or equal expectation is called risk seeking.
我们将勾勒出一种研究风险选择的方法,它的许多假设来自于对金钱和概率反应的心理物理学分析。 决策的心理物理学方法可以追溯到丹尼尔-伯努利(Daniel Bernoulli)在1738年发表的一篇杰出的文章(Bernoulli 1954),他在文章中试图解释为什么人们普遍厌恶风险以及为什么风险厌恶会随着财富的增加而减少。为了说明风险厌恶和伯努利的分析,考虑在一个有85%的机会赢得1000美元(有15%的机会一无所获)的前景和一个肯定能得到800美元的选择之间进行选择。绝大多数人都喜欢稳赚不赔的东西,而不是赌博,尽管 ,赌博的期望值更高(数学)。货币赌博的期望值是一个加权平均值,每个可能的结果都按其发生的概率进行加权。在这个例子中,赌博的期望值是0.85×1,000美元+0.15×0美元=850美元,这超过了与肯定收益相关的800美元的期望值。对肯定收益的偏好是风险规避的一个实例。 一般来说,对确定的结果而不是对具有较高或相等期望值的赌博的偏好被称为风险厌恶,而拒绝确定的东西而选择较低或相等期望值的赌博被称为风险寻求。
Bernoulli suggested that people do not evaluate prospects by the expectation of their monetary outcomes, but rather by the expectation of the subjective value of these outcomes. The subjective value of a gamble is again a weighted average, but now it is the subjective value of each outcome that is weighted by its probability. To explain risk aversion within this framework, Bernoulli proposed that subjective value, or utility, is a concave function of money. In such a function, the difference between the utilities of $200 and $100, for example, is greater than the utility difference between $1,200 and $1,100. It follows from concavity that the subjective value attached to a gain of $800 is more than 80% of the value of a gain of $1,000. Consequently, the concavity of the utility function entails a risk averse preference for a sure gain of $800 over an 80% chance to win $1,000, although the two prospects have the same monetary expectation.
伯努利提出,人们不是通过对其货币结果的预期来评估前景,而是通过对这些 结果的主观价值的预期。赌博的主观价值又是一个加权平均值,但现在是每个结果的主观价值被其概率所加权。为了在这个框架内解释风险厌恶,伯努利提出,主观价值或效用是一个货币的凹函数。在这样一个函数中,例如,200美元和100美元的效用之差,大于1200美元和1100美元的效用之差 。从凹性可以看出,800美元收益的主观价值是1000美元收益价值的80%以上。因此,效用函数的凹性意味着对800美元的肯定收益的风险厌恶,而不是对赢得1000美元的80%机会的风险厌恶,尽管这两种前景具有相同的货币预期。
It is customary in decision analysis to describe the outcomes of decisions in terms of total wealth. For example, an offer to bet $20 on the toss of a fair coin is represented as a choice between an individual’s current wealth W and an even chance to move to W + $20 or to W − $20. This representation appears psychologically unrealistic: People do not normally think of relatively small outcomes in terms of states of wealth but rather in terms of gains, losses, and neutral outcomes (such as the maintenance of the status quo). If the effective carriers of subjective value are changes of wealth rather than ultimate states of wealth, as we propose, the psychophysical analysis of outcomes should be applied to gains and losses rather than to total assets. This assumption plays a central role in a treatment of risky choice that we called prospect theory (Kahneman and Tversky 1979). Introspection as well as psychophysical measurements suggest that subjective value is a concave function of the size of a gain. The same generalization applies to losses as well. The difference in subjective value between a loss of $200 and a loss of $100 appears greater than the difference in subjective value between a loss of $1,200 and a loss of $1,100. When the value functions for gains and for losses are pieced together, we obtain an S-shaped function of the type displayed in Figure 1 .
在决策分析中,习惯于 ,用总财富来描述决策的结果。例如,在抛出一枚公平的硬币上押注20美元的提议被表示为在个人当前的财富 W 和移动到 W +20美元或 W -20美元的偶数机会之间的选择。这种表述在心理学上似乎是不现实的。人们通常不会从财富的状态 ,而是从收益、损失和中性结果(如维持现状)的角度来考虑相对较小的结果。如果主观价值的有效载体是财富的变化,而不是财富的最终状态,正如我们所提议的那样,对结果的心理物理分析应该适用于收益和损失,而不是总资产。这一假设在我们称之为前景理论(Kahneman and Tversky 1979)的对 风险选择的处理中起到了核心作用。自省以及心理物理测量表明 ,主观价值是收益大小的一个凹函数。同样的概括也适用于损失。损失200美元和损失100美元之间的主观价值差异似乎比损失1200美元 和损失1100美元之间的主观价值差异更大。当收益和损失的价值函数被拼凑在一起时,我们得到一个 图1 中显示的S型函数。

Figure 1. A Hypothetical Value Function
图1.假设的价值函数
The value function shown in Figure 1 is (a) defined on gains and losses rather than on total wealth, (b) concave in the domain of gains and convex in the domain of losses, and (c) considerably steeper for losses than for gains. The last property, which we label loss aversion , expresses the intuition that a loss of $X is more aversive than a gain of $X is attractive. Loss aversion explains people’s reluctance to bet on a fair coin for equal stakes: The attractiveness of the possible gain is not nearly sufficient to compensate for the aversiveness of the possible loss. For example, most respondents in a sample of undergraduates refused to stake $10 on the toss of a coin if they stood to win less than $30.
图1 所示的价值函数是(a)定义在收益和损失上,而不是总财富上,(b)在收益领域内是凹的,在损失领域内是凸的,以及(c)损失比收益要陡峭得多。最后一个属性,我们称之为 损失厌恶 ,表达了这样一种直觉:损失X美元比收益X美元 ,更令人厌恶。损失厌恶解释了人们不愿意在一个公平的硬币上投注相同的赌注。可能的收益的吸引力几乎不足以补偿可能的损失的厌恶感。例如,在一个本科生样本中,如果他们赢得的钱少于30美元,大多数受访者都拒绝在抛掷硬币上押10美元。
The assumption of risk aversion has played a central role in economic theory. However, just as the concavity of the value of gains entails risk aversion, the convexity of the value of losses entails risk seeking. Indeed, risk seeking in losses is a robust effect, particularly when the probabilities of loss are substantial. Consider, for example, a situation in which an individual is forced to choose between an 85% chance to lose $1,000 (with a 15% chance to lose nothing) and a sure loss of $800. A large majority of people express a preference for the gamble over the sure loss. This is a risk seeking choice because the expectation of the gamble (−$850) is inferior to the expectation of the sure loss (−$800). Risk seeking in the domain of losses has been confirmed by several investigators (Fishburn and Kochenberger 1979; Hershey and Schoemaker 1980; Payne, Laughhunn, and Crum 1980; Slovic, Fischhoff, and Lichtenstein 1982). It has also been observed with nonmonetary outcomes, such as hours of pain (Eraker and Sox 1981) and loss of human lives (Fischhoff 1983; Tversky 1977; Tversky and Kahneman 1981). Is it wrong to be risk averse in the domain of gains and risk seeking in the domain of losses? These preferences conform to compelling intuitions about the subjective value of gains and losses, and the presumption is that people should be entitled to their own values. However, we shall see that an S-shaped value function has implications that are normatively unacceptable.
风险规避的假设在经济理论中起到了核心作用, 。然而,正如收益价值的凹性带来了风险规避一样,损失价值的凸性也带来了风险寻求。事实上,损失中的风险寻求是一种强有力的影响,特别是当损失的概率很大时。例如,考虑这样一种情况:一个人被迫在85%的机会失去1000美元 (15%的机会一无所获)和800美元的肯定损失之间做出选择。大多数人都表示倾向于选择赌博而不是肯定的损失。这是一个寻求风险的选择,因为赌博的期望值(-850美元)比肯定损失的期望值(-800美元)要低。一些研究者已经证实了在损失领域的风险寻求(Fishburn和Kochenberger 1979;Hershey 和Schoemaker 1980;Payne, Laughhunn, and Crum 1980;Slovic, Fischhoff, and Lichtenstein 1982)。它也被观察到与非货币的结果,如疼痛的时间(Eraker和Sox 1981)和人命的损失(Fischhoff 1983;Tversky 1977;Tversky和Kahneman 1981)。在收益领域厌恶风险,在损失领域寻求风险是错误的吗?这些偏好符合 ,符合关于收益和损失的主观价值的令人信服的直觉,并且假定人们应该有权获得他们自己的价值。然而,我们将看到,S型价值函数具有规范上不可接受的含义。
To address the normative issue we turn from psychology to decision theory. Modern decision theory can be said to begin with the pioneering work of von Neumann and Morgenstern (1947), who laid down several qualitative principles, or axioms, that should govern the preferences of a rational decision maker. Their axioms included transitivity (if A is preferred to B and B is preferred to C, then A is preferred to C), and substitution (if A is preferred to B, then an even chance to get A or C is preferred to an even chance to get B or C), along with other conditions of a more technical nature. The normative and the descriptive status of the axioms of rational choice have been the subject of extensive discussions. In particular, there is convincing evidence that people do not always obey the substitution axiom, and considerable disagreement exists about the normative merit of this axiom (e.g., Allais and Hagen 1979). However, all analyses of rational choice incorporate two principles: dominance and invariance. Dominance demands that if prospect A is at least as good as prospect B in every respect and better than B in at least one respect, then A should be preferred to B. Invariance requires that the preference order between prospects should not depend on the manner in which they are described. In particular, two versions of a choice problem that are recognized to be equivalent when shown together should elicit the same preference even when shown separately. We now show that the requirement of invariance, however elementary and innocuous it may seem, cannot generally be satisfied.
为了 解决规范性问题,我们从心理学转向了决策理论。现代决策理论可以说是始于冯-诺伊曼和莫根斯坦(1947)的开创性工作 ,他们规定了几个定性原则,或公理,应该支配一个理性决策者的偏好。他们的公理包括转折性(如果A比B更受欢迎,B比C更受欢迎,那么A就比C更受欢迎)和替代性(如果A比B更受欢迎,那么得到A或C的偶数机会就比得到B或C的偶数机会更受欢迎),以及 ,还有其他更技术性的条件。理性选择公理的规范性和描述性地位一直是广泛讨论的主题。特别是,有令人信服的证据表明,人们并不总是遵守替代公理,而且对这一公理的规范性价值存在相当大的分歧(例如,Allais和Hagen 1979)。然而,所有对理性选择的分析 ,都包含两个原则:支配性和不变性。支配性要求,如果前景A在各方面都至少和前景B一样好,而且至少在一个方面比B好,那么A就应该比B更受欢迎。不变性要求,前景之间的偏好顺序不应该取决于描述它们的方式。特别是,一个 选择问题的两个版本,如果在一起显示时被认为是等价的,那么即使在分开显示时也应该引起相同的偏好。我们现在表明,不管看起来多么简单和无害,不变性的要求一般都不能满足。
FRAMING OF OUTCOMES
结果的框架
Risky prospects are characterized by their possible outcomes and by the probabilities of these outcomes. The same option, however, can be framed or described in different ways (Tversky and Kahneman 1981). For example, the possible outcomes of a gamble can be framed either as gains and losses relative to the status quo or as asset positions that incorporate initial wealth. Invariance requires that such changes in the description of outcomes should not alter the preference order. The following pair of problems illustrates a violation of this requirement. The total number of respondents in each problem is denoted by N , and the percentage who chose each option is indicated in parentheses.
风险前景的特点是其可能的结果和这些结果的概率。然而,同样的选择, ,可以用不同的方式来框定或描述(Tversky和Kahneman 1981)。例如,一个赌博的可能结果可以被设定为相对于现状的收益和损失,或者被设定为包含初始财富的资产状况。不变性要求对结果描述的这种变化不应改变偏好顺序。下面这对问题说明了 ,违反了这一要求。每个问题的受访者总数用 N 表示,选择每个选项的百分比在括号中表示。
Problem 1 ( N = 152): Imagine that the U.S. is preparing for the outbreak of an unusual Asian disease, which is expected to kill 600 people. Two alternative programs to combat the disease have been proposed. Assume that the exact scientific estimates of the consequences of the programs are as follows:
问题1 (N= 152)。想象一下,美国正在为一种不寻常的亚洲疾病的爆发做准备,这种疾病预计会杀死600人。已经提出了两种防治该疾病的替代方案。假设对这些方案的后果的准确 科学估计如下。
If Program A is adopted, 200 people will be saved. (72%)
如果方案A被采纳,将有200人获救。(72%)
If Program B is adopted, there is a one-third probability that 600 people will be saved and a two-thirds probability that no people will be saved. (28%)
如果采用方案B,有三分之一的概率会有600人获救,三分之二的概率不会有人获救。(28%)
Which of the two programs would you favor?
在这两个方案中,你会赞成哪一个?
The formulation of Problem 1 implicitly adopts as a reference point a state of affairs in which the disease is allowed to take its toll of 600 lives. The outcomes of the programs include the reference state and two possible gains, measured by the number of lives saved. As expected, preferences are risk averse: A clear majority of respondents prefer saving 200 lives for sure over a gamble that offers a one-third chance of saving 600 lives. Now consider another problem in which the same cover story is followed by a different description of the prospects associated with the two programs:
问题1的表述隐含地采用了一个参考点 ,即允许疾病夺走600条生命的状态。方案的结果包括参考状态和两种可能的收益,以拯救的生命数量来衡量。正如预期的那样,人们的偏好是厌恶风险的。显然,大多数受访者更愿意肯定地拯救200条生命,而不是有三分之一机会拯救600条生命的赌博。现在考虑另一个问题 ,在这个问题中,同样的封面故事之后是对与这两个项目相关的前景的不同描述。
Problem 2 ( N = 155):
问题2 (N = 155)。
If Program C is adopted, 400 people will die. (22%)
如果方案C被采纳,将有400人死亡。(22%)
If Program D is adopted, there is a one-third probability that nobody will die and a two-thirds probability that 600 people will die. (78%)
如果采用方案D,有三分之一的概率无人死亡,三分之二的概率有600人死亡。(78%)
It is easy to verify that options C and D in Problem 2 are undistinguishable in real terms from options A and B in Problem 1, respectively. The second version, however, assumes a reference state in which no one dies of the disease. The best outcome is the maintenance of this state and the alternatives are losses measured by the number of people that will die of the disease. People who evaluate options in these terms are expected to show a risk seeking preference for the gamble (option D) over the sure loss of 400 lives. Indeed, there is more risk seeking in the second version of the problem than there is risk aversion in the first.
很容易验证,问题2中的选项C和D与问题1中的选项A和B在实际情况下是 ,没有区别。然而,第二个版本假设了一个参考状态,即没有人死于这种疾病。最好的结果是维持这个状态,而替代方案是以死于疾病的人数来衡量的损失。以这些条件评估选项的人预计会表现出寻求风险的偏好, ,选择赌博(选项D)而不是肯定损失400条生命。事实上,在问题的第二个版本中,寻求风险的情况比第一个版本中规避风险的情况更多。
The failure of invariance is both pervasive and robust. It is as common among sophisticated respondents as among naive ones, and it is not eliminated even when the same respondents answer both questions within a few minutes. Respondents confronted with their conflicting answers are typically puzzled. Even after rereading the problems, they still wish to be risk averse in the “lives saved” version; they wish to be risk seeking in the “lives lost” version; and they also wish to obey invariance and give consistent answers in the two versions. In their stubborn appeal, framing effects resemble perceptual illusions more than computational errors.
不变性的失败既普遍又强大。它在成熟的受访者中和在天真的受访者中一样普遍,即使同一受访者在几分钟内回答了两个问题,它也不会被消除。 受访者在面对他们相互矛盾的答案时通常会感到困惑。即使在重读问题之后,他们仍然希望在 "拯救生命 "的版本中厌恶风险;他们希望在 "失去生命 "的版本中寻求风险;他们还希望遵守不变性,在两个版本中给出一致的答案。就其顽固的吸引力而言,框架效应比计算错误更类似于知觉错觉, 。
The following pair of problems elicits preferences that violate the dominance requirement of rational choice.
下面这对问题引起的偏好违反了理性选择的支配性要求。
Problem 3 ( N = 86): Choose between:
问题3 (N =86)。在下列选项中选择。
E. 25% chance to win $240 and 75% chance to lose $760 (0%)
E.25%的机会赢得240美元,75%的机会失去760美元(0%)。
F. 25% chance to win $250 and 75% chance to lose $750 (100%)
F.25%的机会赢得250美元,75%的机会失去750美元(100%)。
It is easy to see that F dominates E. Indeed, all respondents chose accordingly.
不难看出,F在E中占主导地位。事实上,所有受访者都做出了相应的选择。
Problem 4 ( N = 150): Imagine that you face the following pair of concurrent decisions. First examine both decisions, then indicate the options you prefer.
问题4 (N = 150)。想象一下,你面临以下一对同时进行的决策。首先审查这两个决定,然后指出你喜欢的选项。
Decision (i) Choose between:
决定 (i) 在以下方面选择。
A. a sure gain of $240 (84%)
A. 肯定收益为240美元(84%)。
B. 25% chance to gain $1,000 and 75% chance to gain nothing (16%) Decision (ii) Choose between:
B.25%的机会获得1000美元,75%的机会一无所获(16%) 决定 (ii) 在两者之间选择。
C. a sure loss of $750 (13%)
C. 肯定损失750美元(13%)。
D. 75% chance to lose $1,000 and 25% chance to lose nothing (87%)
D.75%的机会失去1000美元,25%的机会一无所获(87%)。
As expected from the previous analysis, a large majority of subjects made a risk averse choice for the sure gain over the positive gamble in the first decision, and an even larger majority of subjects made a risk seeking choice for the gamble over the sure loss in the second decision. In fact, 73% of the respondents chose A and D and only 3% chose B and C. The same pattern of results was observed in a modified version of the problem, with reduced stakes, in which undergraduates selected gambles that they would actually play.
正如前面的分析所预期的那样,在第一次决策中,绝大多数受试者对肯定收益的赌博做出了风险规避的选择,而在第二次决策中,更多的受试者对肯定损失的赌博做出了风险寻求的选择。事实上,73%的被试选择了A和D,只有3%的被试选择了B和C。同样的结果模式被观察到 ,在这个问题的修改版本中,降低了赌注,本科生选择了他们实际会玩的赌博。
Because the subjects considered the two decisions in Problem 4 simultaneously, they expressed in effect a preference for A and D over B and C. The preferred conjunction, however, is actually dominated by the rejected one. Adding the sure gain of $240 (option A) to option D yields a 25% chance to win $240 and a 75% chance to lose $760. This is precisely option E in Problem 3. Similarly, adding the sure loss of $750 (option C) to option B yields a 25% chance to win $250 and a 75% chance to lose $750. This is precisely option F in Problem 3. Thus, the susceptibility to framing and the S-shaped value function produce a violation of dominance in a set of concurrent decisions.
由于受试者同时考虑了问题4中的两个决定,他们实际上表达了对A和D的偏爱,而不是对B和C的偏爱,但是偏爱的组合实际上被拒绝的组合所支配。将肯定获得的240美元(选项A)加到 选项D ,就有25%的机会赢得240美元,75%的机会失去760美元。这正是问题3中的选项E。同样的,在选项B中加入750美元的肯定损失(选项C),可以得到25%的机会赢得250美元,75%的机会失去750美元。这正是问题3中的选项F。因此,在一组同时进行的 决策中,对框架的易感性和S型价值函数产生了对支配地位的违反。
The moral of these results is disturbing: Invariance is normatively essential, intuitively compelling, and psychologically unfeasible. Indeed, we conceive only two ways of guaranteeing invariance. The first is to adopt a procedure that will transform equivalent versions of any problem into the same canonical representation. This is the rationale for the standard admonition to students of business, that they should consider each decision problem in terms of total assets rather than in terms of gains or losses (Schlaifer 1959). Such a representation would avoid the violations of invariance illustrated in the previous problems, but the advice is easier to give than to follow. Except in the context of possible ruin, it is more natural to consider financial outcomes as gains and losses rather than as states of wealth. Furthermore, a canonical representation of risky prospects requires a compounding of all outcomes of concurrent decisions (e.g., Problem 4) that exceeds the capabilities of intuitive computation even in simple problems. Achieving a canonical representation is even more difficult in other contexts such as safety, health, or quality of life. Should we advise people to evaluate the consequence of a public health policy (e.g., Problems 1 and 2) in terms of overall mortality, mortality due to diseases, or the number of deaths associated with the particular disease under study?
这些结果的寓意是令人不安的。不变性在规范上是必要的,在直觉上是令人信服的,在心理上是不可行的。事实上,我们只设想了两种保证不变性的方法。第一种是采用一种程序,将任何问题的等价版本转化为相同的典范表示。这就是对学生的标准告诫的理由 ,即他们应该从总资产的角度而不是从收益或损失的角度考虑每个决策问题(Schlaifer 1959)。这样的表述可以避免前面问题中说明的违反不变性的情况,但这个建议说起来容易,做起来难。除了在可能毁灭的情况下,将金融结果视为收益和 损失,而不是财富的状态,是比较自然的。此外,风险前景的典型表示需要对所有同时发生的决策结果进行复合(例如,问题4),即使在简单的问题中也超出了直觉计算的能力。在其他情况下,如安全、健康或生活质量,实现典型代表甚至更加困难。我们是否应该建议 ,以总体死亡率、疾病死亡率或与所研究的特定疾病相关的死亡人数来评估一项公共卫生政策的后果(例如问题1和2)?
Another approach that could guarantee invariance is the evaluation of options in terms of their actuarial rather than their psychological consequences. The actuarial criterion has some appeal in the context of human lives, but it is clearly inadequate for financial choices, as has been generally recognized at least since Bernoulli, and it is entirely inapplicable to outcomes that lack an objective metric. We conclude that frame invariance cannot be expected to hold and that a sense of confidence in a particular choice does not ensure that the same choice would be made in another frame. It is therefore good practice to test the robustness of preferences by deliberate attempts to frame a decision problem in more than one way (Fischhoff, Slovic, and Lichtenstein 1980).
另一种可以保证不变性的方法是用精算而不是心理后果来评价选择。精算标准 ,在人类生活的背景下有一定的吸引力,但对于财务选择来说显然是不够的,至少自伯努利以来人们普遍认识到了这一点,而且它完全不适用于缺乏客观尺度的结果。我们的结论是,不能指望框架不变性成立,对某一特定选择的信心感并不能确保在另一个框架中也会做出同样的选择 。因此,通过刻意尝试以一种以上的方式设定决策问题来测试偏好的稳健性是一种很好的做法(Fischhoff, Slovic, and Lichtenstein 1980)。
THE PSYCHOPHYSICS OF CHANCES
机会的心理物理学
Our discussion so far has assumed a Bernoullian expectation rule according to which the value, or utility, of an uncertain prospect is obtained by adding the utilities of the possible outcomes, each weighted by its probability. To examine this assumption, let us again consult psychophysical intuitions. Setting the value of the status quo at zero, imagine a cash gift, say of $300, and assign it a value of one. Now imagine that you are only given a ticket to a lottery that has a single prize of $300. How does the value of the ticket vary as a function of the probability of winning the prize? Barring utility for gambling, the value of such a prospect must vary between zero (when the chance of winning is nil) and one (when winning $300 is a certainty).
到目前为止,我们的讨论已经假设了伯努利的期望规则,根据该规则,不确定前景的价值或效用是通过将可能的结果的效用 ,每个结果按其概率加权得到的。为了研究这个假设,让我们再次参考心理物理学的直觉。将现状的价值设定为零,想象一个现金礼物,例如300美元,并赋予它1的价值。现在想象一下,你只得到了一张彩票,而这张彩票有300美元的单一奖金。这张彩票的价值是如何随着中奖概率的变化而变化的 ?如果不考虑赌博的效用,这种前景的价值必须在零(当中奖的机会为零时)和一(当赢得300美元是肯定的)之间变化。
Intuition suggests that the value of the ticket is not a linear function of the probability of winning, as entailed by the expectation rule. In particular, an increase from 0% to 5% appears to have a larger effect than an increase from 30% to 35%, which also appears smaller than an increase from 95% to 100%. These considerations suggest a category-boundary effect: A change from impossibility to possibility or from possibility to certainty has a bigger impact than a comparable change in the middle of the scale. This hypothesis is incorporated into the curve displayed in Figure 2 , which plots the weight attached to an event as a function of its stated numerical probability. The most salient feature of Figure 2 is that decision weights are regressive with respect to stated probabilities. Except near the endpoints, an increase of .05 in the probability of winning increases the value of the prospect by less than 5% of the value of the prize. We next investigate the implications of these psychophysical hypotheses for preferences among risky options.
直觉表明,彩票的价值并不是中奖概率的线性函数,正如预期规则所要求的那样。特别是,从0%增加到5%似乎比从30%增加到35%有更大的影响 ,而后者也似乎比从95%增加到100%小。这些考虑表明了一个类别的边界效应。从不可能到可能或从可能到肯定的变化,比规模中间的可比变化有更大的影响。这一假设被纳入了 图2 中显示的曲线 ,该曲线将一个事件的权重 ,作为其所述数字概率的函数。 图2 最突出的特点是,决策权重相对于陈述的概率而言是递减的。除了在终点附近,获胜概率每增加0.05,前景的价值就会增加不到奖金价值的5%。我们接下来研究这些心理物理学假设的影响 ,以了解风险选项的偏好。

Figure 2. A Hypothetical Weighting Function
图2.假设的加权函数
In Figure 2 , decision weights are lower than the corresponding probabilities over most of the range. Underweighting of moderate and high probabilities relative to sure things contributes to risk aversion in gains by reducing the attractiveness of positive gambles. The same effect also contributes to risk seeking in losses by attenuating the aversiveness of negative gambles. Low probabilities, however, are overweighted, and very low probabilities are either overweighted quite grossly or neglected altogether, making the decision weights highly unstable in that region. The overweighting of low probabilities reverses the pattern described above: It enhances the value of long shots and amplifies the aversiveness of a small chance of a severe loss. Consequently, people are often risk seeking in dealing with improbable gains and risk averse in dealing with unlikely losses. Thus, the characteristics of decision weights contribute to the attractiveness of both lottery tickets and insurance policies.
在 图2 中,决策权重在大部分范围内都低于相应的概率。相对于确定的事情,中等和高概率的权重过低,通过减少正面赌博的吸引力,有助于在收益中规避风险。同样的效果也有助于在损失中寻求风险,因为它削弱了消极赌博的厌恶感 。然而,低概率会被加权,非常低的概率要么被严重加权,要么被完全忽略,使得决策权重在该区域非常不稳定。对低概率的加权扭转了上述的模式。它提高了长线投资的价值,放大了小概率严重损失的反作用。 因此,人们在处理不可能的收益时往往会寻求风险,而在处理不可能的损失时则会厌恶风险。因此,决策权重的特点促成了彩票和保险单的吸引力。
The nonlinearity of decision weights inevitably leads to violations of invariance, as illustrated in the following pair of problems:
决策权重的非线性不可避免地导致了对不变性的违反,正如下面这对问题所说明的。
Problem 5 ( N = 85): Consider the following two-stage game. In the first stage, there is a 75% chance to end the game without winning anything and a 25% chance to move into the second stage. If you reach the second stage you have a choice between:
问题 5 (N =85)。考虑下面这个两阶段的游戏。在第一阶段,有75%的机会在没有赢得任何东西的情况下结束游戏,有25%的机会进入第二阶段。如果你进入第二阶段,你可以选择。
A. a sure win of $30 (74%)
A. 稳赢的30美元(74%)。
B. 80% chance to win $45 (26%)
B.80%的机会赢得45美元(26%)。
Your choice must be made before the game starts, i.e., before the outcome of the first stage is known. Please indicate the option you prefer.
你的选择必须在比赛开始前做出,即在第一阶段的结果被 之前。请指出你喜欢的选项。
Problem 6 ( N = 81): Which of the following options do you prefer?
问题6 (N =81)。你喜欢以下哪种方案?
C. 25% chance to win $30 (42%)
C.25%的机会赢得30美元(42%)。
D. 20% chance to win $45 (58%)
D.20%的机会赢得45美元(58%)。
Because there is one chance in four to move into the second stage in Problem 5, prospect A offers a .25 probability of winning $30, and prospect B offers .25 × .80 = .20 probability of winning $45. Problems 5 and 6 are therefore identical in terms of probabilities and outcomes. However, the preferences are not the same in the two versions: A clear majority favors the higher chance to win the smaller amount in Problem 5, whereas the majority goes the other way in Problem 6. This violation of invariance has been confirmed with both real and hypothetical monetary payoffs (the present results are with real money), with human lives as outcomes, and with a nonsequential representation of the chance process.
因为在问题5中,有四分之一的机会进入第二阶段,所以前景A有0.25的概率赢得30美元,而前景B有0.25×0.80=0.20的概率赢得45美元。因此,问题5和问题6在概率和结果方面是相同的 。然而,这两个版本的偏好是不一样的。在问题5中,明显多数人倾向于赢得较小数额的机会,而在问题6中,多数人则倾向于另一种方式。这种违反不变性的情况已经在真实和假设的货币报酬(目前的结果是真实的货币)、 ,以人的生命作为结果,以及用机会过程的非顺序表示法得到了证实。
We attribute the failure of invariance to the interaction of two factors: the framing of probabilities and the nonlinearity of decision weights. More specifically, we propose that in Problem 5 people ignore the first phase, which yields the same outcome regardless of the decision that is made, and focus their attention on what happens if they do reach the second stage of the game. In that case, of course, they face a sure gain if they choose option A and an 80% chance of winning if they prefer to gamble. Indeed, people’s choices in the sequential version are practically identical to the choices they make between a sure gain of $30 and an 85% chance to win $45. Because a sure thing is overweighted in comparison with events of moderate or high probability (see figure 2 ), the option that may lead to a gain of $30 is more attractive in the sequential version. We call this phenomenon the pseudo-certainty effect because an event that is actually uncertain is weighted as if it were certain.
我们将不变性的失败归咎于两个因素的相互作用:概率的框架和决策权重的非线性。更具体地说,我们提出,在问题5中,人们忽略了第一阶段,无论做出什么决定,都会产生相同的结果,而把注意力集中在 ,如果他们真的进入游戏的第二阶段会发生什么。当然,在这种情况下,如果他们选择A选项,他们将面临肯定的收益,如果他们选择赌博,则有80%的机会获胜。事实上,人们在顺序版本中的选择与他们在30美元的肯定收益和45美元的85%的赢钱机会之间的选择几乎是一样的。因为与中度或高度概率的事件相比,确定的事情被高估了 ( 见图2 ),可能导致30美元收益的选项在顺序版本中更有吸引力。我们称这种现象为伪确定性效应,因为一个实际上不确定的事件被加权,就像它是确定的一样。
A closely related phenomenon can be demonstrated at the low end of the probability range. Suppose you are undecided whether or not to purchase earthquake insurance because the premium is quite high. As you hesitate, your friendly insurance agent comes forth with an alternative offer: “For half the regular premium you can be fully covered if the quake occurs on an odd day of the month. This is a good deal because for half the price you are covered for more than half the days.” Why do most people find such probabilistic insurance distinctly unattractive? Figure 2 suggests an answer. Starting anywhere in the region of low probabilities, the impact on the decision weight of a reduction of probability from p to p /2 is considerably smaller than the effect of a reduction from p /2 to 0. Reducing the risk by half, then, is not worth half the premium.
一个密切相关的现象可以在概率范围的低端表现出来。假设你 ,不知道是否要购买地震保险,因为保险费相当高。就在你犹豫不决的时候,你的友好的保险代理人提出了一个替代的提议。"如果地震发生在每月的奇数天,你只需支付正常保费的一半就可以得到完全的保障。这是一个很好的交易,因为用一半的价格,你就可以得到一半以上天数的保障。"为什么大多数人发现 ,这种概率性保险明显没有吸引力? 图2 提出了一个答案。从低概率区域的任何地方开始,将概率从 p 减少到 p/2 对决策权重的影响要比从 p/2 减少到0的影响小得多。
The aversion to probabilistic insurance is significant for three reasons. First, it undermines the classical explanation of insurance in terms of a concave utility function. According to expected utility theory, probabilistic insurance should be definitely preferred to normal insurance when the latter is just acceptable (see Kahneman and Tversky 1979). Second, probabilistic insurance represents many forms of protective action, such as having a medical checkup, buying new tires, or installing a burglar alarm system. Such actions typically reduce the probability of some hazard without eliminating it altogether. Third, the acceptability of insurance can be manipulated by the framing of the contingencies. An insurance policy that covers fire but not flood, for example, could be evaluated either as full protection against a specific risk (e.g., fire), or as a reduction in the overall probability of property loss. Figure 2 suggests that people greatly undervalue a reduction in the probability of a hazard in comparison to the complete elimination of that hazard. Hence, insurance should appear more attractive when it is framed as the elimination of risk than when it is described as a reduction of risk. Indeed, Slovic, Fischhoff, and Lichtenstein (1982) showed that a hypothetical vaccine that reduces the probability of contracting a disease from 20% to 10% is less attractive if it is described as effective in half of the cases than if it is presented as fully effective against one of two exclusive and equally probable virus strains that produce identical symptoms.
对概率性保险的厌恶是重要的 ,原因有三。首先,它破坏了以凹形效用函数来解释保险的经典方法。根据预期效用理论,当正常保险只是可以接受时,概率性保险应该肯定比正常保险更受欢迎(见Kahneman和Tversky 1979)。其次,概率保险代表了许多形式的保护性行动,例如有 体检,购买新轮胎,或安装防盗报警系统。这种行动通常会减少一些危险的概率,而不会完全消除它。第三,保险的可接受性可以通过突发事件的框架来操纵。例如,一份包括火灾但不包括水灾的保险可以被评估为对某一特定风险的全面保护(例如, 火灾),或者被评估为对财产损失的总体概率的降低。 图2 表明,与完全消除危险相比,人们大大低估了减少危险概率的价值。因此,当保险被描述为消除风险时,应该比被描述为减少风险时更有吸引力。事实上,Slovic, Fischhoff, and Lichtenstein (1982)表明,一种假设的疫苗,如果被描述为对一半的病例有效,那么它的吸引力就不如被描述为对两种产生相同症状的、可能性相同的病毒株之一完全有效。
FORMULATION EFFECTS
配方效应
So far we have discussed framing as a tool to demonstrate failures of invariance. We now turn attention to the processes that control the framing of outcomes and events. The public health problem illustrates a formulation effect in which a change of wording from “lives saved” to “lives lost” induced a marked shift of preference from risk aversion to risk seeking. Evidently, the subjects adopted the descriptions of the outcomes as given in the question and evaluated the outcomes accordingly as gains or losses. Another formulation effect was reported by McNeil, Pauker, Sox, and Tversky (1982). They found that preferences of physicians and patients between hypothetical therapies for lung cancer varied markedly when their probable outcomes were described in terms of mortality or survival. Surgery, unlike radiation therapy, entails a risk of death during treatment. As a consequence, the surgery option was relatively less attractive when the statistics of treatment outcomes were described in terms of mortality rather than in terms of survival.
到目前为止,我们已经讨论了作为证明 不变性的失败的工具的框架。我们现在把注意力转向控制结果和事件的框架的过程。公共卫生问题说明了一种表述效果,即把措辞从 "拯救的生命 "改为 "失去的生命",就会使人们的偏好从厌恶风险转向寻求风险。显然,被试采用了问题中对结果的描述 ,并相应地将结果评价为收益或损失。McNeil、Pauker、Sox和Tversky(1982)报告了另一种表述效果。他们发现,当肺癌的可能结果被描述为死亡率或存活率时,医生和病人对假想疗法的偏好有明显的不同。手术与放疗不同,在治疗期间会有死亡的风险 。因此,当治疗结果的统计数字是以死亡率而不是以生存率来描述时,手术方案的吸引力相对较小。
A physician, and perhaps a presidential advisor as well, could influence the decision made by the patient or by the President, without distorting or suppressing information, merely by the framing of outcomes and contingencies. Formulation effects can occur fortuitously, without anyone being aware of the impact of the frame on the ultimate decision. They can also be exploited deliberately to manipulate the relative attractiveness of options. For example, Thaler (1980) noted that lobbyists for the credit card industry insisted that any price difference between cash and credit purchases be labeled a cash discount rather than a credit card surcharge. The two labels frame the price difference as a gain or as a loss by implicitly designating either the lower or the higher price as normal. Because losses loom larger than gains, consumers are less likely to accept a surcharge than to forgo a discount. As is to be expected, attempts to influence framing are common in the marketplace and in the political arena.
一个医生,也许还有一个总统顾问,可以在不歪曲或压制信息的情况下,仅仅通过对结果和突发事件的框架 ,影响病人或总统的决定。框架效应可以偶然发生,没有人意识到框架对最终决定的影响。它们也可以被故意利用来操纵选项的相对吸引力。例如,Thaler(1980)指出,信用卡行业的游说者坚持认为,现金和信用卡购买之间的任何价格差异都应该被贴上 ,即现金折扣而不是信用卡附加费。这两个标签通过隐含地指定较低或较高的价格为正常价格,将价格差异框定为一种收益或损失。由于损失比收益更大,消费者不太可能接受附加费而放弃折扣。可以预见的是,在市场和 政治舞台上,试图影响框架的做法是很常见的。
The evaluation of outcomes is susceptible to formulation effects because of the nonlinearity of the value function and the tendency of people to evaluate options in relation to the reference point that is suggested or implied by the statement of the problem. It is worthy of note that in other contexts people automatically transform equivalent messages into the same representation. Studies of language comprehension indicate that people quickly recode much of what they hear into an abstract representation that no longer distinguishes whether the idea was expressed in an active or in a passive form and no longer discriminates what was actually said from what was implied, presupposed, or implicated (Clark and Clark 1977). Unfortunately, the mental machinery that performs these operations silently and effortlessly is not adequate to perform the task of recoding the two versions of the public health problem or the mortality survival statistics into a common abstract form.
由于价值函数的非线性,以及人们倾向于根据问题陈述所建议或暗示的参考点来评价选项,因此对结果的评价很容易受到表述效果的影响。值得注意的是,在其他情况下,人们会自动将同等的信息转化为相同的表述。 对语言理解的研究表明,人们很快就会把他们听到的大部分内容重新编码为一种抽象的表征,不再区分这个想法是以主动还是被动的形式表达的,也不再区分实际说了什么和隐含的、预设的或暗示的内容(Clark and Clark 1977)。不幸的是,默默地、毫不费力地执行 这些操作的心理机器,并不足以完成将公共卫生问题的两个版本或死亡率生存统计数字重新编码为一个共同的抽象形式的任务。
TRANSACTIONS AND TRADES
交易和贸易
Our analysis of framing and of value can be extended to choices between multiattribute options, such as the acceptability of a transaction or a trade. We propose that, in order to evaluate a multiattribute option, a person sets up a mental account that specifies the advantages and the disadvantages associated with the option, relative to a multiattribute reference state. The overall value of an option is given by the balance of its advantages and its disadvantages in relation to the reference state. Thus, an option is acceptable if the value of its advantages exceeds the value of its disadvantages. This analysis assumes psychological—but not physical—separability of advantages and disadvantages. The model does not constrain the manner in which separate attributes are combined to form overall measures of advantage and of disadvantage, but it imposes on these measures assumptions of concavity and of loss aversion.
我们对框架和价值的分析可以扩展到多属性选择之间的选择,比如一项交易或贸易的可接受性。我们建议, ,为了评估 一个多属性的选择,一个人建立了一个心理账户,明确了与该选择相关的优点和缺点,相对于一个多属性的参考状态。一个选项的总体价值是由其相对于参考状态的优势和劣势的平衡来决定的。因此,如果一个选项的优势价值 ,超过其劣势价值,那么这个选项是可以接受的。这个分析假设了优势和劣势在心理上的可分性,但不是物理上的可分性。这个模型并不限制将不同的属性结合起来形成优势和劣势的总体衡量标准,但它对这些衡量标准施加了凹陷性和损失厌恶的假设。
Our analysis of mental accounting owes a large debt to the stimulating work of Richard Thaler (1980, 1985), who showed the relevance of this process to consumer behavior. The following problem, based on examples of Savage (1954) and Thaler (1980), introduces some of the rules that govern the construction of mental accounts and illustrates the extension of the concavity of value to the acceptability of transactions.
我们对心理账户的分析 ,这要归功于理查德-塞勒(Richard Thaler)(1980,1985)的激励性工作,他表明了这一过程与消费者行为的相关性。下面的问题,基于萨维奇(1954)和塞勒(1980)的例子,介绍了一些管理心理账户构建的规则,并说明了价值的凹陷性对交易可接受性的延伸。
Problem 7: Imagine that you are about to purchase a jacket for $125 and a calculator for $15. The calculator salesman informs you that the calculator you wish to buy is on sale for $10 at the other branch of the store, located 20 minutes’ drive away. Would you make a trip to the other store?
问题7:想象一下 ,你准备购买一件125美元的夹克衫和一个15美元的计算器。计算器销售员告诉你,你想买的计算器在20分钟车程外的另一家商店里以10美元的价格出售。你会去另一家店吗?
This problem is concerned with the acceptability of an option that combines a disadvantage of inconvenience with a financial advantage that can be framed as a minimal, topical, or comprehensive account. The minimal account includes only the differences between the two options and disregards the features that they share. In the minimal account, the advantage associated with driving to the other store is framed as a gain of $5. A topical account relates the consequences of possible choices to a reference level that is determined by the context within which the decision arises. In the preceding problem, the relevant topic is the purchase of the calculator, and the benefit of the trip is therefore framed as a reduction of the price, from $15 to $10. Because the potential saving is associated only with the calculator, the price of the jacket is not included in the topical account. The price of the jacket, as well as other expenses, could well be included in a more comprehensive account in which the saving would be evaluated in relation to, say, monthly expenses.
这个问题涉及到一个选项的可接受性,这个选项结合了不便的缺点和财务 ,可以被框定为最小的、局部的或全面的账户。最低限度的说法只包括两个选项之间的差异,而忽略了它们的共同特征。在最低限度的描述中,开车去另一家商店的优势被描述为5美元的收益。专题描述将可能的选择的后果与一个参考水平联系起来,这个参考水平是由决策产生的背景决定的 。在前面的问题中,相关的主题是购买计算器,因此旅行的好处被设定为价格的减少,从15美元减少到10美元。因为潜在的节省只与计算器有关,所以外套的价格不包括在主题账户中。外套的价格,以及 其他费用,完全可以包括在一个更全面的账户中,在这个账户中,节省的费用将被评估,例如,每月的开支。
The formulation of the preceding problem appears neutral with respect to the adoption of a minimal, topical, or comprehensive account. We suggest, however, that people will spontaneously frame decisions in terms of topical accounts that, in the context of decision making, play a role analogous to that of “good forms” in perception and of basic-level categories in cognition. Topical organization, in conjunction with the concavity of value, entails that the willingness to travel to the other store for a saving of $5 on a calculator should be inversely related to the price of the calculator and should be independent of the price of the jacket. To test this prediction, we constructed another version of the problem in which the prices of the two items were interchanged. The price of the calculator was given as $125 in the first store and $120 in the other branch, and the price of the jacket was set at $15. As predicted, the proportions of respondents who said they would make the trip differed sharply in the two problems. The results showed that 68% of the respondents ( N = 88) were willing to drive to the other branch to save $5 on a $15 calculator, but only 29% of 93 respondents were willing to make the same trip to save $5 on a $125 calculator. This finding supports the notion of topical organization of accounts, since the two versions are identical both in terms of a minimal and a comprehensive account.
前面的问题的表述在采用最低限度的、局部的或全面的帐户方面似乎是中性的。然而,我们认为,人们会自发地以专题性的说法来制定决策,在决策的 ,其作用类似于感知中的 "良好形式 "和认知中的基本层次类别。主题组织与价值的凹陷性相结合,意味着为节省5美元的计算器而去另一家商店的意愿应该与计算器的价格成反比,并且应该与外套的价格无关。 为了检验这一预测,我们构建了另一个版本的问题,其中两件物品的价格是互换的。计算器的价格在第一家商店是125美元,在另一家商店是120美元,而夹克的价格被设定为15美元。正如预测的那样,在这两个问题中,说他们会去旅行的受访者的比例有很大的不同。结果 ,68%的受访者 (N =88)愿意开车到另一家分店,为15美元的计算器节省5美元,但93名受访者中只有29%愿意做同样的旅行,为125美元的计算器节省5美元。这一发现支持了账户的专题组织的概念,因为两个版本在最小的账户和综合账户方面都是相同的。
The significance of topical accounts for consumer behavior is confirmed by the observation that the standard deviation of the prices that different stores in a city quote for the same product is roughly proportional to the average price of that product (Pratt, Wise, and Zeckhauser 1979). Since the dispersion of prices is surely controlled by shoppers’ efforts to find the best buy, these results suggest that consumers hardly exert more effort to save $15 on a $150 purchase than to save $5 on a $50 purchase.
对消费者行为的意义 ,这一点被以下观察所证实:一个城市的不同商店对同一产品所报价格的标准偏差与该产品的平均价格大致成正比(Pratt, Wise, and Zeckhauser 1979)。由于价格的分散性肯定是由购物者寻找最佳购买的努力所控制的,这些结果表明,消费者 ,在150美元的购物中节省15美元的努力几乎比在50美元的购物中节省5美元的努力多。
The topical organization of mental accounts leads people to evaluate gains and losses in relative rather than in absolute terms, resulting in large variations in the rate at which money is exchanged for other things, such as the number of phone calls made to find a good buy or the willingness to drive a long distance to get one. Most consumers will find it easier to buy a car stereo system or a Persian rug, respectively, in the context of buying a car or a house than separately. These observations, of course, run counter to the standard rational theory of consumer behavior, which assumes invariance and does not recognize the effects of mental accounting.
心理账户的专题组织导致人们以相对而非绝对的方式来评估收益和损失,从而导致金钱换取其他东西的速度有很大的差异,例如为寻找好的买物而打的电话的数量,或愿意开着 长途跋涉去买。大多数消费者会发现,在买车或买房的情况下,分别购买汽车音响系统或波斯地毯,比单独购买更容易。当然,这些观察结果与消费者行为的标准理性理论背道而驰,该理论假设了不变性,不承认心理核算的影响。
The following problems illustrate another example of mental accounting in which the posting of a cost to an account is controlled by topical organization:
下面的问题说明了另一个 的心智会计的例子,在这个例子中,将一项费用记入一个账户是由专题组织控制的。
Problem 8 ( N = 200): Imagine that you have decided to see a play and paid the admission price of $10 per ticket. As you enter the theater, you discover that you have lost the ticket. The seat was not marked, and the ticket cannot be recovered.
问题8 (N =200)。想象一下,你决定去看一场戏,并支付了每张票10美元的入场费。当你进入剧院时,你发现你的票丢了。座位上没有标记,票也找不到了。
Would you pay $10 for another ticket?
你会花10美元再买一张票吗?
Yes (46%) No (54%)
是(46%) 不是(54%)
Problem 9 ( N = 183): Imagine that you have decided to see a play where admission is $10 per ticket. As you enter the theater, you discover that you have lost a $10 bill.
问题9 (N =183)。想象一下,你决定去看一场戏,每张票是10美元。当你进入剧院时,你发现你丢失了一张10美元的钞票。
Would you still pay $10 for a ticket for the play?
你还会花10美元买一张戏的票吗?
Yes (88%) No (12%)
是(88%) 不是(12%)
The difference between the responses to the two problems is intriguing. Why are so many people unwilling to spend $10 after having lost a ticket, if they would readily spend that sum after losing an equivalent amount of cash? We attribute the difference to the topical organization of mental accounts. Going to the theater is normally viewed as a transaction in which the cost of the ticket is exchanged for the experience of seeing the play. Buying a second ticket increases the cost of seeing the play to a level that many respondents apparently find unacceptable. In contrast, the loss of the cash is not posted to the account of the play, and it affects the purchase of a ticket only by making the individual feel slightly less affluent.
对这两个问题的反应之间的差异是耐人寻味的。为什么有那么多人在输掉 一张彩票后不愿意花10美元,而他们在输掉同等数额的现金后会很乐意花这笔钱?我们把这种差异归因于心理账户的主题组织。去剧院看戏通常被视为一种交易,在这种交易中,门票的费用被用来交换看戏的经历。购买第二张票会使看戏的成本增加到一个许多受访者 ,这显然是不可接受的水平。相比之下,现金的损失并没有被记入戏剧账户,它对购票的影响仅仅是使个人感觉稍微不那么富裕。
An interesting effect was observed when the two versions of the problem were presented to the same subjects. The willingness to replace a lost ticket increased significantly when that problem followed the lost-cash version. In contrast, the willingness to buy a ticket after losing cash was not affected by prior presentation of the other problem. The juxtaposition of the two problems apparently enabled the subjects to realize that it makes sense to think of the lost ticket as lost cash, but not vice versa.
当这两个版本的问题被展示给同一受试者时,观察到了一个有趣的效果。当这个问题 ,紧接着丢失现金的版本时,更换丢失车票的意愿明显增加。相反,丢失现金后买票的意愿并没有受到事先呈现的另一个问题的影响。这两个问题的并列出现,显然使受试者认识到,把丢失的车票看作是丢失的现金是有意义的,但反之则不然。
The normative status of the effects of mental accounting is questionable. Unlike earlier examples, such as the public health problem, in which the two versions differed only in form, it can be argued that the alternative versions of the calculator and ticket problems differ also in substance. In particular, it may be more pleasurable to save $5 on a $15 purchase than on a larger purchase, and it may be more annoying to pay twice for the same ticket than to lose $10 in cash. Regret, frustration, and self-satisfaction can also be affected by framing (Kahneman and Tversky 1982). If such secondary consequences are considered legitimate, then the observed preferences do not violate the criterion of invariance and cannot readily be ruled out as inconsistent or erroneous. On the other hand, secondary consequences may change upon reflection. The satisfaction of saving $5 on a $15 item can be marred if the consumer discovers that she would not have exerted the same effort to save $10 on a $200 purchase. We do not wish to recommend that any two decision problems that have the same primary consequences should be resolved in the same way. We propose, however, that systematic examination of alternative framings offers a useful reflective device that can help decision makers assess the values that should be attached to the primary and secondary consequences of their choices.
心理核算的效果的规范性地位是值得怀疑的。与 早期的例子不同,例如公共卫生问题,其中两个版本只在形式上有差异,可以说计算器和票据问题的替代版本在实质上也有差异。特别是,在15美元的购物中节省5美元可能比在 更大的购物中更令人高兴,而为同一张票支付两次可能比损失10美元的现金更令人讨厌。 后悔、沮丧和自我满足感也会受到框架的影响(Kahneman和Tversky,1982)。如果这样的次要后果被认为是合法的,那么观察到的偏好就不会违反不变性的标准,不能轻易排除不一致或错误的情况。另一方面,次要后果在思考后可能会改变。如果消费者发现她不会为购买200美元的商品节省10美元而付出同样的努力,那么在 15美元的商品上节省5美元的满足感就会受到损害。我们不想建议任何两个具有相同主要后果的决策问题应该以相同的方式解决。然而,我们建议,对替代性框架的系统检查提供了一个有用的反思工具,可以帮助决策者 ,评估他们选择的主要和次要后果所应具备的价值。
LOSSES AND COSTS
损失和费用
Many decision problems take the form of a choice between retaining the status quo and accepting an alternative to it, which is advantageous in some respects and disadvantageous in others. The analysis of value that was applied earlier to unidimensional risky prospects can be extended to this case by assuming that the status quo defines the reference level for all attributes. The advantages of alternative options will then be evaluated as gains and their disadvantages as losses. Because losses loom larger than gains, the decision maker will be biased in favor of retaining the status quo.
许多决策问题的形式是在保留现状和接受现状的替代方案之间进行选择,这种替代方案在某些方面是有利的,在其他方面是不利的。前面应用于单维风险前景的价值分析可以通过假设现状定义了所有属性的参考水平而扩展到这种情况 。然后,备选方案的优势将被评估为收益,其劣势将被评估为损失。由于损失比收益大,决策者会偏向于保留现状。
Thaler (1980) coined the term “endowment effect” to describe the reluctance of people to part from assets that belong to their endowment. When it is more painful to give up an asset than it is pleasurable to obtain it, buying prices will be significantly lower than selling prices. That is, the highest price that an individual will pay to acquire an asset will be smaller than the minimal compensation that would induce the same individual to give up that asset, once acquired. Thaler discussed some examples of the endowment effect in the behavior of consumers and entrepreneurs. Several studies have reported substantial discrepancies between buying and selling prices in both hypothetical and real transactions (Gregory 1983; Hammack and Brown 1974; Knetsch and Sinden 1984). These results have been presented as challenges to standard economic theory, in which buying and selling prices coincide except for transaction costs and effects of wealth. We also observed reluctance to trade in a study of choices between hypothetical jobs that differed in weekly salary ( S ) and in the temperature ( T ) of the workplace. Our respondents were asked to imagine that they held a particular position ( S 1 , T 1 ) and were offered the option of moving to a different position ( S 2 , T 2 ), which was better in one respect and worse in another. We found that most subjects who were assigned to ( S 1 , T 1 ) did not wish to move to ( S 2 , T 2 ), and that most subjects who were assigned to the latter position did not wish to move to the former. Evidently, the same difference in pay or in working conditions looms larger as a disadvantage than as an advantage.
Thaler(1980)创造了 "禀赋效应 "一词来描述 ,人们不愿意放弃属于他们禀赋的资产。当放弃一项资产比获得它的乐趣更痛苦时,购买价格将大大低于销售价格。也就是说,一个人为了获得一项资产所付出的最高价格将小于促使同一个人在获得该资产后放弃该资产的最低补偿。 Thaler讨论了消费者和企业家行为中的禀赋效应的一些例子。一些研究报告称,在假设的和真实的交易中,购买和销售价格之间存在巨大的差异(Gregory 1983;Hammack and Brown 1974;Knetsch and Sinden 1984)。这些结果被认为是对标准经济理论的挑战,在这些理论中,除了交易成本和财富效应外,购买和 销售价格是一致的。我们还在一项研究中观察到,在周薪 (S )和工作场所温度 (T )不同的假想工作之间选择交易的意愿。我们的受访者被要求想象他们担任一个特定的职位 ( S1 , T1 ),并被提供选择转移到一个不同的职位 (S2 , T2 ), ,这个职位在某方面更好,在另方面更差。我们发现,大多数被分配到 ( S1 , T1 )的受试者不希望转到 (S2 , T2 ),而大多数被分配到后一个职位的受试者也不希望转到前一个职位。显而易见,同样的薪酬或工作条件的差异,作为一种劣势比作为一种优势更明显。
In general, loss aversion favors stability over change. Imagine two hedonically identical twins who find two alternative environments equally attractive. Imagine further that by force of circumstance the twins are separated and placed in the two environments. As soon as they adopt their new states as reference points and evaluate the advantages and disadvantages of each other’s environments accordingly, the twins will no longer be indifferent between the two states, and both will prefer to stay where they happen to be. Thus, the instability of preferences produces a preference for stability. In addition to favoring stability over change, the combination of adaptation and loss aversion provides limited protection against regret and envy by reducing the attractiveness of foregone alternatives and of others’ endowments.
一般来说,损失厌恶倾向于 稳定性而不是变化。想象一下,两对享乐主义相同的双胞胎,他们发现两个可供选择的环境同样具有吸引力。想象一下,由于情况所迫,这对双胞胎被分开并被置于这两种环境中。一旦他们把自己的新状态作为参考点,并据此评价对方环境的优劣,这对双胞胎就不再对两种状态无动于衷 ,而都会选择留在他们碰巧所在的地方。因此,偏好的不稳定性 ,产生了对稳定性的偏好。除了偏向于稳定而不是变化之外,适应和损失厌恶的结合,通过减少被放弃的替代方案和他人的禀赋的吸引力,提供了对遗憾和嫉妒的有限保护。
Loss aversion and the consequent endowment effect are unlikely to play a significant role in routine economic exchanges. The owner of a store, for example, does not experience money paid to suppliers as losses and money received from customers as gains. Instead, the merchant adds costs and revenues over some period of time and only evaluates the balance. Matching debits and credits are effectively canceled prior to evaluation. Payments made by consumers are also not evaluated as losses but as alternative purchases. In accord with standard economic analysis, money is naturally viewed as a proxy for the goods and services that it could buy. This mode of evaluation is made explicit when an individual has in mind a particular alternative, such as, “I can either buy a new camera or a new tent.” In this analysis, a person will buy a camera if its subjective value exceeds the value of retaining the money it would cost.
损失规避和随之而来的禀赋效应不太可能在常规经济交流中发挥重要作用。例如,商店的老板不会把支付给供应商的钱当作损失,把从顾客那里得到的钱当作收益。相反,商人在一段时间内增加成本和收入,只对余额进行评估。匹配的借方和贷方在评估前有效地 ,取消了。消费者的付款也没有被评估为损失,而是被评估为替代购买。根据标准的经济分析,货币自然被视为它可以购买的商品和服务的代表。当一个人心中有一个特定的替代物时,这种评价模式就明确了,例如,"我要么买一个新的相机,要么买一个新的帐篷。" 在这种分析中,如果一个人的主观价值超过了保留它所花费的钱的价值,他就会购买一台相机。
There are cases in which a disadvantage can be framed either as a cost or as a loss. In particular, the purchase of insurance can also be framed as a choice between a sure loss and the risk of a greater loss. In such cases the cost-loss discrepancy can lead to failures of invariance. Consider, for example, the choice between a sure loss of $50 and a 25% chance to lose $200. Slovic, Fischhoff, and Lichtenstein (1982) reported that 80% of their subjects expressed a risk-seeking preference for the gamble over the sure loss. However, only 35% of subjects refused to pay $50 for insurance against a 25% risk of losing $200. Similar results were also reported by Schoemaker and Kunreuther (1979) and by Hershey and Schoemaker (1980). We suggest that the same amount of money that was framed as an uncompensated loss in the first problem was framed as the cost of protection in the second. The modal preference was reversed in the two problems because losses are more aversive than costs.
在有些情况下,劣势既可以被描述为成本,也可以被描述为损失。特别是,购买保险也可以被说成是在肯定的损失和更大的损失的风险之间的选择。在这种情况下,成本-损失的差异可能会导致失败 的不变性。例如,考虑在肯定损失50美元和25%的机会损失200美元之间的选择。Slovic、Fischhoff和Lichtenstein(1982)报告说,80%的受试者表示对赌博的风险偏好高于对肯定损失的偏好。然而,只有35%的受试者拒绝支付50美元的保险费,以应对损失200美元的25%的风险。Schoemaker 和 Kunreuther (1979) 以及 Hershey 和 Schoemaker (1980) 也报告了类似的结果。我们认为,在第一个问题中被设定为无补偿损失的相同金额在第二个问题中被设定为保护成本。这两个问题中的模式偏好是相反的,因为损失比成本更令人厌恶。
We have observed a similar effect in the positive domain, as illustrated by the following pair of problems:
我们在正面领域观察到了类似的效果,正如以下 对问题所说明的那样。
Problem 10: Would you accept a gamble that offers a 10% chance to win $95 and a 90% chance to lose $5?
问题10:你会接受10%的机会赢95美元,90%的机会输5美元的赌博吗?
Problem 11: Would you pay $5 to participate in a lottery that offers a 10% chance to win $100 and a 90% chance to win nothing?
问题11:你会花5美元参加有10%的机会赢得100美元和90%的机会什么都没有的彩票吗?
A total of 132 undergraduates answered the two questions, which were separated by a short filler problem. The order of the questions was reversed for half the respondents. Although it is easily confirmed that the two problems offer objectively identical options, 55 of the respondents expressed different preferences in the two versions. Among them, 42 rejected the gamble in Problem 10 but accepted the equivalent lottery in Problem 11. The effectiveness of this seemingly inconsequential manipulation illustrates both the cost-loss discrepancy and the power of framing. Thinking of the $5 as a payment makes the venture more acceptable than thinking of the same amount as a loss.
共有132名本科生回答了这两个问题,这两个问题之间有一个简短的填充问题。对于一半的受访者来说,问题的顺序被颠倒了 。虽然很容易确认这两个问题提供了客观上相同的选项,但有55名受访者在两个版本中表达了不同的偏好。其中,42人拒绝了问题10中的赌博,但接受了问题11中的同等抽奖。这种看似无足轻重的操作的有效性既说明了成本-损失差异 ,也说明了框架的力量。把5美元看成是一种付款,比把同样的金额看成是一种损失更能让人接受。
The preceding analysis implies that an individual’s subjective state can be improved by framing negative outcomes as costs rather than as losses. The possibility of such psychological manipulations may explain a paradoxical form of behavior that could be labeled the dead-loss effect. Thaler (1980) discussed the example of a man who develops tennis elbow soon after paying the membership fee in a tennis club and continues to play in agony to avoid wasting his investment. Assuming that the individual would not play if he had not paid the membership fee, the question arises: How can playing in agony improve the individual’s lot? Playing in pain, we suggest, maintains the evaluation of the membership fee as a cost. If the individual were to stop playing, he would be forced to recognize the fee as a dead loss, which may be more aversive than playing in pain.
前面的分析意味着,个人的主观状态可以通过将负面结果框定为成本而不是损失来改善。这种心理操纵的可能性可以解释一种自相矛盾的行为形式,这种行为可以被称为 ,即死损效应。Thaler(1980)讨论了这样一个例子:一个人在支付了 网球俱乐部的会员费后不久就患上了网球肘,为了避免浪费他的投资,他继续在痛苦中打球。假设这个人如果不交会员费就不会打球,问题就来了。在痛苦中打球如何能改善个人的命运?我们认为,在痛苦中玩耍, ,保持对会员费作为一种成本的评价。如果个人停止游戏,他将被迫承认该费用是一种致命的损失,这可能比在痛苦中游戏更令人厌恶。
CONCLUDING REMARKS
结语
The concepts of utility and value are commonly used in two distinct senses: (a) experience value, the degree of pleasure or pain, satisfaction or anguish in the actual experience of an outcome; and (b) decision value, the contribution of an anticipated outcome to the overall attractiveness or aversiveness of an option in a choice. The distinction is rarely explicit in decision theory because it is tacitly assumed that decision values and experience values coincide. This assumption is part of the conception of an idealized decision maker who is able to predict future experiences with perfect accuracy and evaluate options accordingly. For ordinary decision makers, however, the correspondence of decision values between experience values is far from perfect (March 1978). Some factors that affect experience are not easily anticipated, and some factors that affect decisions do not have a comparable impact on the experience of outcomes.
功用和价值的概念通常在两种不同的意义上使用:(a) 经验价值,在实际体验中的快乐或痛苦、满意或痛苦的程度 ;(b) 决策价值,在选择中预期结果对选项的总体吸引力或反作用的贡献。这种区分在决策理论中很少明确,因为人们默认决策价值和经验价值是一致的。这一假设是理想化的决策者概念的一部分,他能够完全准确地预测未来的 ,并据此评估各种选择。然而,对于普通的决策者来说,决策值与经验值的对应关系远非完美(March 1978)。一些影响经验的因素不容易被预测,而一些影响决策的因素对结果经验的影响也不具有可比性。
In contrast to the large amount of research on decision making, there has been relatively little systematic exploration of the psychophysics that relate hedonic experience to objective states. The most basic problem of hedonic psychophysics is the determination of the level of adaptation or aspiration that separates positive from negative outcomes. The hedonic reference point is largely determined by the objective status quo, but it is also affected by expectations and social comparisons. An objective improvement can be experienced as a loss, for example, when an employee receives a smaller raise than everyone else in the office. The experience of pleasure or pain associated with a change of state is also critically dependent on the dynamics of hedonic adaptation. Brickman and Campbell’s (1971) concept of the hedonic treadmill suggests the radical hypothesis that rapid adaptation will cause the effects of any objective improvement to be short-lived. The complexity and subtlety of hedonic experience make it difficult for the decision maker to anticipate the actual experience that outcomes will produce. Many a person who ordered a meal when ravenously hungry has admitted to a big mistake when the fifth course arrived on the table. The common mismatch of decision values and experience values introduces an additional element of uncertainty in many decision problems.
与关于决策的大量 ,相对而言,对于将享乐体验与客观状态联系起来的心理物理学的系统探索却很少。享乐心理物理学的最基本问题是确定适应或渴望的程度,以区分积极和消极的结果。享乐参考点在很大程度上是由客观状态 quo决定的,但它也受到期望和社会比较的影响。一个客观的改善可以被体验为一种损失,例如,当一个雇员得到的加薪比办公室里的其他人少。与状态变化相关的快乐或痛苦的体验也关键取决于享乐适应的动态。Brickman和Campbell(1971)的hedonic treadmill的概念提出了一个激进的假设,即快速的适应会导致任何客观改善的效果都是短暂的。享乐体验的复杂性和微妙性使得决策者很难预测结果将产生的实际体验。许多人在饥肠辘辘的时候点了一餐,当第五道菜 ,就承认犯了大错。决策价值和经验价值的普遍不匹配在许多决策问题中引入了一个额外的不确定性因素。
The prevalence of framing effects and violations of invariance further complicates the relation between decision values and experience values. The framing of outcomes often induces decision values that have no counterpart in actual experience. For example, the framing of outcomes of therapies for lung cancer in terms of mortality or survival is unlikely to affect experience, although it can have a pronounced influence on choice. In other cases, however, the framing of decisions affects not only decision but experience as well. For example, the framing of an expenditure as an uncompensated loss or as the price of insurance can probably influence the experience of that outcome. In such cases, the evaluation of outcomes in the context of decisions not only anticipates experience but also molds it.
框定效应和违反不变性的普遍存在,使决策价值和经验价值之间的关系更加复杂。对结果的框定往往会诱发在实际经验中没有对应的决策价值。 例如,以死亡率或生存率来框定肺癌治疗的结果,不太可能影响经验,尽管它对选择有明显的影响。然而,在其他情况下,决策的框架不仅影响决策,也影响经验。例如,把一项支出说成是无偿的损失或保险的价格,可能会影响到该结果的经验, 。在这种情况下,在决定的背景下对结果的评价不仅预示着经验,而且也塑造了经验。
REFERENCES
参考文献
Allais, M., and O. Hagen, eds. 1979. Expected Utility Hypotheses and the Allais Paradox . Hingham, MA: D. Reidel.
Allais, M., and O. Hagen, eds. 1979. 预期效用假设和Allais悖论 。Hingham, MA:D. Reidel.
Bernoulli, D. 1954 [1738]. “Exposition of a New Theory on the Measurement of Risk.” Econometrica 22:23–36.
Bernoulli, D. 1954 [1738]."关于风险测量的新理论的阐述"。 Econometrica 22:23-36.
Brickman, P., and D. T. Campbell. 1971. “Hedonic Relativism and Planning the Good Society.” In Adaptation Level Theory: A Symposium , ed. M. H. Appley. New York: Academic Press, 287–302.
Brickman, P., and D. T. Campbell.1971."Hedonic Relativism and Planning the Good Society"。In Adaptation Level Theory:A Symposium , ed.M. H. Appley.New York:学术出版社,287-302。
Clark, H. H., and E. V. Clark. 1977. Psychology and Language . New York: Harcourt.
Clark, H. H., and E. V. Clark.1977. Psychology and Language .纽约:哈考特。
Erakar, S. E., and H. C. Sox. 1981. “Assessment of Patients’ Preferences for Therapeutic Outcomes.” Medical Decision Making 1:29–39.
Erakar, S. E., and H. C. Sox。1981."评估病人对治疗结果的偏好"。 医学决策 》1:29-39。
Fischhoff, B. 1983. “Predicting Frames.” Journal of Experimental Psychology: Learning, Memory and Cognition 9:103–16.
Fischhoff, B. 1983."预测框架"。 Journal of Experimental Psychology:学习、记忆和认知 9:103-16。
Fischhoff, B., P. Slovic, and S. Lichtenstein. 1980. “Knowing What You Want: Measuring Labile Values.” In Cognitive Processes in Choice and Decision Behavior , ed. T. Wallsten. Hillsdale, NJ: Erlbaum, 117–41.
Fischhoff, B., P. Slovic, and S. Lichtenstein.1980."知道你想要什么:测量不稳定的价值"。在《 选择和决策行为中的认知过程 》中,T. Wallsten编辑。T. Wallsten.Hillsdale, NJ: Erlbaum, 117-41.
Fishburn, P. C., and G. A. Kochenberger. 1979. “Two-Piece von Neumann–Morgenstern Utility Functions.” Decision Sciences 10:503–18.
Fishburn, P. C., and G. A. Kochenberger.1979."两片式von Neumann-Morgenstern 效用函数"。 决策科学 》10:503-18。
Gregory, R. 1983. “Measures of Consumer’s Surplus: Reasons for the Disparity in Observed Values.” Unpublished manuscript, Keene State College, Keene, NH.
Gregory, R. 1983."消费者盈余的措施。观察到的价值不一致的原因"。未发表的手稿,基恩州立学院,基恩,NH。
Hammack, J., and G. M. Brown Jr. 1974. Waterfowl and Wetlands: Toward Bioeconomic Analysis . Baltimore: Johns Hopkins University Press.
Hammack, J., and G. M. Brown Jr. 1974. 水禽和湿地。迈向生物经济分析 。巴尔的摩。Johns Hopkins大学出版社。
Hershey, J. C., and P. J. H. Schoemaker. 1980. “Risk Taking and Problem Context in the Domain of Losses: An Expected-Utility Analysis.” Journal of Risk and Insurance 47:111–32.
Hershey, J. C., and P. J. H. Schoemaker.1980."风险承担 和损失领域的问题背景。预期效用分析"。 风险与保险杂志 》47:111-32。
Kahneman, D., and A. Tversky. 1979. “Prospect Theory: An Analysis of Decision under Risk.” Econometrica 47:263–91.
Kahneman, D., and A. Tversky.1979."前景理论。风险下的决策分析"。 Econometrica 47:263-91.
———. 1982. “The Simulation Heuristic.” In Judgment Under Uncertainty: Heuristics and Biases , ed. D. Kahneman, P. Slovic, and A. Tversky. New York: Cambridge University Press, 201–208.
---.1982."模拟启发式"。In Judgment Under Uncertainty: Heuristics and Biases ,ed.D. Kahneman, P. Slovic, and A. Tversky.纽约。剑桥大学 Press,201-208。
Knetsch, J., and J. Sinden. 1984. “Willingness to Pay and Compensation Demanded: Experimental Evidence of an Unexpected Disparity in Measures of Value.” Quarterly Journal of Economics 99:507–21.
Knetsch, J., and J. Sinden.1984."支付意愿和要求的补偿。价值测量中的意外差异的实验证据"。 经济学季刊》 99:507-21。
March, J. G. 1978. “Bounded Rationality, Ambiguity, and the Engineering of Choice.” Bell Journal of Economics 9:587–608.
March, J. G. 1978."有限理性、模糊性和选择的工程"。 Bell Journal of Economics 9:587-608.
McNeil, B., S. Pauker, H. Sox Jr., and A. Tversky. 1982. “On the Elicitation of Preferences for Alternative Therapies.” New England Journal of Medicine 306:1259–62.
McNeil, B., S. Pauker, H. Sox Jr., and A. Tversky.1982."关于 征求对替代疗法的偏好"。 新英格兰医学杂志 》306:1259-62。
Payne, J. W., D. J. Laughhunn, and R. Crum. 1980. “Translation of Gambles and Aspiration Level Effects in Risky Choice Behavior.” Management Science 26:1039–60.
Payne, J. W., D. J. Laughhunn, and R. Crum。1980."风险选择行为中的赌博转换和愿望水平效应"。 管理科学》 26:1039-60。
Pratt, J. W., D. Wise, and R. Zeckhauser. 1979. “Price Differences in Almost Competitive Markets.” Quarterly Journal of Economics 93:189–211.
Pratt, J. W., D. Wise, and R. Zeckhauser.1979."几乎竞争性市场中的价格差异"。 经济学季刊 》 93:189-211。
Savage, L. J. 1954. The Foundation of Statistics . New York: Wiley.
Savage, L. J. 1954. The Foundation of Statistics .New York:Wiley.
Schlaifer, R. 1959. Probability and Statistics for Business Decisions . New York: McGraw-Hill.
Schlaifer, R. 1959. 商业决策的概率和统计 。New York:McGraw-Hill.
Schoemaker, P.J.H., and H. C. Kunreuther. 1979. “An Experimental Study of Insurance Decisions.” Journal of Risk and Insurance 46:603–18.
Schoemaker, P.J.H., and H. C. Kunreuther.1979."保险决策的实验性研究"。 风险与保险杂志 》46:603-18。
Slovic, P., B. Fischhoff, and S. Lichtenstein. 1982. “Response Mode, Framing, and Information-Processing Effects in Risk Assessment.” In New Directions for Methodology of Social and Behavioral Science: Question Framing and Response Consistency , ed. R. Hogarth. San Francisco: Jossey-Bass, 21–36.
Slovic, P., B. Fischhoff, and S. Lichtenstein.1982."风险评估中的反应模式、框架和 信息处理 影响"。在 社会和行为科学方法学的新方向 中 。 在社会和行为科学方法学的新方向: 问题框架和反应一致性 ,编辑。R. Hogarth.San Francisco:Jossey-Bass, 21-36.
Thaler, R. 1980. “Toward a Positive Theory of Consumer Choice.” Journal of Economic Behavior and Organization 1: 39–60.
Thaler, R. 1980."迈向消费者选择的积极理论"。 经济行为与组织杂志 》1:39-60。
———. 1985. “Using Mental Accounting in a Theory of Consumer Behavior.” Marketing Science 4:199–214.
---.1985."在消费者 行为的理论中使用心理会计"。 营销科学 》4:199-214。
Tversky, A. 1977. “On the Elicitation of Preferences: Descriptive and Prescriptive Considerations.” In Conflicting Objectives in Decisions , ed. D. Bell, R. L. Kenney, and H. Raiffa. New York: Wiley, 209–22.
Tversky, A. 1977."关于偏好的征询。描述性和规定性的考虑"。在《 决策中的冲突目标 》一书中,D.Bell、R.L.Kenney和H.Raiffa编著。D. Bell, R. L. Kenney, and H. Raiffa.New York:Wiley, 209-22.
Tversky, A., and D. Kahneman. 1981. “The Framing of Decisions and the Psychology of Choice.” Science 211:453–58.
Tversky, A., and D. Kahneman.1981."决策的框架和选择的心理学"。 科学 》211:453-58。
von Neumann, J., and O. Morgenstern. 1947. Theory of Games and Economic Behavior , 2nd ed. Princeton: Princeton University Press.
von Neumann, J., and O. Morgenstern. 1947. 游戏和经济行为理论 ,第二版。Princeton:普林斯顿大学出版社。
Notes
笔记
INTRODUCTION
简介
prone to collect too few observations : We had read a book that criticized psychologists for using small samples, but did not explain their choices: Jacob Cohen, Statistical Power Analysis for the Behavioral Sciences (Hillsdale, NJ: Erlbaum, 1969).
容易收集太少的观察结果 。我们曾 读过一本书,批评心理学家使用小样本,但没有解释他们的选择。Jacob Cohen, Statistical Power Analysis for the Behavioral Sciences (Hillsdale, NJ: Erlbaum, 1969)。
question about words : I have slightly altered the original wording, which referred to letters in the first and third position of words.
关于单词的问题 。我稍微改变了原来的措辞,它指的是 字的第一和第三位置的字母。
negative view of the mind : A prominent German psychologist has been our most persistent critic. Gerd Gigerenzer, “How to Make Cognitive Illusions Disappear,” European Review of Social Psychology 2 (1991): 83–115. Gerd Gigerenzer, “Personal Reflections on Theory and Psychology,” Theory & Psychology 20 (2010): 733–43. Daniel Kahneman and Amos Tversky, “On the Reality of Cognitive Illusions,” Psychological Review 103 (1996): 582–91.
对心灵的负面看法 。一位杰出的德国心理学家一直是我们最顽固的批评者。格尔德-吉格伦泽,"如何使认知幻觉消失",《 欧洲社会心理学评论 》2(1991)。83-115.格尔德-吉格伦泽,"对理论和心理学的个人思考",《 理论与心理学 》20(2010)。733-43.丹尼尔-卡尼曼和阿莫斯-特维斯基,"论认知幻觉的真实性," Psychological Review 103 (1996):582-91.
offered plausible alternatives : Some examples from many are Valerie F. Reyna and Farrell J. Lloyd, “Physician Decision-Making and Cardiac Risk: Effects of Knowledge, Risk Perception, Risk Tolerance and Fuzzy-Processing,” Journal of Experimental Psychology: Applied 12 (2006): 179–95. Nicholas Epley and Thomas Gilovich, “The Anchoring-and-Adjustment Heuristic,” Psychological Science 17 (2006): 311–18. Norbert Schwarz et al., “Ease of Retrieval of Information: Another Look at the Availability Heuristic,” Journal of Personality and Social Psychology 61 (1991): 195–202. Elke U. Weber et al., “Asymmetric Discounting in Intertemporal Choice,” Psychological Science 18 (2007): 516–23. George F. Loewenstein et al., “Risk as Feelings,” Psychological Bulletin 127 (2001): 267–86.
提供合理的替代方案 。许多人中的一些例子是Valerie F. Reyna和Farrell J. Lloyd,"医生决策和心脏风险:知识、风险认知、风险容忍度和模糊处理的影响", 《实验心理学杂志。应用 12(2006)。179-95.Nicholas Epley和Thomas Gilovich,"锚定和调整启发式", Psychological Science 17 (2006):311-18.Norbert Schwarz等人,"信息检索的难易程度。对可用性启发式的另一种看法》, 《人格与社会心理学杂志 》61(1991)。195-202.Elke U. Weber等人,"时内选择中的不对称贴现",《 心理科学》 18(2007)。516-23.George F. Loewenstein等人,《作为感觉的风险》,《 心理学通报》 127 (2001):267-86.
Nobel Prize that I received : The prize awarded in economics is named Bank of Sweden Prize in Economic Sciences in Memory of Alfred Nobel. It was first given in 1969. Some physical scientists were not pleased with the addition of a Nobel Prize in social science, and the distinctive label of the economics prize was a compromise.
我获得的诺贝尔奖 。经济学奖被命名为瑞典银行纪念阿尔弗雷德-诺贝尔经济科学奖。它于1969年首次颁发。一些物理学家对在社会科学领域增设诺贝尔奖并不满意,而经济学奖的独特标签是一种妥协。
prolonged practice : Herbert Simon and his students at Carnegie Mellon in the 1980s set the foundations for our understanding of expertise. For an excellent popular introduction to the subject, see Joshua Foer, Moonwalking with Einstein: The Art and Science of Remembering (New York: Penguin Press, 2011). He presents work that is reviewed in more technical detail in K. Anders Ericsson et al., eds., The Cambridge Handbook of Expertise and Expert Performance (New York: Cambridge University Press, 2006.)
长时间的实践 。赫伯特-西蒙和他的学生 ,20世纪80年代在卡内基梅隆大学为我们对专业知识的理解奠定了基础。关于这个主题的出色的流行介绍,见约书亚-福尔,《 与爱因斯坦月球漫步》。The Art and Science of Remembering (New York: Penguin Press, 2011).他提出的工作在K.Anders Ericsson等人编辑的《 剑桥专业知识和专家手册》 ( The Cambridge Handbook of Expertise and Expert Performance )(纽约:剑桥大学出版社,2006年)中得到了更多技术细节的回顾。
kitchen was on fire : Gary A. Klein, Sources of Power (Cambridge, MA: MIT Press, 1999).
厨房着火了 。Gary A. Klein, Sources of Power (Cambridge, MA: MIT Press, 1999).
studied chess masters : Herbert Simon was one of the great scholars of the twentieth century, whose discoveries and inventions ranged from political science (where he began his career) to economics (in which he won a Nobel Prize) to computer science (in which he was a pioneer) and to psychology.
研究国际象棋大师 。赫伯特-西蒙是二十世纪伟大的学者之一,他的发现和发明从政治学(他在那里开始了他的职业生涯)到经济学(他在那里获得了诺贝尔奖)到计算机科学 (他是这方面的先驱)以及心理学。
“The situation … recognition” : Herbert A. Simon, “What Is an Explanation of Behavior?” Psychological Science 3 (1992): 150–61.
"情况......认可"。 赫伯特-A-西蒙,"什么是行为的解释?" 心理科学》 3(1992)。150-61.
affect heuristic : The concept of the affect heuristic was developed by Paul Slovic, a classmate of Amos’s at Michigan and a lifelong friend.
情感启发式 :情感启发式的概念是由保罗-斯洛维奇提出的,他是阿莫斯在密歇根的同学,也是他一生的朋友。
without noticing the substitution : See chapter 9 .
而没有注意到这种替换 。见 第9章 。
1: THE CHARACTERS OF THE STORY
1:故事中的人物
offered many labels : For reviews of the field, see Jonathan St. B. T. Evans and Keith Frankish, eds., In Two Minds : Dual Processes and Beyond (New York: Oxford University Press, 2009); Jonathan St. B. T. Evans, “Dual-Processing Accounts of Reasoning, Judgment, and Social Cognition,” Annual Review of Psychology 59 (2008): 255–78. Among the pioneers are Seymour Epstein, Jonathan Evans, Steven Sloman, Keith Stanovich, and Richard West. I borrow the terms System 1 and System 2 from early writings of Stanovich and West that greatly influenced my thinking: Keith E. Stanovich and Richard F. West, “Individual Differences in Reasoning: Implications for the Rationality Debate,” Behavioral and Brain Sciences 23 (2000): 645–65.
提供了许多标签 。关于该领域的评论,见Jonathan St. B. T. Evans和Keith Frankish, eds., In Two Minds :Dual Processes and Beyond (纽约:牛津大学出版社,2009年);Jonathan St:255-78.在这些先驱中,有西摩-爱泼斯坦、乔纳森-埃文斯、史蒂芬-斯洛曼、 Keith Stanovich和理查德-韦斯特。我从斯坦诺维奇和韦斯特的早期著作中借用了系统1和系统2的术语,这些术语极大地影响了我的思维。Keith E. Stanovich和Richard F. West,"推理的个体差异。对理性辩论的影响",《 行为与脑科学 》23(2000)。645-65.
subjective experience of agency : This sense of free will is sometimes illusory, as shown in Daniel M. Wegner, The Illusion of Conscious Will (Cambridge, MA: Bradford Books, 2003).
主观的代理体验 。这种自由意志感有时是虚幻的, ,如Daniel M. Wegner, The Illusion of Conscious Will (Cambridge, MA: Bradford Books, 2003)所示。
attention is totally focused elsewhere : Nilli Lavie, “Attention, Distraction and Cognitive Control Under Load,” Current Directions in Psychological Science 19 (2010): 143–48.
注意力完全集中在其他地方 。Nilli Lavie,"注意力、分心和负载下的认知控制",《 心理科学的当前方向 》19(2010)。143-48.
conflict between the two systems : In the classic Stroop task, you are shown a display of patches of different colors, or of words printed in various colors. Your task is to call out the names of the colors, ignoring the words. The task is extremely difficult when the colored words are themselves names of color (e.g., GREEN printed in red, followed by YELLOW printed in green, etc.).
这 两个系统之间的冲突 。在经典的斯特罗普任务中,你会看到一个由不同颜色的斑块组成的显示屏,或 ,上面印有各种颜色的字。你的任务是叫出这些颜色的名称,而忽略这些字。当彩色单词本身就是颜色的名称时,这个任务就非常困难了(例如,用红色印刷的绿色,接着是用绿色印刷的黄色,等等)。
psychopathic charm : Professor Hare wrote me to say, “Your teacher was right,” March 16, 2011. Robert D. Hare, Without Conscience : The Disturbing World of the Psychopaths Among Us (New York: Guilford Press, 1999). Paul Babiak and Robert D. Hare, Snakes in Suits : When Psychopaths Go to Work (New York: Harper, 2007).
心理变态的魅力 。哈雷教授写信给我说:"你的老师是对的,"2011年3月16日。罗伯特-D-黑尔,《 没有良知 》。 我们中的精神病患者的令人不安的世界 (纽约:吉尔福德出版社,1999年)。保罗-巴比亚克和罗伯特-D-黑尔,《 穿西装的蛇 》。 当精神病患者去工作时 (纽约:哈珀,2007)。
little people : Agents within the mind are called homunculi and are (quite properly) objects of professional derision.
小人 。心灵中的代理人被称为同体人,是专业人员嘲笑的对象(非常恰当)。
space in your working memory : Alan D. Baddeley, “Working Memory: Looking Back and Looking Forward,” Nature Reviews: Neuroscience 4 (2003): 829–38. Alan D. Baddeley, Your Memory: A User’s Guide (New York: Firefly Books, 2004).
你工作记忆中的空间 。Alan D. Baddeley,"工作记忆。回顾过去,展望未来", Nature Reviews:神经科学 4(2003)。829-38.Alan D. Baddeley,《 你的记忆》。A User's Guide (New York: Firefly Books, 2004).
2: ATTENTION AND EFFORT
2:注意和努力
Attention and Effort : Much of the material of this chapter draws on my Attention and Effort (1973). It is available for free download on my website ( www.princeton.edu/~kahneman/docs/attention_and_effort/Attention_hi_quality.pdf ). The main theme of that book is the idea of a limited ability to pay attention and exert mental effort. Attention and effort were considered general resources that could be used to support many mental tasks. The idea of general capacity is controversial, but it has been extended by other psychologists and neuroscientists, who found support for it in brain research. See Marcel A. Just and Patricia A. Carpenter, “A Capacity Theory of Comprehension: Individual Differences in Working Memory,” Psychological Review 99 (1992): 122–49; Marcel A. Just et al., “Neuroindices of Cognitive Workload: Neuroimaging, Pupillometric and Event-Related Potential Studies of Brain Work,” Theoretical Issues in Ergonomics Science 4 (2003): 56–88. There is also growing experimental evidence for general-purpose resources of attention, as in Evie Vergauwe et al., “Do Mental Processes Share a Domain-General Resource?” Psychological Science 21 (2010): 384–90. There is imaging evidence that the mere anticipation of a high-effort task mobilizes activity in many areas of the brain, relative to a low-effort task of the same kind. Carsten N. Boehler et al., “Task-Load-Dependent Activation of Dopaminergic Midbrain Areas in the Absence of Reward,” Journal of Neuroscience 31 (2011): 4955–61.
注意和努力 。本章的大部分材料都借鉴了我的《 注意与努力 》(1973)。它可以在我的网站上免费下载 (www.princeton.edu/~kahneman/docs/attention_and_effort/Attention_hi_quality.pdf )。那本书的主要主题 ,是关于注意和施加心理努力的能力有限的想法。注意力和努力被认为是一般资源,可以用来支持许多心理任务。一般能力的想法是有争议的,但它已被其他心理学家和神经科学家扩展,他们在大脑研究中发现了对它的支持。见Marcel A. Just和Patricia A. Carpenter, "A Capacity Theory of Comprehension:工作记忆的个体差异》,《 心理学评论》 99(1992)。122-49;Marcel A. Just等人,"认知工作量的神经指标。大脑工作的神经成像、瞳孔测量和事件相关电位研究",《 人体工程学科学的理论问题 》4(2003)。56-88.也有越来越多的实验证据表明注意力的通用资源, ,如Evie Vergauwe等人,"心理过程是否共享一个领域通用资源?" Psychological Science 21 (2010):384-90.有影像学证据表明,相对于同类的低强度任务,仅仅是对高强度任务的预期就能调动大脑许多区域的活动。Carsten N. Boehler等人,"在没有奖励的情况下多巴胺能中脑区域的任务负荷依赖性激活 ," 《神经科学杂志》 31(2011)。4955-61.
pupil of the eye : Eckhard H. Hess, “Attitude and Pupil Size,” Scientific American 212 (1965): 46–54.
眼睛的瞳孔 。Eckhard H. Hess,"态度和瞳孔大小",《 科学美国人》 212(1965)。46-54.
on the subject’s mind : The word subject reminds some people of subjugation and slavery, and the American Psychological Association enjoins us to use the more democratic participant . Unfortunately, the politically correct label is a mouthful, which occupies memory space and slows thinking. I will do my best to use participant whenever possible but will switch to subject when necessary.
在主体的思想上 。 主体 这个词让一些人想起了征服和奴役,美国心理学会责成我们使用更民主的 参与者 。不幸的是,这个政治上正确的标签是一个口误, ,它占据了记忆空间,减缓了思维速度。我将尽可能地使用 参与人 ,但在必要时将改用 主体 。
heart rate increases : Daniel Kahneman et al., “Pupillary, Heart Rate, and Skin Resistance Changes During a Mental Task,” Journal of Experimental Psychology 79 (1969): 164–67.
心率增加 。Daniel Kahneman等人,"Pupillary, Heart Rate, and Skin Resistance Changes During a Mental Task," Journal of Experimental Psychology 79 (1969):164-67.
rapidly flashing letters : Daniel Kahneman, Jackson Beatty, and Irwin Pollack, “Perceptual Deficit During a Mental Task,” Science 15 (1967): 218–19. We used a halfway mirror so that the observers saw the letters directly in front of them while facing the camera. In a control condition, the participants looked at the letter through a narrow aperture, to prevent any effect of the changing pupil size on their visual acuity. Their detection results showed the inverted-V pattern observed with other subjects.
快速闪烁的字母 。Daniel Kahneman, Jackson Beatty, and Irwin Pollack, "Perceptual Deficit During a Mental Task, " Science 15 (1967):218-19.我们使用了一面半身镜,使观察者在面对摄像机时看到他们面前的字母。在一个控制条件下,参与者通过一个狭窄的光圈看字母,以防止瞳孔大小的变化对他们的视力产生任何影响。他们的检测结果显示了在其他受试者身上观察到的倒V模式 。
Much like the electricity meter : Attempting to perform several tasks at once may run into difficulties of several kinds. For example, it is physically impossible to say two different things at exactly the same time, and it may be easier to combine an auditory and a visual task than to combine two visual or two auditory tasks. Prominent psychological theories have attempted to attribute all mutual interference between tasks to competition for separate mechanisms. See Alan D. Baddeley, Working Memory (New York: Oxford University Press, 1986). With practice, people’s ability to multitask in specific ways may improve. However, the wide variety of very different tasks that interfere with each other supports the existence of a general resource of attention or effort that is necessary in many tasks.
很像电表 。试图同时执行几个任务可能会遇到几种困难。例如,在物理上不可能完全在同一时间说两件不同的事情,而把听觉和视觉任务结合起来可能比把两个视觉或两个听觉任务结合起来更容易。著名的心理学理论 ,试图将任务之间的所有相互干扰归结为独立机制的竞争。见Alan D. Baddeley, Working Memory (New York: Oxford University Press, 1986)。通过练习,人们以特定方式进行多任务处理的能力可能会提高。然而,各种非常不同的任务之间的相互干扰支持存在一种普遍的注意力资源或 努力,这在许多任务中是必要的。
Studies of the brain : Michael E. Smith, Linda K. McEvoy, and Alan Gevins, “Neurophysiological Indices of Strategy Development and Skill Acquisition,” Cognitive Brain Research 7 (1999): 389–404. Alan Gevins et al., “High-Resolution EEG Mapping of Cortical Activation Related to Working Memory: Effects of Task Difficulty, Type of Processing and Practice,” Cerebral Cortex 7 (1997): 374–85.
对大脑的研究 。Michael E. Smith, Linda K. McEvoy, and Alan Gevins, "Neurophysiological Indices of Strategy Development and Skill Acquisition," Cognitive Brain Research 7 (1999):389-404.Alan Gevins等人,"与工作记忆有关的皮质激活的高分辨率EEG图谱。任务难度、处理类型和实践的影响, " Cerebral Cortex 7 (1997):374-85.
less effort to solve the same problems : For example, Sylvia K. Ahern and Jackson Beatty showed that individuals who scored higher on the SAT showed smaller pupillary dilations than low scorers in responding to the same task. “Physiological Signs of Information Processing Vary with Intelligence,” Science 205 (1979): 1289–92.
在 解决同样的问题时,要付出更少的努力 。例如, Sylvia K. Ahern和Jackson Beatty表明,在SAT考试中得分较高的人在回答同样的任务时,瞳孔放大的程度比得分低的人要小。"信息处理的生理迹象随智力变化",《 科学》 205(1979)。1289-92.
“law of least effort” : Wouter Kool et al., “Decision Making and the Avoidance of Cognitive Demand,” Journal of Experimental Psychology—General 139 (2010): 665–82. Joseph T. McGuire and Matthew M. Botvinick, “The Impact of Anticipated Demand on Attention and Behavioral Choice,” in Effortless Attention , ed. Brian Bruya (Cambridge, MA: Bradford Books, 2010), 103–20.
"最小努力法则"。 Wouter Kool 等人,"决策和避免认知需求", 《实验心理学杂志-General 》139(2010)。665-82.Joseph T. McGuire和Matthew M. Botvinick,"预期需求对注意力和行为选择的影响",载于 Effortless Attention ,Brian Bruya编辑(Cambridge, MA: Bradford Books, 2010),103-20。
balance of benefits and costs : Neuroscientists have identified a region of the brain that assesses the overall value of an action when it is completed. The effort that was invested counts as a cost in this neural computation. Joseph T. McGuire and Matthew M. Botvinick, “Prefrontal Cortex, Cognitive Control, and the Registration of Decision Costs,” PNAS 107 (2010): 7922–26.
利益和成本的平衡 。神经科学家已经确定了大脑中的一个 区域,该区域在一个行动完成后评估其总体价值。在这个神经计算中,所投入的努力也算作成本。Joseph T. McGuire和Matthew M. Botvinick,"前额叶皮层、认知控制和决策成本的登记", PNAS 107(2010)。7922-26.
read distracting words : Bruno Laeng et al., “Pupillary Stroop Effects,” Cognitive Processing 12 (2011): 13–21.
阅读分散注意力的词语 。Bruno Laeng等人,"Pupillary Stroop Effects", 认知处理 12(2011)。13-21.
associate with intelligence : Michael I. Posner and Mary K. Rothbart, “Research on Attention Networks as a Model for the Integration of Psychological Science,” Annual Review of Psychology 58 (2007): 1–23. John Duncan et al., “A Neural Basis for General Intelligence,” Science 289 (2000): 457–60.
与智力有关 。Michael I. Posner和Mary K. Rothbart,"作为心理科学整合模式的注意力网络研究",《 心理学年度评论 》58(2007)。1-23.约翰-邓肯等人,"一般智力的神经基础", 《科学》 289(2000)。457-60.
under time pressure : Stephen Monsell, “Task Switching,” Trends in Cognitive Sciences 7 (2003): 134–40.
在时间压力下 :Stephen Monsell,"任务转换", 认知科学趋势 7(2003)。134-40.
working memory : Baddeley, Working Memory .
工作记忆 。Baddeley, 工作记忆 。
tests of general intelligence : Andrew A. Conway, Michael J. Kane, and Randall W. Engle, “Working Memory Capacity and Its Relation to General Intelligence,” Trends in Cognitive Sciences 7 (2003): 547–52.
对一般智力的测试 。Andrew A. Conway, Michael J. Kane, and Randall W. Engle, "Working Memory Capacity and Its Relation to General Intelligence, " Trends in Cognitive Sciences 7 (2003):547-52.
Israeli Air Force pilots : Daniel Kahneman, Rachel Ben-Ishai, and Michael Lotan, “Relation of a Test of Attention to Road Accidents,” Journal of Applied Psychology 58 (1973): 113–15. Daniel Gopher, “A Selective Attention Test as a Predictor of Success in Flight Training,” Human Factors 24 (1982): 173–83.
以色列空军飞行员 。Daniel Kahneman, Rachel Ben-Ishai, and Michael Lotan, "Relation of a Test of Attention to Road Accidents, " Journal of Applied Psychology 58 (1973):113-15.丹尼尔-戈弗,"作为飞行训练成功预测因素的选择性注意力测试", 人类因素 24(1982)。173-83.
3: THE LAZY CONTROLLER
3:懒惰的控制器
“optimal experience” : Mihaly Csikszentmihalyi, Flow: The Psychology of Optimal Experience (New York: Harper, 1990).
"最佳体验"。 Mihaly Csikszentmihalyi, Flow: The Psychology of Optimal Experience (New York: Harper, 1990).
sweet tooth : Baba Shiv and Alexander Fedorikhin, “Heart and Mind in Conflict: The Interplay of Affect and Cognition in Consumer Decision Making,” Journal of Consumer Research 26 (1999): 278–92. Malte Friese, Wilhelm Hofmann, and Michaela Wänke, “When Impulses Take Over: Moderated Predictive Validity of Implicit and Explicit Attitude Measures in Predicting Food Choice and Consumption Behaviour,” British Journal of Social Psychology 47 (2008): 397–419.
甜食 。Baba Shiv和Alexander Fedorikhin,"冲突中的心和思想。情感和认知在消费者决策中的相互作用 ," 《消费者研究杂志 》26(1999)。278-92.Malte Friese, Wilhelm Hofmann, and Michaela Wänke, "When Impulses Take Over:隐性和显性态度措施在预测食物选择和消费行为方面的预测效力的调节," 英国社会心理学杂志 47(2008)。397-419.
cognitively busy : Daniel T. Gilbert, “How Mental Systems Believe,” American Psychologist 46 (1991): 107–19. C. Neil Macrae and Galen V. Bodenhausen, “Social Cognition: Thinking Categorically about Others,” Annual Review of Psychology 51 (2000): 93–120.
认知上的忙碌 。丹尼尔-T-吉尔伯特, "心理系统如何相信", 美国心理学家 46(1991)。107-19.C. Neil Macrae和Galen V. Bodenhausen,"社会认知。对他人的分类思考》,《 心理学年度评论 》51(2000)。93-120.
pointless anxious thoughts : Sian L. Beilock and Thomas H. Carr, “When High-Powered People Fail: Working Memory and Choking Under Pressure in Math,” Psychological Science 16 (2005): 101–105.
无意义的焦虑想法 。Sian L. Beilock和Thomas H. Carr, "When High-Powered People Fail:工作记忆和数学压力下的窒息",《 心理科学》 16(2005)。 101-105.
exertion of self-control : Martin S. Hagger et al., “Ego Depletion and the Strength Model of Self-Control: A Meta-Analysis,” Psychological Bulletin 136 (2010): 495–525.
自我 控制力的发挥 。Martin S. Hagger等人,"自我耗损和自我控制的强度模型。A Meta-Analysis," Psychological Bulletin 136 (2010):495-525.
resist the effects of ego depletion : Mark Muraven and Elisaveta Slessareva, “Mechanisms of Self-Control Failure: Motivation and Limited Resources,” Personality and Social Psychology Bulletin 29 (2003): 894–906. Mark Muraven, Dianne M. Tice, and Roy F. Baumeister, “Self-Control as a Limited Resource: Regulatory Depletion Patterns,” Journal of Personality and Social Psychology 74 (1998): 774–89.
抵制自我消耗的影响 。Mark Muraven和Elisaveta Slessareva,"自我控制失败的机制。动机和有限资源",《 人格与社会心理学公报》 29(2003)。894-906.Mark Muraven, Dianne M. Tice, and Roy F. Baumeister, "自我控制是一种有限的资源。调节性耗损模式", 《人格与社会心理学杂志 》74(1998)。774-89.
more than a mere metaphor : Matthew T. Gailliot et al., “Self-Control Relies on Glucose as a Limited Energy Source: Willpower Is More Than a Metaphor,” Journal of Personality and Social Psychology 92 (2007): 325–36. Matthew T. Gailliot and Roy F. Baumeister, “The Physiology of Willpower: Linking Blood Glucose to Self-Control,” Personality and Social Psychology Review 11 (2007): 303–27.
不 仅仅是一个比喻 。Matthew T. Gailliot等人,"自我控制依赖于作为有限能量来源的葡萄糖。Willpower Is More Than a Metaphor," Journal of Personality and Social Psychology 92 (2007):325-36.Matthew T. Gailliot 和 Roy F. Baumeister,"意志力的生理学。将血糖与自我控制联系起来》,《 人格与社会心理学评论 》11(2007)。303-27.
ego depletion : Gailliot, “Self-Control Relies on Glucose as a Limited Energy Source.”
自我耗损 。Gailliot,"自我控制依赖于作为有限能量来源的葡萄糖"。
depletion effects in judgment : Shai Danziger, Jonathan Levav, and Liora Avnaim-Pesso, “Extraneous Factors in Judicial Decisions,” PNAS 108 (2011): 6889–92.
判断中的耗损效应 。Shai Danziger, Jonathan Levav, and Liora Avnaim-Pesso, "Extraneous Factors in Judicial Decisions," PNAS 108 (2011): 6889-92.
intuitive—incorrect—answer : Shane Frederick, “Cognitive Reflection and Decision Making,” Journal of Economic Perspectives 19 (2005): 25–42.
直观的-正确的-答案 。谢恩-弗雷德里克,"认知反思和决策",《 经济展望》杂志 19(2005)。25-42.
syllogism as valid : This systematic error is known as the belief bias. Evans, “Dual-Processing Accounts of Reasoning, Judgment, and Social Cognition.”
判断一个人是否为 有效的合论 。这种系统性错误被称为信念偏差。埃文斯,"推理、判断和社会认知的双重处理账户"。
call them more rational : Keith E. Stanovich, Rationality and the Reflective Mind (New York: Oxford University Press, 2011).
称他们更加理性 。Keith E. Stanovich, Rationality and the Reflective Mind (New York: Oxford University Press, 2011).
cruel dilemma : Walter Mischel and Ebbe B. Ebbesen, “Attention in Delay of Gratification,” Journal of Personality and Social Psychology 16 (1970): 329–37.
残酷的困境 :Walter Mischel和Ebbe B. Ebbesen,"延迟满足中的注意", 《人格与社会心理学杂志 》16(1970)。329-37.
“There were no toys … distress” : Inge-Marie Eigsti et al., “Predicting Cognitive Control from Preschool to Late Adolescence and Young Adulthood,” Psychological Science 17 (2006): 478–84.
"没有玩具......苦恼"。 Inge-Marie Eigsti等人,"预测从学龄前到青春期晚期和成年后的认知控制",《 心理科学》 17(2006)。478-84.
higher scores on tests of intelligence : Mischel and Ebbesen, “Attention in Delay of Gratification.” Walter Mischel, “Processes in Delay of Gratification,” in Advances in Experimental Social Psychology , Vol. 7, ed. Leonard Berkowitz (San Diego, CA: Academic Press, 1974), 249–92. Walter Mischel, Yuichi Shoda, and Monica L. Rodriguez, “Delay of Gratification in Children,” Science 244 (1989): 933–38. Eigsti, “Predicting Cognitive Control from Preschool to Late Adolescence.”
在智力测试中获得更高的分数 。Mischel和Ebbesen,"延迟满足中的注意"。Walter Mischel, "Processes in Delay of Gratification," in Advances in Experimental Social Psychology , Vol. 7, ed.Leonard Berkowitz(San Diego, CA: Academic Press, 1974),249-92。Walter Mischel, Yuichi Shoda, and Monica L. Rodriguez, "Delay of Gratification in Children," Science 244 (1989):933-38.Eigsti,"预测认知控制 ,从学龄前到青春期晚期"。
improvement was maintained : M. Rosario Rueda et al., “Training, Maturation, and Genetic Influences on the Development of Executive Attention,” PNAS 102 (2005): 14931–36.
进展保持不变 。M. Rosario Rueda等人,"训练、成熟和遗传对执行注意力发展的影响", PNAS 102(2005)。14931-36.
conventional measures of intelligence : Maggie E. Toplak, Richard F. West, and Keith E. Stanovich, “The Cognitive Reflection Test as a Predictor of Performance on Heuristics-and-Biases Tasks,” Memory & Cognition (in press).
传统的智力测量方法 。Maggie E. Toplak, Richard F. West, and Keith E. Stanovich, "The Cognitive Reflection Test as a Predictor of Performance on Heuristics and Biases Tasks," Memory & Cognition (正在出版)。
4: THE ASSOCIATIVE MACHINE
4:关联机
Associative Machine : Carey K. Morewedge and Daniel Kahneman, “Associative Processes in Intuitive Judgment,” Trends in Cognitive Sciences 14 (2010): 435–40.
联想机 。Carey K. Morewedge和Daniel Kahneman, "Associative Processes in Intuitive Judgment," Trends in Cognitive Sciences 14 (2010):435-40.
beyond your control : To avoid confusion, I did not mention in the text that the pupil also dilated. The pupil dilates both during emotional arousal and when arousal accompanies intellectual effort.
你无法控制的 。为了避免混淆,我在文中没有提到瞳孔也会放大。瞳孔在情绪亢奋时和亢奋伴随智力 努力时都会扩张。
think with your body : Paula M. Niedenthal, “Embodying Emotion,” Science 316 (2007): 1002–1005.
用你的身体思考 。Paula M. Niedenthal, "Embodying Emotion," Science 316 (2007):1002-1005.
WASH primes SOAP : The image is drawn from the working of a pump. The first few draws on a pump do not bring up any liquid, but they enable subsequent draws to be effective.
洗涤剂为肥皂打底 。该图片是 ,从泵的工作原理中得出。抽水机的前几次抽水不会带出任何液体,但它们使后续的抽水能够有效。
“finds he it yellow instantly” : John A. Bargh, Mark Chen, and Lara Burrows, “Automaticity of Social Behavior: Direct Effects of Trait Construct and Stereotype Activation on Action,” Journal of Personality and Social Psychology 71 (1996): 230–44.
"瞬间发现他是黄色的"。 John A. Bargh, Mark Chen, and Lara Burrows, "社会行为的自动性。直接 特质结构和定型激活对行动的影响," 《人格与社会心理学杂志》 71(1996)。230-44.
words related to old age : Thomas Mussweiler, “Doing Is for Thinking! Stereotype Activation by Stereotypic Movements,” Psychological Science 17 (2006): 17–21.
与老龄化相关的词汇 。Thomas Mussweiler, "Doing Is for Thinking!定型运动的定型激活》,《 心理科学》 17(2006)。17-21.
The Far Side : Fritz Strack, Leonard L. Martin, and Sabine Stepper, “Inhibiting and Facilitating Conditions of the Human Smile: A Nonobtrusive Test of the Facial Feedback Hypothesis,” Journal of Personality and Social Psychology 54 (1988): 768–77.
远方 。Fritz Strack, Leonard L. Martin, and Sabine Stepper, "人类微笑的抑制和促进条件 。面部反馈假说的非侵入性测试", 《人格与社会心理学杂志 》54(1988)。768-77.
upsetting pictures : Ulf Dimberg, Monika Thunberg, and Sara Grunedal, “Facial Reactions to Emotional Stimuli: Automatically Controlled Emotional Responses,” Cognition and Emotion 16 (2002): 449–71.
令人不安的图片 。Ulf Dimberg, Monika Thunberg, and Sara Grunedal, "Facial Reactions to Emotional Stimuli: Automatically Controlled Emotional Responses, " Cognition and Emotion 16 (2002):449-71.
listen to messages : Gary L. Wells and Richard E. Petty, “The Effects of Overt Head Movements on Persuasion: Compatibility and Incompatibility of Responses,” Basic and Applied Social Psychology 1 (1980): 219–30.
听取信息 。Gary L. Wells和Richard E. Petty," ,公开的头部移动对劝说的影响。兼容和不兼容的反应," 基础和应用社会心理学 1(1980)。219-30.
increase the funding of schools : Jonah Berger, Marc Meredith, and S. Christian Wheeler, “Contextual Priming: Where People Vote Affects How They Vote,” PNAS 105 (2008): 8846–49.
增加学校的资金 。Jonah Berger, Marc Meredith, and S. Christian Wheeler, "Contextual Priming:人们在哪里投票会影响他们的投票方式", PNAS 105 (2008):8846-49.
Reminders of money : Kathleen D. Vohs, “The Psychological Consequences of Money,” Science 314 (2006): 1154–56.
对金钱的提醒 。Kathleen D. Vohs, "The Psychological Consequences of Money," Science 314 (2006):1154-56.
appeal of authoritarian ideas : Jeff Greenberg et al., “Evidence for Terror Management Theory II: The Effect of Mortality Salience on Reactions to Those Who Threaten or Bolster the Cultural Worldview,” Journal of Personality and Social Psychology 58 (1990): 308–18.
专制思想的吸引力 。杰夫-格林伯格等人,"恐怖管理理论的证据二:死亡率显著性对威胁或支持文化世界观的人的反应的影响", 《人格与社会心理学杂志 》58(1990)。308-18.
“Lady Macbeth effect” : Chen-Bo Zhong and Katie Liljenquist, “Washing Away Your Sins: Threatened Morality and Physical Cleansing,” Science 313 (2006): 1451–52.
"麦克白夫人效应"。 Chen-Bo Zhong 和 Katie Liljenquist, "Washing Away Your Sins:受威胁的 道德和身体清洁," 科学 313(2006)。1451-52.
preferred mouthwash over soap : Spike Lee and Norbert Schwarz, “Dirty Hands and Dirty Mouths: Embodiment of the Moral-Purity Metaphor Is Specific to the Motor Modality Involved in Moral Transgression,” Psychological Science 21 (2010): 1423–25.
喜欢用漱口水而不是肥皂 :斯派克-李和诺伯特-施瓦茨,"肮脏的手和肮脏的嘴。道德纯洁隐喻的体现是特定于涉及道德越轨行为的运动模式," 《心理科学》 21(2010)。1423-25.
at a British university : Melissa Bateson, Daniel Nettle, and Gilbert Roberts, “Cues of Being Watched Enhance Cooperation in a Real-World Setting,” Biology Letters 2 (2006): 412–14.
在英国的一所大学 。Melissa Bateson, Daniel Nettle, and Gilbert Roberts, "Cues of Being Watched Enhance Cooperation in a Real-World Setting," Biology Letters 2 (2006):412-14.
introduced to that stranger : Timothy Wilson’s Strangers to Ourselves (Cambridge, MA: Belknap Press, 2002) presents a concept of an “adaptive unconscious” that is similar to System 1.
被介绍给那个陌生人 。蒂莫西-威尔逊的《 我们自己的陌生人 》(Cambridge, MA: Belknap Press, 2002)提出了一个 "适应性无意识 "的概念,与系统1相似。
5: COGNITIVE EASE
5:认知上的轻松
“Easy” and “Strained” : The technical term for cognitive ease is fluency .
"轻松 "和 "紧张"。 认知轻松的专业术语是 流畅性 。
diverse inputs and outputs : Adam L. Alter and Daniel M. Oppenheimer, “Uniting the Tribes of Fluency to Form a Metacognitive Nation,” Personality and Social Psychology Review 13 (2009): 219–35.
不同的输入和输出 。Adam L. Alter和Daniel M. Oppenheimer, "团结流畅性部落,形成元认知国家",《 人格与社会心理学评论》 13(2009)。219-35.
“Becoming Famous Overnight” : Larry L. Jacoby, Colleen Kelley, Judith Brown, and Jennifer Jasechko, “Becoming Famous Overnight: Limits on the Ability to Avoid Unconscious Influences of the Past,” Journal of Personality and Social Psychology 56 (1989): 326–38.
"一夜成名"。 拉里-L-雅各比、科琳-凯利、朱迪斯-布朗和珍妮弗-贾西科,"一夜成名。对避免过去无意识影响的能力的限制," 《人格与社会心理学杂志 》 56(1989)。326-38.
nicely stated the problem : Bruce W. A. Whittlesea, Larry L. Jacoby, and Krista Girard, “Illusions of Immediate Memory: Evidence of an Attributional Basis for Feelings of Familiarity and Perceptual Quality,” Journal of Memory and Language 29 (1990): 716–32.
很好地说明了这个问题 :Bruce W. A. Whittlesea, Larry L. Jacoby, and Krista Girard, "Illusions of Immediate Memory:熟悉感和知觉质量的归属基础的证据," 记忆和语言杂志 29(1990)。716-32.
The impression of familiarity : Normally, when you meet a friend you can immediately place and name him; you often know where you met him last, what he was wearing, and what you said to each other. The feeling of familiarity becomes relevant only when such specific memories are not available. It is a fallback. Although its reliability is imperfect, the fallback is much better than nothing. It is the sense of familiarity that protects you from the embarrassment of being (and acting) astonished when you are greeted as an old friend by someone who only looks vaguely familiar.
熟悉的印象 。通常情况下, ,当你遇到一个朋友时,你可以立即找到他并说出他的名字;你经常知道 ,你最后一次在哪里遇到他,他穿着什么,以及你们彼此说了什么。只有在没有这种具体记忆的时候,熟悉感才会变得相关。它是一种退路。尽管它的可靠性并不完善,但后备措施总比没有好得多。正是这种熟悉感保护你不至于当你被一个看起来仅有几分熟悉的人问候为老朋友的时候, ,而感到尴尬(和表现)惊讶。
“body temperature of a chicken” : Ian Begg, Victoria Armour, and Thérèse Kerr, “On Believing What We Remember,” Canadian Journal of Behavioural Science 17 (1985): 199–214.
"鸡的体温"。 Ian Begg, Victoria Armour, and Thérèse Kerr, "On Believing What We Remember," Canadian Journal of Behavioural Science 17 (1985):199-214.
low credibility : Daniel M. Oppenheimer, “Consequences of Erudite Vernacular Utilized Irrespective of Necessity: Problems with Using Long Words Needlessly,” Applied Cognitive Psychology 20 (2006): 139–56.
低可信度 。Daniel M. Oppenheimer, "Consequences of Erudite Vernacular Utilized irrespective of Necessity:无谓地使用长词的问题", 应用 认知心理学 20(2006)。139-56.
when they rhymed : Matthew S. McGlone and Jessica Tofighbakhsh, “Birds of a Feather Flock Conjointly (?): Rhyme as Reason in Aphorisms,” Psychological Science 11 (2000): 424–28.
当他们押韵的时候 。Matthew S. McGlone 和 Jessica Tofighbakhsh, "Birds of a Feather Flock Conjointly (?):箴言中作为理由的韵律》,《 心理科学》 11(2000)。424-28.
fictitious Turkish companies : Anuj K. Shah and Daniel M. Oppenheimer, “Easy Does It: The Role of Fluency in Cue Weighting,” Judgment and Decision Making Journal 2 (2007): 371–79.
虚构的土耳其公司 。Anuj K. Shah 和 Daniel M. Oppenheimer, "Easy Does It: The Role of Fluency in Cue Weighting," Judgment and Decision Making Journal 2 (2007):371-79.
engaged and analytic mode : Adam L. Alter, Daniel M. Oppenheimer, Nicholas Epley, and Rebecca Eyre, “Overcoming Intuition: Metacognitive Difficulty Activates Analytic Reasoning,” Journal of Experimental Psychology—General 136 (2007): 569–76.
参与和分析的模式 。Adam L. Alter, Daniel M. Oppenheimer, Nicholas Epley, and Rebecca Eyre, "克服直觉。Metacognitive Difficulty Activates Analytic Reasoning," Journal of Experimental Psychology-General 136 (2007):569-76.
pictures of objects : Piotr Winkielman and John T. Cacioppo, “Mind at Ease Puts a Smile on the Face: Psychophysiological Evidence That Processing Facilitation Increases Positive Affect,” Journal of Personality and Social Psychology 81 (2001): 989–1000.
物体的图片 。Piotr Winkielman和John T. Cacioppo,"心态平和,脸上有笑容。心理生理学证据表明,加工促进 ,增加了积极的情感," 人格与社会心理学杂志 81(2001)。989-1000.
small advantage : Adam L. Alter and Daniel M. Oppenheimer, “Predicting Short-Term Stock Fluctuations by Using Processing Fluency,” PNAS 103 (2006). Michael J. Cooper, Orlin Dimitrov, and P. Raghavendra Rau, “A Rose.com by Any Other Name,” Journal of Finance 56 (2001): 2371–88.
小的优势 。Adam L. Alter和Daniel M. Oppenheimer, "Predicting Short-Term Stock Fluctuations by Using Processing Fluency," PNAS 103 (2006).Michael J. Cooper, Orlin Dimitrov, and P. Raghavendra Rau, "A Rose.com by Any Other Name, " Journal of Finance 56 (2001):2371-88.
clunky labels : Pascal Pensa, “Nomen Est Omen: How Company Names Influence Short- and Long-Run Stock Market Performance,” Social Science Research Network Working Paper , September 2006.
笨重的标签 。Pascal Pensa, "Nomen Est Omen: How Company Names Influence Short- and Long-Run Stock Market Performance," Social Science Research Network Working Paper , September 2006.
mere exposure effect : Robert B. Zajonc, “Attitudinal Effects of Mere Exposure,” Journal of Personality and Social Psychology 9 (1968): 1–27.
单纯的接触效应 。Robert B. Zajonc, "Attitudinal Effects of Mere Exposure," Journal of Personality and Social Psychology 9 (1968):1-27.
favorite experiments : Robert B. Zajonc and D. W. Rajecki, “Exposure and Affect: A Field Experiment,” Psychonomic Science 17 (1969): 216–17.
最喜欢的实验 。Robert B. Zajonc和D. W. Rajecki,"曝光和情感。A Field Experiment," Psychonomic Science 17 (1969):216-17.
never consciously sees : Jennifer L. Monahan, Sheila T. Murphy, and Robert B. Zajonc, “Subliminal Mere Exposure: Specific, General, and Diffuse Effects,” Psychological Science 11 (2000): 462–66.
从未有意识地看到 。Jennifer L. Monahan, Sheila T. Murphy, and Robert B. Zajonc, "Subliminal Mere Exposure: Specific, General, and Diffuse Effects, " Psychological Science 11 (2000):462-66.
inhabiting the shell : D. W. Rajecki, “Effects of Prenatal Exposure to Auditory or Visual Stimulation on Postnatal Distress Vocalizations in Chicks,” Behavioral Biology 11 (1974): 525–36.
栖息在壳 内。D. W. Rajecki, "Effects of Prenatal Exposure to Auditory or Visual Stimulation on Postnatal Distress Vocalizations in Chicks, " Behavioral Biology 11 (1974):525-36.
“The consequences … social stability” : Robert B. Zajonc, “Mere Exposure: A Gateway to the Subliminal,” Current Directions in Psychological Science 10 (2001): 227.
"后果......社会稳定"。 Robert B. Zajonc, "Mere Exposure:通往潜意识的通道",《 心理科学的当前方向 》10(2001)。227.
triad of words : Annette Bolte, Thomas Goschke, and Julius Kuhl, “Emotion and Intuition: Effects of Positive and Negative Mood on Implicit Judgments of Semantic Coherence,” Psychological Science 14 (2003): 416–21.
词的三元组 。Annette Bolte, Thomas Goschke, and Julius Kuhl, "Emotion and Intuition:积极和消极情绪对语义连贯性隐性判断的影响",《 心理科学》 14(2003)。416-21.
association is retrieved : The analysis excludes all cases in which the subject actually found the correct solution. It shows that even subjects who will ultimately fail to find a common association have some idea of whether there is one to be found.
联想被检索到 。分析 ,排除了所有主体实际找到正确答案的情况。它表明,即使是那些最终无法找到共同联想的受试者,也对是否存在一个联想有一定的了解。
increase cognitive ease : Sascha Topolinski and Fritz Strack, “The Architecture of Intuition: Fluency and Affect Determine Intuitive Judgments of Semantic and Visual Coherence and Judgments of Grammaticality in Artificial Grammar Learning,” Journal of Experimental Psychology — General 138 (2009): 39–63.
增加认知的便利性 。Sascha Topolinski和Fritz Strack,"直觉的结构。流利性和情感决定了语义和视觉连贯性的直觉判断 ,以及人工语法学习中的语法性判断," 《实验心理学杂志- 综合 138》(2009)。39-63.
doubled accuracy : Bolte, Goschke, and Kuhl, “Emotion and Intuition.”
翻倍的准确性 :Bolte, Goschke, and Kuhl, "情感和直觉"。
form a cluster : Barbara Fredrickson, Positivity: Groundbreaking Research Reveals How to Embrace the Hidden Strength of Positive Emotions, Overcome Negativity, and Thrive (New York: Random House, 2009). Joseph P. Forgas and Rebekah East, “On Being Happy and Gullible: Mood Effects on Skepticism and the Detection of Deception,” Journal of Experimental Social Psychology 44 (2008): 1362–67.
形成一个集群 。Barbara Fredrickson, Positivity:突破性的研究揭示了如何拥抱积极情绪的隐藏力量,克服消极情绪,并茁壮成长 (纽约:兰登书屋,2009)。 Joseph P. Forgas和Rebekah East,"关于快乐和轻信。心情对怀疑和检测欺骗的影响", 《实验社会心理学杂志 》44(2008)。1362-67.
smiling reaction : Sascha Topolinski et al., “The Face of Fluency: Semantic Coherence Automatically Elicits a Specific Pattern of Facial Muscle Reactions,” Cognition and Emotion 23 (2009): 260–71.
微笑的反应 。Sascha Topolinski等人,"流畅的脸。语义连贯性自动引起面部肌肉反应的特定模式," 认知与情感 23(2009)。260-71.
“previous research … individuals” : Sascha Topolinski and Fritz Strack, “The Analysis of Intuition: Processing Fluency and Affect in Judgments of Semantic Coherence,” Cognition and Emotion 23 (2009): 1465–1503.
"以前的研究......个人 " 。Sascha Topolinski和Fritz Strack,"直觉的分析。语义连贯性判断中的处理流畅性和情感," 认知与情感 23(2009)。1465-1503.
6: NORMS, SURPRISES, AND CAUSES
6:规范、意外和原因
An observer : Daniel Kahneman and Dale T. Miller, “Norm Theory: Comparing Reality to Its Alternatives,” Psychological Review 93 (1986): 136–53.
一个观察者 。丹尼尔-卡尼曼和戴尔-T-米勒,"规范理论。将现实与它的替代品相比较," 《心理学评论》 93(1986)。136-53.
“tattoo on my back” : Jos J. A. Van Berkum, “Understanding Sentences in Context: What Brain Waves Can Tell Us,” Current Directions in Psychological Science 17 (2008): 376–80.
"我背上的纹身"。 Jos J. A. Van Berkum, "在语境中理解句子。脑电波能告诉我们什么》,《 心理科学的当前方向 》17(2008)。376-80.
the word pickpocket : Ran R. Hassin, John A. Bargh, and James S. Uleman, “Spontaneous Causal Inferences,” Journal of Experimental Social Psychology 38 (2002): 515–22.
扒手 这个词 。Ran R. Hassin, John A. Bargh, and James S. Uleman, "Spontaneous Causal Inferences," Journal of Experimental Social Psychology 38 (2002):515-22.
indicate surprise : Albert Michotte, The Perception of Causality (Andover, MA: Methuen, 1963). Alan M. Leslie and Stephanie Keeble, “Do Six-Month-Old Infants Perceive Causality?” Cognition 25 (1987): 265–88.
表示惊讶 。Albert Michotte, The Perception of Causality (Andover, MA: Methuen, 1963).Alan M. Leslie和Stephanie Keeble,"6个月大的婴儿能感知到因果关系吗?" 认知 25(1987)。265-88.
explosive finale : Fritz Heider and Mary-Ann Simmel, “An Experimental Study of Apparent Behavior,” American Journal of Psychology 13 (1944): 243–59.
爆炸性的结局 。弗里茨-海德和玛丽-安-西梅尔,"表面行为的实验研究", 美国心理学杂志 13(1944)。243-59.
identify bullies and victims : Leslie and Keeble, “Do Six-Month-Old Infants Perceive Causality?”
识别欺凌者和受害者 。Leslie和Keeble,"6个月大的婴儿能感知到因果关系吗?"
as we die : Paul Bloom, “Is God an Accident?” Atlantic , December 2005.
随着我们的死亡 。保罗-布鲁姆,"上帝是个意外吗?" 大西洋 》,2005年12月。
7: A MACHINE FOR JUMPING TO CONCLUSIONS
7: 妄下结论的机器
elegant experiment : Daniel T. Gilbert, Douglas S. Krull, and Patrick S. Malone, “Unbelieving the Unbelievable: Some Problems in the Rejection of False Information,” Journal of Personality and Social Psychology 59 (1990): 601–13.
优雅的实验 。Daniel T. Gilbert, Douglas S. Krull, and Patrick S. Malone, "Unbelieving the Unbelievable:拒绝虚假信息的一些问题", 《人格与社会心理学杂志 》59(1990)。601-13.
descriptions of two people : Solomon E. Asch, “Forming Impressions of Personality,” Journal of Abnormal and Social Psychol ogy 41 (1946): 258–90.
对两个人的描述 。Solomon E. Asch,"形成人格印象", 《不正常 和社会心理学杂志 》41 (1946)。258-90.
all six adjectives : Ibid.
所有六个形容词 。同上。
Wisdom of Crowds : James Surowiecki, The Wisdom of Crowds (New York: Anchor Books, 2005).
群众的智慧 。James Surowiecki, The Wisdom of Crowds (New York: Anchor Books, 2005).
one-sided evidence : Lyle A. Brenner, Derek J. Koehler, and Amos Tversky, “On the Evaluation of One-Sided Evidence,” Journal of Behavioral Decision Making 9 (1996): 59–70.
片面的证据 。Lyle A. Brenner, Derek J. Koehler, and Amos Tversky, "On the Evaluation of One-Sided Evidence, " Journal of Behavioral Decision Making 9 (1996):59-70.
8: HOW JUDGMENTS HAPPEN
8: 判决是如何发生的
biological roots : Alexander Todorov, Sean G. Baron, and Nikolaas N. Oosterhof, “Evaluating Face Trustworthiness: A Model-Based Approach,” Social Cognitive and Affective Neuroscience 3 (2008): 119–27.
生物的根基 。Alexander Todorov, Sean G. Baron, and Nikolaas N. Oosterhof, "Evaluating Face Trustworthiness:基于模型的方法", 社会认知和情感神经科学 3(2008)。119-27.
friendly or hostile : Alexander Todorov, Chris P. Said, Andrew D. Engell, and Nikolaas N. Oosterhof, “Understanding Evaluation of Faces on Social Dimensions,” Trends in Cognitive Sciences 12 (2008): 455–60.
友好或敌对 :Alexander Todorov, Chris P. Said, Andrew D. Engell, and Nikolaas N. Oosterhof, "Understanding Evaluation of Faces on Social Dimensions," Trends in Cognitive Sciences 12 (2008):455-60.
may spell trouble : Alexander Todorov, Manish Pakrashi, and Nikolaas N. Oosterhof, “Evaluating Faces on Trustworthiness After Minimal Time Exposure,” Social Cognition 27 (2009): 813–33.
可能意味着麻烦 。Alexander Todorov, Manish Pakrashi, and Nikolaas N. Oosterhof, "Evaluating Faces on Trustworthiness After Minimal Time Exposure, " Social Cognition 27 (2009):813-33.
Australia, Germany, and Mexico : Alexander Todorov et al., “Inference of Competence from Faces Predict Election Outcomes,” Science 308 (2005): 1623–26. Charles C. Ballew and Alexander Todorov, “Predicting Political Elections from Rapid and Unreflective Face Judgments,” PNAS 104 (2007): 17948–53. Christopher Y. Olivola and Alexander Todorov, “Elected in 100 Milliseconds: Appearance-Based Trait Inferences and Voting,” Journal of Nonverbal Behavior 34 (2010): 83–110.
澳大利亚、德国和墨西哥 。Alexander Todorov等人,"从脸部推断能力预测选举结果", 科学 308(2005)。1623-26.Charles C. Ballew 和 Alexander Todorov, "Predicting Political Elections from Rapid and Unreflective Face Judgments," PNAS 104 (2007):17948-53.Christopher Y. Olivola和Alexander Todorov, "100毫秒内当选。基于外表的特征推断和投票", 《非语言行为杂志 》34(2010)。83-110.
watch less television : Gabriel Lenz and Chappell Lawson, “Looking the Part: Television Leads Less Informed Citizens to Vote Based on Candidates’ Appearance,” American Journal of Political Science (forthcoming).
少看电视 。Gabriel Lenz和Chappell Lawson, "Looking the Part:电视导致不太了解情况的公民根据候选人的外表投票," 美国政治杂志 科学 (即将出版)。
absence of a specific task set : Amos Tversky and Daniel Kahneman, “Extensional Versus Intuitive Reasoning: The Conjunction Fallacy in Probability Judgment,” Psychological Review 90 (1983): 293–315.
没有特定的任务集 。Amos Tversky和Daniel Kahneman,"扩展性推理与直觉推理。概率判断中的连接谬误",《 心理学评论》 90(1983)。293-315.
Exxon Valdez : William H. Desvousges et al., “Measuring Natural Resource Damages with Contingent Valuation: Tests of Validity and Reliability,” in Contingent Valuation: A Critical Assessment , ed. Jerry A. Hausman (Amsterdam: North-Holland, 1993), 91–159.
Exxon Valdez :William H. Desvousges等人,"用或然估值衡量自然资源损害。对有效性和可靠性的测试",在 权宜之计。A Critical Assessment , ed.Jerry A. Hausman (Amsterdam: North-Holland, 1993), 91-159.
sense of injustice : Stanley S. Stevens, Psychophysics : Introduction to Its Perceptual, Neural, and Social Prospect (New York: Wiley, 1975).
不公正的感觉 。Stanley S. Stevens, Psychophysics : Introduction to Its Perceptual, Neural, and Social Prospect (New York: Wiley, 1975).
detected that the words rhymed : Mark S. Seidenberg and Michael K. Tanenhaus, “Orthographic Effects on Rhyme Monitoring,” Journal of Experimental Psychology—Human Learning and Memory 5 (1979): 546–54.
检测到这些词是押韵的 。Mark S. Seidenberg和Michael K. Tanenhaus, "Orthographic Effects on Rhyme Monitoring," Journal of Experimental Psychology- Human Learning and Memory 5 (1979):546-54.
sentence was literally true : Sam Glucksberg, Patricia Gildea, and Howard G. Bookin, “On Understanding Nonliteral Speech: Can People Ignore Metaphors?” Journal of Verbal Learning and Verbal Behavior 21 (1982): 85–98.
这句话在字面上是真实的 :Sam Glucksberg, Patricia Gildea, and Howard G. Bookin, "On Understanding Nonliteral Speech:人们可以忽略隐喻吗?" 言语学习和言语行为杂志 》21(1982)。85-98.
9: ANSWERING AN EASIER QUESTION
9:回答一个更容易的问题
an intuitive answer to it came readily to mind : An alternative approach to judgment heuristics has been proposed by Gerd Gigerenzer, Peter M. Todd, and the ABC Research Group, in Simple Heuristics That Make Us Smart (New York: Oxford University Press, 1999). They describe “fast and frugal” formal procedures such as “Take the best [cue],” which under some circumstances generate quite accurate judgments on the basis of little information. As Gigerenzer has emphasized, his heuristics are different from those that Amos and I studied, and he has stressed their accuracy rather than the biases to which they inevitably lead. Much of the research that supports fast and frugal heuristic uses statistical simulations to show that they could work in some real-life situations, but the evidence for the psychological reality of these heuristics remains thin and contested. The most memorable discovery associated with this approach is the recognition heuristic, illustrated by an example that has become well-known: a subject who is asked which of two cities is larger and recognizes one of them should guess that the one she recognizes is larger. The recognition heuristic works fairly well if the subject knows that the city she recognizes is large; if she knows it to be small, however, she will quite reasonably guess that the unknown city is larger. Contrary to the theory, the subjects use more than the recognition cue: Daniel M. Oppenheimer, “Not So Fast! (and Not So Frugal!): Rethinking the Recognition Heuristic,” Cognition 90 (2003): B1–B9. A weakness of the theory is that, from what we know of the mind, there is no need for heuristics to be frugal. The brain processes vast amounts of information in parallel, and the mind can be fast and accurate without ignoring information. Furthermore, it has been known since the early days of research on chess masters that skill need not consist of learning to use less information. On the contrary, skill is more often an ability to deal with large amounts of information quickly and efficiently.
一个直观的答案很容易就出现在脑海中 。Gerd Gigerenzer, Peter M. Todd和ABC研究小组在《 使我们聪明的简单启发式方法 》(纽约:牛津大学出版社,1999年)中提出了另一种判断启发式方法。他们描述了 "快速和节俭 "的正式程序,如 "采取最好的[线索]",在某些情况下,在很少的信息基础上产生相当准确的判断。正如Gigerenzer所强调的,他的启发式方法与Amos和我研究的那些不同, ,他强调了它们的准确性,而不是它们不可避免地导致的偏见。支持快速和节俭启发式的大部分研究使用统计模拟 ,以表明它们在一些现实生活中 可以 发挥作用,但这些启发式的心理现实的证据仍然很薄,而且有争议。与这种方法相关的最令人难忘的发现是 识别启发式,通过一个已经众所周知的例子来说明:一个被试如果被问到两个城市中哪个更大,并且认出了其中一个,那么她应该猜到她认出的那个更大。如果受试者知道她所认识的城市是大的,那么识别启发式的效果就相当好;但是如果她知道它是小的,她就会很合理地猜测那个未知的 城市是大的。与理论相反的是,受试者使用的不仅仅是识别线索。丹尼尔-M-奥本海默,"不要那么快!(和不那么节俭!)。重新思考识别启发式," 认知 90(2003)。B1-B9。该理论的一个弱点是,从我们对心智的了解来看,启发式方法没有必要是节俭的。大脑并行地处理大量的信息,心灵 ,可以在不忽略信息的情况下做到快速和准确。此外,从早期对国际象棋大师的研究开始,人们就知道技能不需要包括学习使用更少的信息。相反,技能更多时候是一种快速和有效地处理大量信息的能力。
best examples of substitution : Fritz Strack, Leonard L. Martin, and Norbert Schwarz, “Priming and Communication: Social Determinants of Information Use in Judgments of Life Satisfaction,” European Journal of Social Psychology 18 (1988): 429–42.
替换的最佳例子 。Fritz Strack, Leonard L. Martin, and Norbert Schwarz, "Priming and Communication:生活满意度判断中信息使用的社会决定因素》,《 欧洲社会心理学杂志 》18(1988)。429-42.
correlations between psychological measures : The correlation was .66.
心理测量之间的相关性 。相关性为0.66。
dominates happiness reports : Other substitution topics include marital satisfaction, job satisfaction, and leisure time satisfaction: Norbert Schwarz, Fritz Strack, and Hans-Peter Mai, “Assimilation and Contrast Effects in Part-Whole Question Sequences: A Conversational Logic Analysis,” Public Opinion Quarterly 55 (1991): 3–23.
主导了幸福报告 。其他替代性话题包括婚姻满意度、工作满意度和休闲时间满意度。Norbert Schwarz, Fritz Strack, and Hans-Peter Mai, "部分-整体问题序列的同化和对比效应。A Conversational Logic Analysis," Public Opinion Quarterly 55 (1991):3-23.
evaluate their happiness : A telephone survey conducted in Germany included a question about general happiness. When the self-reports of happiness were correlated with the local weather at the time of the interview, a pronounced correlation was found. Mood is known to vary with the weather, and substitution explains the effect on reported happiness. However, another version of the telephone survey yielded a somewhat different result. These respondents were asked about the current weather before they were asked the happiness question. For them, weather had no effect at all on reported happiness! The explicit priming of weather provided them with an explanation of their mood, undermining the connection that would normally be made between current mood and overall happiness.
评估他们的幸福 。在德国进行的一项电话调查包括一个关于一般幸福的问题。当自我报告的幸福感与采访时的当地天气相关时,发现了一个明显的相关 。众所周知,情绪会随着天气的变化而变化,而替代性解释了对报告的幸福感的影响。然而,另一个版本的电话调查产生了一个有点不同的结果。这些受访者在被问及幸福问题之前被问及当前的天气。对他们来说,天气对报告的幸福感根本没有任何影响对天气的明确提示 ,为他们提供了对其心情的解释,破坏了通常会在当前心情和整体幸福之间建立的联系。
view of the benefits : Melissa L. Finucane et al., “The Affect Heuristic in Judgments of Risks and Benefits,” Journal of Behavioral Decision Making 13 (2000): 1–17.
对利益的看法 。Melissa L. Finucane等人,"风险和利益判断中的情感启发式", 《行为决策杂志》 13(2000)。1-17.
10: THE LAW OF SMALL NUMBERS
10:小数法则
“It is both … without additives” : Howard Wainer and Harris L. Zwerling, “Evidence That Smaller Schools Do Not Improve Student Achievement,” Phi Delta Kappan 88 (2006): 300–303. The example was discussed by Andrew Gelman and Deborah Nolan, Teaching Statistics: A Bag of Tricks (New York: Oxford University Press, 2002).
"它既是......没有添加剂"。 Howard Wainer和Harris L. Zwerling,"小型学校不能提高学生成绩的证据", Phi Delta Kappan 88 (2006)。300-303.Andrew Gelman和Deborah Nolan讨论了这个例子, 统计学教学。A Bag of Tricks (New York: Oxford University Press, 2002)。
50% risk of failing : Jacob Cohen, “The Statistical Power of Abnormal-Social Psychological Research: A Review,” Journal of Abnormal and Social Psychology 65 (1962): 145–53.
50%的失败风险 。雅各布-科恩,"非正常社会心理学研究的统计能力。A Review," Journal of Abnormal and Social Psychology 65 (1962):145-53.
“Belief in the Law of Small Numbers” : Amos Tversky and Daniel Kahneman, “Belief in the Law of Small Numbers,” Psychological Bulletin 76 (1971): 105–10.
"对小数法则的信念"。 Amos Tversky和Daniel Kahneman,"相信小数法则",《 心理学通报》 76(1971)。105-10.
“statistical intuitions … whenever possible” : The contrast that we drew between intuition and computation seems to foreshadow the distinction between Systems 1 and 2, but we were a long way from the perspective of this book. We used intuition to cover anything but a computation, any informal way to reach a conclusion.
"统计直觉......只要可能"。 我们在直觉和计算之间所做的对比似乎预示了系统1和2之间的区别,但是我们 ,离本书的视角 。我们用 直觉 来涵盖除计算之外的任何东西,任何得出结论的非正式方式。
German spies : William Feller, Introduction to Probability Theory and Its Applications (New York: Wiley, 1950).
德国间谍 。William Feller, Introduction to Probability Theory and Its Applications (New York: Wiley, 1950).
randomness in basketball : Thomas Gilovich, Robert Vallone, and Amos Tversky, “The Hot Hand in Basketball: On the Misperception of Random Sequences,” Cognitive Psychology 17 (1985): 295–314.
篮球运动中的随机性 。Thomas Gilovich, Robert Vallone, and Amos Tversky, "The Hot Hand in Basketball:关于对随机序列的误解," 认知心理学 17(1985)。 295-314.
11: ANCHORS
11: 锚点
“‘reasonable’ volume” : Robyn Le Boeuf and Eldar Shafir, “The Long and Short of It: Physical Anchoring Effects,” Journal of Behavioral Decision Making 19 (2006): 393–406.
"'合理'量"。 Robyn Le Boeuf和Eldar Shafir, "The Long and Short of It: Physical Anchoring Effects," Journal of Behavioral Decision Making 19 (2006):393-406.
nod their head : Nicholas Epley and Thomas Gilovich, “Putting Adjustment Back in the Anchoring and Adjustment Heuristic: Differential Processing of Self-Generated and Experimenter-Provided Anchors,” Psychological Science 12 (2001): 391–96.
点头 。Nicholas Epley和Thomas Gilovich,"把调整放回锚定和调整启发式中。对自我产生的和实验者提供的锚的不同处理," 心理学 科学 12(2001)。391-96.
stay closer to the anchor : Epley and Gilovich, “The Anchoring-and-Adjustment Heuristic.”
靠近锚点 。Epley和Gilovich,"锚定和调整启发式"。
associative coherence : Thomas Mussweiler, “The Use of Category and Exemplar Knowledge in the Solution of Anchoring Tasks,” Journal of Personality and Social Psychology 78 (2000): 1038–52.
联想的一致性 。Thomas Mussweiler, "The Use of Category and Exemplar Knowledge in the Solution of Anchoring Tasks," Journal of Personality and Social Psychology 78 (2000):1038-52.
San Francisco Exploratorium : Karen E. Jacowitz and Daniel Kahneman, “Measures of Anchoring in Estimation Tasks,” Personality and Social Psychology Bulletin 21 (1995): 1161–66.
旧金山探索馆 。Karen E. Jacowitz和Daniel Kahneman,"估算任务中锚定的措施 ",《 人格与社会心理学公报》 21(1995)。1161-66.
substantially lower : Gregory B. Northcraft and Margaret A. Neale, “Experts, Amateurs, and Real Estate: An Anchoring-and-Adjustment Perspective on Property Pricing Decisions,” Organizational Behavior and Human Decision Processes 39 (1987): 84–97. The high anchor was 12% above the listed price, the low anchor was 12% below that price.
大幅降低 。Gregory B. Northcraft 和 Margaret A. Neale, "专家、业余爱好者和房地产。An Anchoring-and-Adjustment Perspective on Property Pricing Decisions," Organizational Behavior and Human Decision Processes 39 (1987):84-97.高位锚定比上市价格高12%,低位锚定比该价格低12%, 。
rolled a pair of dice : Birte Englich, Thomas Mussweiler, and Fritz Strack, “Playing Dice with Criminal Sentences: The Influence of Irrelevant Anchors on Experts’ Judicial Decision Making,” Personality and Social Psychology Bulletin 32 (2006): 188–200.
掷 了一双骰子 。Birte Englich, Thomas Mussweiler, and Fritz Strack, "Playing Dice with Criminal Sentences:不相关的锚对专家的司法决策的影响",《 人格与社会心理学公报》 32(2006)。188-200.
NO LIMIT PER PERSON : Brian Wansink, Robert J. Kent, and Stephen J. Hoch, “An Anchoring and Adjustment Model of Purchase Quantity Decisions,” Journal of Marketing Research 35 (1998): 71–81.
每个人都没有限制 :Brian Wansink, Robert J. Kent, and Stephen J. Hoch, "An Anchoring and Adjustment Model of Purchase Quantity Decisions," Journal of Marketing Research 35 (1998):71-81.
resist the anchoring effect : Adam D. Galinsky and Thomas Mussweiler, “First Offers as Anchors: The Role of Perspective-Taking and Negotiator Focus,” Journal of Personality and Social Psychology 81 (2001): 657–69.
抵制锚定效应 。Adam D. Galinsky和Thomas Mussweiler,"作为锚的第一次报价。Perspective-Taking and Negotiator Focus的作用," Journal of Personality and Social Psychology 81(2001)。657-69.
otherwise be much smaller : Greg Pogarsky and Linda Babcock, “Damage Caps, Motivated Anchoring, and Bargaining Impasse,” Journal of Legal Studies 30 (2001): 143–59.
否则就会小得多 。Greg Pogarsky和Linda Babcock,"损害赔偿上限、有动机的锚定和讨价还价的僵局", Journal of Legal Studies 30(2001)。143-59.
amount of damages : For an experimental demonstration, see Chris Guthrie, Jeffrey J. Rachlinski, and Andrew J. Wistrich, “Judging by Heuristic-Cognitive Illusions in Judicial Decision Making,” Judicature 86 (2002): 44–50.
损害赔偿的数额 。关于实验证明,见Chris Guthrie, Jeffrey J. Rachlinski, and Andrew J. Wistrich, "Judging by Heuristic-Cognitive Illusions in Judicial Decision Making, " Judicature 86 (2002):44-50.
12: THE SCIENCE OF AVAILABILITY
12:可用性的科学
“the ease with which” : Amos Tversky and Daniel Kahneman, “Availability: A Heuristic for Judging Frequency and Probability,” Cognitive Psychology 5 (1973): 207–32.
"的容易程度"。 Amos Tversky和Daniel Kahneman,"可用性。A Heuristic for Judging Frequency and Probability," Cognitive Psychology 5 (1973):207-32.
self-assessed contributions : Michael Ross and Fiore Sicoly, “Egocentric Biases in Availability and Attribution,” Journal of Personality and Social Psychology 37 (1979): 322–36.
自我评估的贡献 。Michael Ross和Fiore Sicoly, "Egocentric Biases in Availability and Attribution," Journal of Personality and Social Psychology 37 (1979):322-36.
A major advance : Schwarz et al., “Ease of Retrieval as Information.”
一个重大的进步 。Schwarz等人,"作为信息的检索便利性"。
role of fluency : Sabine Stepper and Fritz Strack, “Proprioceptive Determinants of Emotional and Nonemotional Feelings,” Journal of Personality and Social Psychology 64 (1993): 211–20.
流利性的作用 。Sabine Stepper 和 Fritz Strack, "Proprioceptive Determinants of Emotional and Nonemotional Feelings," Journal of Personality and Social Psychology 64 (1993):211-20.
experimenters dreamed up : For a review of this area of research, see Rainer Greifeneder, Herbert Bless, and Michel T. Pham, “When Do People Rely on Affective and Cognitive Feelings in Judgment? A Review,” Personality and Social Psychology Review 15 (2011): 107–41.
实验者梦想出来的 。关于这一领域的研究回顾,见Rainer Greifeneder, Herbert Bless, and Michel T. Pham, "When Do People Rely on Affective and Cognitive Feelings in Judgment?A Review," Personality and Social Psychology Review 15 (2011):107-41.
affect their cardiac health : Alexander Rotliman and Norbert Schwarz, “Constructing Perceptions of Vulnerability: Personal Relevance and the Use of Experimental Information in Health Judgments,” Personality and Social Psychology Bulletin 24 (1998): 1053–64.
影响他们的心脏健康 。Alexander Rotliman和Norbert Schwarz,"建构对脆弱性的认识。个人相关性和健康判断中实验信息的使用",《 人格与社会心理学公报》 24(1998)。1053-64.
effortful task at the same time : Rainer Greifeneder and Herbert Bless, “Relying on Accessible Content Versus Accessibility Experiences: The Case of Processing Capacity,” Social Cognition 25 (2007): 853–81.
在 同一时间,努力的任务 。Rainer Greifeneder和Herbert Bless,"依赖可及内容与可及经验。处理能力的案例," 社会 认知 25(2007)。853-81.
happy episode in their life : Markus Ruder and Herbert Bless, “Mood and the Reliance on the Ease of Retrieval Heuristic,” Journal of Personality and Social Psychology 85 (2003): 20–32.
他们生活中的快乐情节 。Markus Ruder和Herbert Bless, "Mood and the Reliance on the Ease of Retrieval Heuristic," Journal of Personality and Social Psychology 85 (2003):20-32.
low on a depression scale : Rainer Greifeneder and Herbert Bless, “Depression and Reliance on Ease-of-Retrieval Experiences,” European Journal of Social Psychology 38 (2008): 213–30.
抑郁症量表上的低值 。Rainer Greifeneder和Herbert Bless, "Depression and Reliance on Ease-of-Retrieval Experiences," European Journal of Social Psychology 38 (2008):213-30.
knowledgeable novices : Chezy Ofir et al., “Memory-Based Store Price Judgments: The Role of Knowledge and Shopping Experience,” Journal of Retailing 84 (2008): 414–23.
有知识的新手 。Chezy Ofir等人,"基于记忆的商店价格判断。知识和购物经验的作用," 《零售杂志》 84(2008)。414-23.
true experts : Eugene M. Caruso, “Use of Experienced Retrieval Ease in Self and Social Judgments,” Journal of Experimental Social Psychology 44 (2008): 148–55.
真正的专家 。尤金-M-卡鲁索,"在自我和社会判断中使用经验检索的便利性", 《实验社会心理学杂志》 44(2008)。148-55.
faith in intuition : Johannes Keller and Herbert Bless, “Predicting Future Affective States: How Ease of Retrieval and Faith in Intuition Moderate the Impact of Activated Content,” European Journal of Social Psychology 38 (2008): 1–10.
对直觉的信仰 。Johannes Keller和Herbert Bless,"预测未来的情感状态。 Ease of Retrieval and Faith in Intuition Moderate the Impact of Activated Content," European Journal of Social Psychology 38 (2008):1-10.
if they are … powerful : Mario Weick and Ana Guinote, “When Subjective Experiences Matter: Power Increases Reliance on the Ease of Retrieval,” Journal of Personality and Social Psychology 94 (2008): 956–70.
如果它们是......强大的 。Mario Weick和Ana Guinote,"当主观经验重要时。权力增加了对检索便利性的依赖," 《人格与社会心理学杂志 》94(2008)。956-70.
13: AVAILABILITY, EMOTION, AND RISK
13:可用性、情感和风险
because of brain damage : Damasio’s idea is known as the “somatic marker hypothesis” and it has gathered substantial support: Antonio R. Damasio, Descartes’ Error: Emotion, Reason, and the Human Brain (New York: Putnam, 1994). Antonio R. Damasio, “The Somatic Marker Hypothesis and the Possible Functions of the Prefrontal Cortex,” Philosophical Transactions: Biological Sciences 351 (1996): 141–20.
因为脑损伤 。达马西奥的想法被称为 "体细胞标记假说",它已经获得了大量的支持。安东尼奥-R-达马西奥,《 笛卡尔的错误》。情感、理性和人脑 (纽约:Putnam,1994年)。安东尼奥-R-达马西奥,"躯体标记假说和前额叶皮质的可能功能", 《哲学交易》。Biological Sciences 351 (1996):141-20.
risks of each technology : Finucane et al., “The Affect Heuristic in Judgments of Risks and Benefits.” Paul Slovic, Melissa Finucane, Ellen Peters, and Donald G. MacGregor, “The Affect Heuristic,” in Thomas Gilovich, Dale Griffin, and Daniel Kahneman, eds., Heuristics and Biases (New York: Cambridge University Press, 2002), 397–420. Paul Slovic, Melissa Finucane, Ellen Peters, and Donald G. MacGregor, “Risk as Analysis and Risk as Feelings: Some Thoughts About Affect, Reason, Risk, and Rationality,” Risk Analysis 24 (2004): 1–12. Paul Slovic, “Trust, Emotion, Sex, Politics, and Science: Surveying the Risk-Assessment Battlefield,” Risk Analysis 19 (1999): 689–701.
每项技术的风险 。Finucane等人,"风险和利益判断中的情感启发式"。Paul Slovic, Melissa Finucane, Ellen Peters, and Donald G. MacGregor, "The Affect Heuristic," in Thomas Gilovich, Dale Griffin, and Daniel Kahneman, eds., Heuristics and Biases (New York: Cambridge University Press, 2002), 397-420。Paul Slovic, Melissa Finucane, Ellen Peters, and Donald G. MacGregor, "Risk as Analysis and Risk as Feelings:关于情感、理性、风险和理性的一些思考",《 风险分析》 24(2004)。1-12.Paul Slovic, "信任、情感、性、政治和科学。调查风险评估的战场," 风险分析 19(1999)。689-701.
British Toxicology Society : Slovic, “Trust, Emotion, Sex, Politics, and Science.” The technologies and substances used in these studies are not alternative solutions to the same problem. In realistic problems, where competitive solutions are considered, the correlation between costs and benefits must be negative; the solutions that have the largest benefits are also the most costly. Whether laypeople and even experts might fail to recognize the correct relationship even in those cases is an interesting question.
英国毒理学会 。Slovic,"信任、情感、性、政治和科学"。这些研究中使用的技术和物质并不是同一问题的替代 。在现实的问题中,如果考虑到竞争性的解决方案,成本和效益之间的相关性一定是负的;具有最大效益的解决方案也是最昂贵的。即使在这些情况下,非专业人士甚至专家是否可能认识不到正确的关系,这是一个有趣的问题。
“wags the rational dog” : Jonathan Haidt, “The Emotional Dog and Its Rational Tail: A Social Institutionist Approach to Moral Judgment,” Psychological Review 108 (2001): 814–34.
"摇动理性的狗"。 Jonathan Haidt, "The Emotional Dog and Its Rational Tail: A Social Institutionist Approach to Moral Judment, " Psychological Review 108 (2001):814-34.
“‘Risk’ does not exist” : Paul Slovic, The Perception of Risk (Sterling, VA: EarthScan, 2000).
"'风险'并不存在"。 Paul Slovic, The Perception of Risk (Sterling, VA: EarthScan, 2000)。
availability cascade : Timur Kuran and Cass R. Sunstein, “Availability Cascades and Risk Regulation,” Stanford Law Review 51 (1999): 683–768. CERCLA , the Comprehensive Environmental Response, Compensation, and Liability Act, passed in 1980.
可用性级联 。Timur Kuran和Cass R. Sunstein, "Availability Cascades and Risk Regulation," Stanford Law Review 51 (1999):683-768. CERCLA ,综合 环境反应、赔偿和责任法案,1980年通过。
nothing in between : Paul Slovic, who testified for the apple growers in the Alar case, has a rather different view: “The scare was triggered by the CBS 60 Minutes broadcast that said 4, 000 children will die of cancer (no probabilities there) along with frightening pictures of bald children in a cancer ward—and many more incorrect statements. Also the story exposed EPA’s lack of competence in attending to and evaluating the safety of Alar, destroying trust in regulatory control. Given this, I think the public’s response was rational.” (Personal communication, May 11, 2011.)
两者之间没有任何关系 。在阿拉尔案中为苹果种植者作证的保罗-斯洛维奇有一个相当不同的观点。"恐慌是由哥伦比亚广播公司的 《60分钟》 节目引发的,该节目说有4000名儿童将死于癌症(没有概率),同时还有令人恐惧的光头儿童在癌症病房的照片,以及更多 不正确的说法。此外,该报道还暴露了环保局在关注和评估Alar的安全性方面缺乏能力,破坏了对监管的信任。鉴于此,我认为公众的反应是理性的"。(个人通讯,2011年5月11日。)
14: TOM W’S SPECIALTY
14: Tom W的专长
“a shy poetry lover” : I borrowed this example from Max H. Bazerman and Don A. Moore, Judgment in Managerial Decision Making (New York: Wiley, 2008).
"一个害羞的诗歌爱好者"。 我从Max H. Bazerman和Don A. Moore, Judgment in Managerial Decision Making (New York: Wiley, 2008)中借用了这个例子。
always weighted more : Jonathan St. B. T. Evans, “Heuristic and Analytic Processes in Reasoning,” British Journal of Psychology 75 (1984): 451–68.
总是加权更多 。Jonathan St. B. T. Evans, "Heuristic and Analytic Processes in Reasoning," British Journal of Psychology 75 (1984):451-68.
the opposite effect : Norbert Schwarz et al., “Base Rates, Representativeness, and the Logic of Conversation: The Contextual Relevance of ‘Irrelevant’ Information,” Social Cognition 9 (1991): 67–84.
相反的效果 。Norbert Schwarz等人,"基准率、代表性和对话的逻辑。不相关 "信息的背景相关性", 社会认知 9(1991)。67-84.
told to frown : Alter, Oppenheimer, Epley, and Eyre, “Overcoming Intuition.”
被告知要皱眉头 。Alter, Oppenheimer, Epley, and Eyre, "克服直觉"。
Bayes’s rule : The simplest form of Bayes’s rule is in odds form, posterior odds = prior odds × likelihood ratio, where the posterior odds are the odds (the ratio of probabilities) for two competing hypotheses. Consider a problem of diagnosis. Your friend has tested positive for a serious disease. The disease is rare: only 1 in 600 of the cases sent in for testing actually has the disease. The test is fairly accurate. Its likelihood ratio is 25:1, which means that the probability that a person who has the disease will test positive is 25 times higher than the probability of a false positive. Testing positive is frightening news, but the odds that your friend has the disease have risen only from 1/600 to 25/600, and the probability is 4%.
贝叶斯法则 。贝叶斯规则的最简单形式是赔率形式,后验赔率=先验赔率×似然率,其中后验赔率是两个竞争假设的赔率(概率之比)。考虑一个诊断的问题。你的朋友对一种严重的疾病检测呈阳性。这种疾病很罕见:在 送去检测的病例中,只有1/600真正患有这种疾病。该测试是相当准确的。它的可能性比为25:1,这意味着一个患有该疾病的人检测出阳性的概率比假阳性的概率高25倍。检测结果呈阳性是一个可怕的消息,但是你的朋友患这种疾病的概率只从1/600上升到25/600,概率是 4%。
For the hypothesis that Tom W is a computer scientist, the prior odds that correspond to a base rate of 3% are (.03/.97 = .031). Assuming a likelihood ratio of 4 (the description is 4 times as likely if Tom W is a computer scientist than if he is not), the posterior odds are 4 × .031 = 12.4. From these odds you can compute that the posterior probability of Tom W being a computer scientist is now 11% (because 12.4/112.4 = .11).
对于汤姆-W是一名计算机科学家的假设,对应于3%的基本比率的先验几率是(.03/.97=0.031)。假设可能性比为4(如果汤姆-W是计算机科学家,描述的可能性是他不是的4倍),后验几率为4×0.031=12.4。根据这些几率,你可以计算出汤姆-W是计算机科学家的后验概率 ,现在是11%(因为12.4/112.4=0.11)。
15: LINDA: LESS IS MORE
15: 林达:少即是多
the role of heuristics : Amos Tversky and Daniel Kahneman, “Extensional Versus Intuitive Reasoning: The Conjunction Fallacy in Probability Judgment,” Psychological Review 90(1983), 293-315.
启发式的作用 。Amos Tversky和Daniel Kahneman,"扩展性推理与直觉推理。概率判断中的连接谬误",《 心理学评论》 90(1983),293-315。
“a little homunculus” : Stephen Jay Gould, Bully for Brontosaurus (New York: Norton, 1991).
"一个小同胞"。 Stephen Jay Gould, Bully for Brontosaurus (New York: Norton, 1991).
weakened or explained : See, among others, Ralph Hertwig and Gerd Gigerenzer, “The ‘Conjunction Fallacy’ Revisited: How Intelligent Inferences Look Like Reasoning Errors,” Journal of Behavioral Decision Making 12 (1999): 275–305; Ralph Hertwig, Bjoern Benz, and Stefan Krauss, “The Conjunction Fallacy and the Many Meanings of And,” Cognition 108 (2008): 740–53.
削弱或解释 。除其他外,见Ralph Hertwig 和Gerd Gigerenzer, "The 'Conjunction Fallacy' Revisited:智能推断如何看起来像推理错误," 《行为决策杂志 》12(1999)。275-305;Ralph Hertwig, Bjoern Benz, and Stefan Krauss, "The Conjunction Fallacy and the Many Meanings of And, " Cognition 108 (2008):740-53.
settle our differences : Barbara Mellers, Ralph Hertwig, and Daniel Kahneman, “Do Frequency Representations Eliminate Conjunction Effects? An Exercise in Adversarial Collaboration,” Psychological Science 12 (2001): 269–75.
解决我们的分歧 。Barbara Mellers, Ralph Hertwig, and Daniel Kahneman, "Do Frequency Representations Eliminate Conjunction Effects?An Exercise in Adversarial Collaboration," Psychological Science 12 (2001):269-75.
16: CAUSES TRUMP STATISTICS
16:原因胜过统计数据
correct answer is 41% : Applying Bayes’s rule in odds form, the prior odds are the odds for the Blue cab from the base rate, and the likelihood ratio is the ratio of the probability of the witness saying the cab is Blue if it is Blue, divided by the probability of the witness saying the cab is Blue if it is Green: posterior odds = (.15/.85) × (.80/.20) = .706. The odds are the ratio of the probability that the cab is Blue, divided by the probability that the cab is Green. To obtain the probability that the cab is Blue, we compute: Probability(Blue) = .706/1.706 = .41. The probability that the cab is Blue is 41%.
正确答案是41% 。以几率的形式应用贝叶斯法则,先验几率是基数中蓝色出租车的几率,可能性比是证人说出租车是蓝色的概率,除以证人说出租车是绿色的概率 :后验几率=(.15/.85)×(.80/.20)=.706。赔率是出租车是蓝色的概率除以出租车是绿色的概率的比率。为了得到出租车是蓝色的概率,我们计算一下。概率(蓝色)=.706/1.706=.41。出租车是蓝色的概率是41%。
not too far from the Bayesian : Amos Tversky and Daniel Kahneman, “Causal Schemas in Judgments Under Uncertainty,” in Progress in Social Psychology , ed. Morris Fishbein (Hillsdale, NJ: Erlbaum, 1980), 49–72.
与贝叶斯 ,相差 不大 。Amos Tversky和Daniel Kahneman, "Causal Schemas in Judgments Under Uncertainty," in Progress in Social Psychology , ed.Morris Fishbein(Hillsdale, NJ: Erlbaum, 1980),49-72。
University of Michigan : Richard E. Nisbett and Eugene Borgida, “Attribution and the Psychology of Prediction,” Journal of Personality and Social Psychology 32 (1975): 932–43.
密歇根大学 。Richard E. Nisbett和Eugene Borgida,"归因和预测心理学", 《人格与社会心理学杂志 》32(1975)。932-43.
relieved of responsibility : John M. Darley and Bibb Latane, “Bystander Intervention in Emergencies: Diffusion of Responsibility,” Journal of Personality and Social Psychology 8 (1968): 377–83.
解除了责任 。John M. Darley 和 Bibb Latane, "Bystander Intervention in Emergencies:责任的扩散", 《人格与社会心理学杂志 》8(1968)。377-83.
17: REGRESSION TO THE MEAN
17:向平均数倒退
help of the most brilliant statisticians : Michael Bulmer, Francis Galton: Pioneer of Heredity and Biometry (Baltimore: Johns Hopkins University Press, 2003).
最杰出的统计学家的帮助 。Michael Bulmer, Francis Galton:遗传学和生物统计学的先驱 (巴尔的摩:约翰霍普金斯大学出版社,2003)。
standard scores : Researchers transform each original score into a standard score by subtracting the mean and dividing the result by the standard deviation. Standard scores have a mean of zero and a standard deviation of 1, can be compared across variables (especially when the statistical distributions of the original scores are similar), and have many desirable mathematical properties, which Galton had to work out to understand the nature of correlation and regression.
标准分 。研究人员通过减去平均值并将结果除以标准差,将每个原始分数 ,转化为标准分数。标准分数的平均值为0,标准差为1,可以在不同的变量之间进行比较(特别是当原始分数的统计分布相似时),并具有许多理想的数学特性,高尔顿为了理解相关性和 回归的性质,不得不把这些特性弄清楚。
correlation between parent and child : This will not be true in an environment in which some children are malnourished. Differences in nutrition will become important, the proportion of shared factors will diminish, and with it the correlation between the height of parents and the height of children (unless the parents of malnourished children were also stunted by hunger in childhood).
父母和孩子之间的相互关系 。在一些儿童营养不良的环境中,这一点将不会是真的。营养方面的差异将变得很重要,共同因素的比例将减少,父母的身高和孩子的身高之间的相关性也随之减少(除非营养不良孩子的父母在童年时也因饥饿而发育不良)。
height and weight : The correlation was computed for a very large sample of the population of the United States (the Gallup-Healthways Well-Being Index).
身高和体重 。相关性是针对美国人口的一个非常大的样本(盖洛普健康状况指数)计算出来的。
income and education : The correlation appears impressive, but I was surprised to learn many years ago from the sociologist Christopher Jencks that if everyone had the same education, the inequality of income (measured by standard deviation) would be reduced only by about 9%. The relevant formula is √(1−r 2 ), where r is the correlation.
收入和教育 。这种相关性似乎令人印象深刻,但多年前我惊讶地从社会学家克里斯托弗-詹克斯那里得知,如果每个人都有相同的教育,收入的不平等(以标准差衡量) ,只能减少约9%。相关的公式是√(1-r2),其中 r 是相关度。
correlation and regression : This is true when both variables are measured in standard scores—that is, where each score is transformed by removing the mean and dividing the result by the standard deviation.
相关性和回归性 。当两个变量都以标准分来衡量时,这一点是正确的,也就是说,每个分数都是通过去除平均值并将结果除以标准差来转换的。
confusing mere correlation with causation : Howard Wainer, “The Most Dangerous Equation,” American Scientist 95 (2007): 249–56.
将单纯的相关关系与因果关系混淆 。Howard Wainer, "The Most Dangerous Equation," American Scientist 95 (2007):249-56.
18: TAMING INTUITIVE PREDICTIONS
18: 驯服直观的预测
far more moderate : The proof of the standard regression as the optimal solution to the prediction problem assumes that errors are weighted by the squared deviation from the correct value. This is the least-squares criterion, which is commonly accepted. Other loss functions lead to different solutions.
远为温和 。将标准回归作为预测问题的最佳解决方案的证明,假定错误是由与正确值的平方偏差来加权的。这是普遍接受的最小二乘法准则。其他损失函数会导致不同的解决方案。
19: THE ILLUSION OF UNDERSTANDING
19: 理解的幻觉
narrative fallacy : Nassim Nicholas Taleb, The Black Swan : The Impact of the Highly Improbable (New York: Random House, 2007).
叙述性谬误 。纳西姆-尼古拉斯-塔勒布,《 黑天鹅 : 极不可能的影响 》(纽约:兰登书屋,2007)。
one attribute that is particularly significant : See chapter 7 .
有一个属性是特别重要的 。见 第七章 。
throwing the ball : Michael Lewis, Moneyball: The Art of Winning an Unfair Game (New York: Norton, 2003).
扔球 。Michael Lewis, Moneyball:赢得不公平游戏的艺术 》(纽约:诺顿,2003)。
sell their company : Seth Weintraub, “Excite Passed Up Buying Google for $750,000 in 1999,” Fortune , September 29, 2011.
卖掉他们的公司 。Seth Weintraub, "Excite在1999年以75万美元的价格放弃了对谷歌的收购," 财富 ,2011年9月29日。
ever felt differently : Richard E. Nisbett and Timothy D. Wilson, “Telling More Than We Can Know: Verbal Reports on Mental Processes,” Psychological Review 84 (1977): 231–59.
曾经有不同的感受 。Richard E. Nisbett和Timothy D. Wilson, "Telling More Than We Can Know:关于心理过程的口头报告》,《 心理学评论》 84(1977)。231-59.
United States and the Soviet Union : Baruch Fischhoff and Ruth Beyth, “I Knew It Would Happen: Remembered Probabilities of Once Future Things,” Organizational Behavior and Human Performance 13 (1975): 1–16.
美国和苏联 。Baruch Fischhoff和Ruth Beyth, "I Knew It Would Happen:记忆中曾经的未来事物的概率",《 组织行为与人类绩效 》13(1975)。 1-16.
quality of a decision : Jonathan Baron and John C. Hershey, “Outcome Bias in Decision Evaluation,” Journal of Personality and Social Psychology 54 (1988): 569–79.
决策的质量 。Jonathan Baron和John C. Hershey,"决策评估中的结果偏差", 《人格与社会心理学杂志 》54(1988)。569-79.
should have hired the monitor : Kim A. Kamin and Jeffrey Rachlinski, “Ex Post ≠ Ex Ante: Determining Liability in Hindsight,” Law and Human Behavior 19 (1995): 89–104. Jeffrey J. Rachlinski, “A Positive Psychological Theory of Judging in Hindsight,” University of Chicago Law Review 65 (1998): 571–625.
应该雇用监督员 。Kim A. Kamin 和 Jeffrey Rachlinski, "事后≠事前。在事后确定责任," 法律与人类行为 19(1995)。89-104.Jeffrey J. Rachlinski, "A Positive Psychological Theory of Judging in Hindsight," University of Chicago Law Review 65 (1998):571-625.
tidbit of intelligence : Jeffrey Goldberg, “Letter from Washington: Woodward vs. Tenet,” New Yorker , May 21, 2007, 35–38. Also Tim Weiner, Legacy of Ashes: The History of the CIA (New York: Doubleday, 2007); “Espionage: Inventing the Dots,” Economist , November 3, 2007, 100.
情报的花边新闻 。杰弗里-戈德堡,"来自华盛顿的信。伍德沃德与特尼特》,《 纽约客》 ,2007年5月21日,35-38。还有蒂姆-韦纳,《 灰烬的遗产》。中情局的历史 》(纽约:Doubleday,2007);《间谍活动。发明点滴》,《 经济学家》 ,2007年11月3日,100。
reluctance to take risks : Philip E. Tetlock, “Accountability: The Neglected Social Context of Judgment and Choice,” Research in Organizational Behavior 7 (1985): 297–332.
不愿承担风险 。Philip E. Tetlock, "Accountability:被忽视的判断和选择的社会背景," 《组织行为研究 》7(1985)。297-332.
before their current appointment : Marianne Bertrand and Antoinette Schoar, “Managing with Style: The Effect of Managers on Firm Policies,” Quarterly Journal of Economics 118 (2003): 1169–1208. Nick Bloom and John Van Reenen, “Measuring and Explaining Management Practices Across Firms and Countries,” Quarterly Journal of Economics 122 (2007): 1351–1408.
在他们目前的任命之前 。Marianne Bertrand和Antoinette Schoar,"用风格管理。经理人对公司政策的影响》,《 经济学季刊》 118(2003)。1169-1208.Nick Bloom和John Van Reenen,"衡量和解释管理实践 ,跨公司和国家," 《 经济学季刊 》122(2007)。1351-1408.
“How often will you find …” : I am indebted to Professor James H. Steiger of Vanderbilt University, who developed an algorithm that answers this question, under plausible assumptions. Steiger’s analysis shows that correlations of .20 and .40 are associated, respectively, with inversion rates of 43% and 37%.
"你多长时间会发现......"。 我感谢范德比尔特大学的James H. Steiger教授,他开发了一种算法,在合理的假设下,回答了这个问题。Steiger的分析表明,0.20和0.40的相关性分别与43%和37%的反转率有关。
his penetrating book : The Halo Effect was praised as one of the best business books of the year by both the Financial Times and The Wall Street Journal : Phil Rosenzweig, The Halo Effect: … and the Eight Other Business Delusions That Deceive Managers (New York: Simon & Schuster, 2007). See also Paul Olk and Phil Rosenzweig, “The Halo Effect and the Challenge of Management Inquiry: A Dialog Between Phil Rosenzweig and Paul Olk,” Journal of Management Inquiry 19 (2010): 48–54.
他极具穿透力的书 。 晕轮效应》 被《 金融时报 》和 《华尔街日报》 赞誉为年度最佳商业书籍之一。Phil Rosenzweig, The Halo Effect: ... and the Eight Other Business Delusions That Deceive Managers (New York: Simon & Schuster, 2007).另见Paul Olk和Phil Rosenzweig,"光环效应和管理调查的挑战 。Phil Rosenzweig和 Paul Olk之间的对话," 《管理探索》杂志 19(2010)。48-54.
“a visionary company” : James C. Collins and Jerry I. Porras, Built to Last: Successful Habits of Visionary Companies (New York: Harper, 2002).
"一个有远见的公司"。 James C. Collins和Jerry I. Porras, Built to Last: Successful Habits of Visionary Companies (New York: Harper, 2002).
flip of a coin : In fact, even if you were the CEO yourself, your forecasts would not be impressively reliable; the extensive research on insider trading shows that executives do beat the market when they trade their own stock, but the margin of their outperformance is barely enough to cover the costs of trading. See H. Nejat Seyhun, “The Information Content of Aggregate Insider Trading,” Journal of Business 61 (1988): 1–24; Josef Lakonishok and Inmoo Lee, “Are Insider Trades Informative?” Review of Financial Studies 14 (2001): 79–111; Zahid Iqbal and Shekar Shetty, “An Investigation of Causality Between Insider Transactions and Stock Returns,” Quarterly Review of Economics and Finance 42 (2002): 41–57.
抛硬币 :事实上,即使你自己是首席执行官,你的预测也不会令人印象深刻地可靠;关于内幕交易的广泛研究表明,当高管们交易自己的股票时,他们确实战胜了市场 ,但他们的表现幅度几乎不足以支付交易的成本。见H.Nejat Seyhun, "The Information Content of Aggregate Insider Trading," Journal of Business 61 (1988):1-24;Josef Lakonishok 和 Inmoo Lee, "Are Insider Trades Informative? 金融研究评论 》14(2001)。79-111;Zahid Iqbal和Shekar Shetty,"内幕交易和股票收益之间的因果关系的调查 ," 经济和金融季度评论 42(2002)。41-57.
In Search of Excellence : Rosenzweig, The Halo Effect .
寻找卓越 。Rosenzweig, The Halo Effect .
“Most Admired Companies” : Deniz Anginer, Kenneth L. Fisher, and Meir Statman, “Stocks of Admired Companies and Despised Ones,” working paper, 2007.
"最受赞赏的公司"。 Deniz Anginer, Kenneth L. Fisher, and Meir Statman, "Stocks of Admired Companies and Despised Ones," working paper, 2007.
regression to the mean : Jason Zweig observes that the lack of appreciation for regression has detrimental implications for the recruitment of CEOs. Struggling firms tend to turn to outsiders, recruiting CEOs from companies with high recent returns. The incoming CEO then gets credit, at least temporarily, for his new firm’s subsequent improvement. (Meanwhile, his replacement at his former firm is now struggling, leading the new bosses to believe that they definitely hired “the right guy.”) Anytime a CEO jumps ship, the new company must buy out his stake (in stock and options) at his old firm, setting a baseline for future compensation that has nothing to do with performance at the new firm. Tens of millions of dollars in compensation get awarded for “personal” achievements that are driven mainly by regression and halo effects (personal communication, December 29, 2009).
向平均值回归 。杰森-茨威格观察到,缺乏对回归的欣赏 ,对招聘首席执行官有不利的影响。陷入困境的公司倾向于求助于外部人士,从近期回报率高的公司招聘CEO。然后,即将上任的首席执行官会因为他的新公司随后的改善而获得荣誉,至少是暂时的。(与此同时,他在前公司的继任者现在也在挣扎,导致新老板相信他们肯定 ,雇用了 "正确的人")。任何时候CEO跳槽,新公司都必须买断他在旧公司的股份(股票和期权),为未来的报酬设定一个基线,而这与新公司的业绩无关。数以千万美元计的报酬被授予 "个人 "成就,而这些成就主要是由回归和光环效应驱动的(个人通信,2009年12月 29)。
20: THE ILLUSION OF VALIDITY
20: 有效性的幻觉
this startling conclusion : Brad M. Barber and Terrance Odean, “Trading Is Hazardous to Your Wealth: The Common Stock Investment Performance of Individual Investors,” Journal of Finance 55 (2002): 773–806.
这个惊人的结论 。Brad M. Barber和Terrance Odean,"交易对你的财富是有害的。个人投资者的普通股投资表现》,《 金融杂志 》55(2002)。773-806.
men acted on their useless ideas : Brad M. Barber and Terrance Odean, “Boys Will Be Boys: Gender, Overconfidence, and Common Stock Investment,” Quarterly Journal of Economics 116 (2006): 261–92.
男人按照他们无用的想法行事 。Brad M. Barber 和 Terrance Odean, "Boys Will Be Boys: Gender, Overconfidence, and Common Stock Investment," Quarterly Journal of Economics 116 (2006):261-92.
selling “winners” : This “disposition effect” is discussed further in chapter 32 .
出售 "赢家"。 这种 "处置效应 "将在 第32章 进一步讨论。
responding to news : Brad M. Barber and Terrance Odean, “All That Glitters: The Effect of Attention and News on the Buying Behavior of Individual and Institutional Investors,” Review of Financial Studies 21 (2008): 785–818.
对新闻的反应 :Brad M. Barber 和 Terrance Odean, "All That Glitters:注意力和新闻对个人和机构投资者购买行为的影响》,《 金融研究评论 》21(2008)。785-818.
wealth from amateurs : Research on stock trades in Taiwan concluded that the transfer of wealth from individuals to financial institutions amounts to a staggering 2.2% of GDP: Brad M. Barber, Yi-Tsung Lee, Yu-Jane Liu, and Terrance Odean, “Just How Much Do Individual Investors Lose by Trading?” Review of Financial Studies 22 (2009): 609–32.
来自业余爱好者的财富 。对 台湾的股票交易的研究得出结论,从个人到金融机构的财富转移达到了惊人的GDP的2.2%。Brad M. Barber, Yi-Tsung Lee, Yu-Jane Liu, and Terrance Odean, "个人投资者通过交易损失了多少钱?" 金融研究评论 》22(2009)。609-32.
underperform the overall market : John C. Bogle, Common Sense on Mutual Funds: New Imperatives for the Intelligent Investor (New York: Wiley, 2000), 213.
表现不如整体市场 。John C. Bogle, Common Sense on Mutual Funds:智能 投资者的新要求 (纽约:Wiley,2000),213。
persistent differences in skill : Mark Grinblatt and Sheridan Titman, “The Persistence of Mutual Fund Performance,” Journal of Finance 42 (1992): 1977–84. Edwin J. Elton et al., “The Persistence of Risk-Adjusted Mutual Fund Performance,” Journal of Business 52 (1997): 1–33. Edwin Elton et al., “Efficiency With Costly Information: A Re-interpretation of Evidence from Managed Portfolios,” Review of Financial Studies 6 (1993): 1–21.
技巧的持久性差异 。Mark Grinblatt和Sheridan Titman,"共同基金业绩的持久性",《 金融杂志 》42(1992)。1977-84.Edwin J. Elton等人,"风险调整后的共同基金业绩的持久性",《 商业杂志 》52(1997)。 1-33.埃德温-埃尔顿等人,"有成本的信息的效率。重新解释证据 ,《 金融研究评论 》6(1993)。1-21.
“In this age of academic hyperspecialization” : Philip E. Tetlock, Expert Political Judgment: How Good is It? How Can We Know? (Princeton: Princeton University Press, 2005), 233.
"在这个学术过度专业化的时代"。 Philip E. Tetlock, Expert Political Judgment:它有多好?我们怎么能知道? (Princeton: Princeton University Press, 2005), 233.
21: INTUITIONS VS. FORMULAS
21:直觉vs.公式
“There is no controversy” : Paul Meehl, “Causes and Effects of My Disturbing Little Book,” Journal of Personality Assessment 50 (1986): 370–75.
"不存在争议"。 Paul Meehl,"我那本令人不安的小书的原因和影响", 《人格评估杂志 》 50(1986)。370-75.
a factor of 10 or more : During the 1990–1991 auction season, for example, the price in London of a case of 1960 Château Latour averaged $464; a case of the 1961 vintage (one of the best ever) fetched an average of $5,432.
10倍或更多 。例如,在1990-1991年的拍卖季节,一箱1960年的拉图酒庄的价格平均为464美元;一箱1961年的葡萄酒(有史以来最好的葡萄酒之一)的价格平均为5432美元。
Experienced radiologists : Paul J. Hoffman, Paul Slovic, and Leonard G. Rorer, “An Analysis-of-Variance Model for the Assessment of Configural Cue Utilization in Clinical Judgment,” Psychological Bulletin 69 (1968): 338–39.
有经验的放射科医生 。Paul J. Hoffman, Paul Slovic, and Leonard G. Rorer, "An Analysis of Variance Model for the Assessment of Configural Cue Utilization in Clinical Judgment," Psychological Bulletin 69 (1968):338-39.
internal corporate audits : Paul R. Brown, “Independent Auditor Judgment in the Evaluation of Internal Audit Functions,” Journal of Accounting Research 21 (1983): 444–55.
公司内部审计 。Paul R. Brown, "独立审计师对内部审计职能的评价的判断",《 会计研究杂志》 21 (1983)。444-55.
41 separate studies : James Shanteau, “Psychological Characteristics and Strategies of Expert Decision Makers,” Acta Psychologica 68 (1988): 203–15.
41项独立的研究 。James Shanteau,"专家决策者的心理特征和策略", Acta Psychologica 68(1988)。203-15.
successive food breaks : Danziger, Levav, and Avnaim-Pesso, “Extraneous Factors in Judicial Decisions.”
连续断粮 。Danziger, Levav, and Avnaim-Pesso, "司法判决的外在因素"。
lowering validity : Richard A. DeVaul et al., “Medical-School Performance of Initially Rejected Students,” JAMA 257 (1987): 47–51. Jason Dana and Robyn M. Dawes, “Belief in the Unstructured Interview: The Persistence of an Illusion,” working paper, Department of Psychology, University of Pennsylvania, 2011. William M. Grove et al., “Clinical Versus Mechanical Prediction: A Meta-Analysis,” Psychological Assessment 12 (2000): 19–30.
降低有效性 。Richard A. DeVaul等人,"最初被拒绝学生的医学院表现",《 美国医学会杂志 》257(1987)。47-51.Jason Dana和Robyn M. Dawes,"对非结构化访谈的信任。一个幻觉的持续存在,"工作文件,宾夕法尼亚大学心理学系,2011。William M. Grove等人,"临床预测与机械预测。A Meta-Analysis," Psychological Assessment 12 (2000):19-30.
Dawes’s famous article : Robyn M. Dawes, “The Robust Beauty of Improper Linear Models in Decision Making,” American Psychologist 34 (1979): 571–82.
Dawes的著名文章 。Robyn M. Dawes, "决策中不正确的线性模型的稳健之美", 美国心理学家 34(1979)。571-82.
not affected by accidents of sampling : Jason Dana and Robyn M. Dawes, “The Superiority of Simple Alternatives to Regression for Social Science Predictions,” Journal of Educational and Behavioral Statistics 29 (2004): 317–31.
不受取样事故的影响 。Jason Dana和Robyn M. Dawes,"社会 科学预测的简单替代物的优越性", 《教育和行为统计杂志》 29(2004)。317-31.
Dr. Apgar : Virginia Apgar, “A Proposal for a New Method of Evaluation of the Newborn Infant,” Current Researches in Anesthesia and Analgesia 32 (1953): 260–67. Mieczyslaw Finster and Margaret Wood, “The Apgar Score Has Survived the Test of Time,” Anesthesiology 102 (2005): 855–57.
阿普加博士 Virginia Apgar, "A Proposal for a New Method of Evaluation of the Newborn Infant," Current Researches in Anesthesia and Analgesia 32 (1953):260-67.Mieczyslaw Finster和Margaret Wood,"阿普加评分经受住了时间的考验," Anesthesiology 102(2005)。855-57.
virtues of checklists : Atul Gawande, The Checklist Manifesto: How to Get Things Right (New York: Metropolitan Books, 2009).
检查表的优点 。Atul Gawande, The Checklist Manifesto:如何把事情做对 》(纽约:大都会出版社,2009年)。
organic fruit : Paul Rozin, “The Meaning of ‘Natural’: Process More Important than Content,” Psychological Science 16 (2005): 652–58.
有机水果 。Paul Rozin,"'自然'的含义。过程比内容更重要," 《心理科学》 16(2005)。652-58.
22: EXPERT INTUITION: WHEN CAN WE TRUST IT?
22:专家的直觉:我们何时能相信它?
moderated by an arbiter : Mellers, Hertwig, and Kahneman, “Do Frequency Representations Eliminate Conjunction Effects?”
由一个仲裁者主持 。Mellers, Hertwig, and Kahneman, "频率表征是否能消除联结效应?"
articulated this position : Klein, Sources of Power .
阐述了这一立场 。克莱因,《 权力的来源 》。
kouros : The Getty Museum in Los Angeles brings in the world’s leading experts on Greek sculpture to view a kouros—a marble statue of a striding boy—that it is about to buy. One after another, the experts react with what one calls “intuitive repulsion”—a powerful hunch that the kouros is not 2,500 years old but a modern fake. None of the experts can immediately say why they think the sculpture is a forgery. The closest any of them could come to a rationale is an Italian art historian’s complaint that something—he does not know exactly what—“seemed wrong” with the statue’s fingernails. A famous American expert said that the first thought that came to his mind was the word fresh , and a Greek expert flatly stated, “Anyone who has ever seen a sculpture coming out of the ground could tell that that thing has never been in the ground.” The lack of agreement on the reasons for the shared conclusion is striking, and rather suspect.
kouros 。洛杉矶的盖蒂博物馆请来了世界上最顶尖的希腊雕塑专家,观看它准备购买的一个库罗斯--一个大步流星的大理石雕像。一个 ,专家们的反应是一个所谓的 "直觉上的排斥"--一种强烈的预感,即这个库罗斯不是2500年的,而是一个现代的赝品。没有一个专家 ,可以立即说出他们认为雕塑是赝品的原因。他们中最接近的理由是一位意大利艺术史学家抱怨说,雕像的指甲 "似乎有问题",他也不知道具体是什么。一位著名的美国专家说,他脑海中浮现的第一个念头是 新鲜 这个词,而一位希腊专家则直截了当地说:"任何一个见过 ,从地下出来的雕塑的人都可以看出,那东西从来没有在地下呆过"。在共同结论的原因上缺乏一致,这一点令人震惊,而且相当可疑。
admired as a hero : Simon was one of the towering intellectual figures of the twentieth century. He wrote a classic on decision making in organizations while still in his twenties, and among many other achievements he went on to be one of the founders of the field of artificial intelligence, a leader in cognitive science, an influential student of the process of scientific discovery, a forerunner of behavioral economics and, almost incidentally, a Nobel laureate in economics.
西蒙是 一位令人钦佩的英雄 :西蒙是二十世纪最伟大的知识分子之一。他在20多岁时就写下了关于组织决策的经典著作,在其他许多成就中, ,他后来成为人工智能领域的创始人之一,认知科学的领导者,对科学发现过程有影响力的学生,行为经济学的先驱,并且几乎是偶然地成为诺贝尔经济学奖得主。
“nothing less than recognition” : Simon, “What Is an Explanation of Behavior?” David G. Myers, Intuition: Its Powers and Perils (New Haven: Yale University Press, 2002), 56.
"无非是承认"。 西蒙,"什么是行为的解释?"David G. Myers, Intuition:Its Powers and Perils (New Haven: Yale University Press, 2002), 56.
“without knowing how he knows” : Seymour Epstein, “Demystifying Intuition: What It Is, What It Does, How It Does It,” Psychological Inquiry 21 (2010): 295–312.
"不知道他是怎么知道的"。 西摩-爱泼斯坦,"揭开直觉的神秘面纱。它是什么,它做什么,它如何做," 心理学探索 21(2010)。295-312.
10,000 hours : Foer, Moonwalking with Einstein .
10,000小时 。福尔,《 与爱因斯坦的月球漫步 》。
23: THE OUTSIDE VIEW
23:外面的风景
inside view and the outside view : The labels are often misunderstood. Numerous authors believed that the correct terms were “insider view” and “outsider view,” which are not even close to what we had in mind.
内观和外观 。这些标签经常被误解。众多作者认为,正确的术语是 "内部人 "和 "外部人的观点",这与我们所想的根本不一样。
very different answers : Dan Lovallo and Daniel Kahneman, “Timid Choices and Bold Forecasts: A Cognitive Perspective on Risk Taking,” Management Science 39 (1993): 17–31. Daniel Kahneman and Dan Lovallo, “Delusions of Success: How Optimism Undermines Executives’ Decisions,” Harvard Business Review 81 (2003): 56–63.
非常不同的答案 。丹-洛瓦洛和丹尼尔-卡尼曼,"迟钝的选择和大胆的预测。关于风险承担的认知视角," 管理科学 39(1993)。17-31.丹尼尔-卡尼曼和丹-洛瓦洛,"成功的妄想。乐观主义如何破坏高管们的决策》,《 哈佛商业评论》 81期(2003年)。56-63.
“Pallid” statistical information : Richard E. Nisbett and Lee D. Ross, Human Inference: Strategies and Shortcomings of Social Judgment (Englewood Cliffs, NJ: Prentice-Hall, 1980).
"苍白 "的统计信息 :Richard E. Nisbett和Lee D. Ross, Human Inference:社会判断的策略和缺点 (Englewood Cliffs, NJ: Prentice-Hall, 1980)。
impersonality of procedures : For an example of the doubts about evidence-based medicine, see Jerome Groopman, How Doctors Think (New York: Mariner Books, 2008), 6.
程序的非个性化 。关于对循证医学的怀疑的例子,见Jerome Groopman, How Doctors Think (New York: Mariner Books, 2008), 6。
planning fallacy : Daniel Kahneman and Amos Tversky, “Intuitive Prediction: Biases and Corrective Procedures,” Management Science 12 (1979): 313–27.
规划谬误 。Daniel Kahneman和Amos Tversky,"直觉预测。Biases and Corrective Procedures," Management Science 12 (1979):313-27.
Scottish Parliament building : Rt. Hon. The Lord Fraser of Carmyllie, “The Holyrood Inquiry, Final Report,” September 8, 2004, www.holyroodinquiry.org/FINAL_report/report.htm .
苏格兰议会大厦 。Rt. Hon. The Lord Fraser of Carmyllie, "The Holyrood Inquiry, Final Report," September 8, 2004, www.holyroodinquiry.org/FINAL_report/report.htm.
did not become more reliant on it : Brent Flyvbjerg, Mette K. Skamris Holm, and Søren L. Buhl, “How (In)accurate Are Demand Forecasts in Public Works Projects?” Journal of the American Planning Association 71 (2005): 131–46.
并没有变得更加依赖它 。Brent Flyvbjerg, Mette K. Skamris Holm, and Søren L. Buhl, "公共工程项目的需求预测如何(不)准确?" 美国规划协会杂志》 71(2005)。131-46.
survey of American homeowners : “2002 Cost vs. Value Report,” Remodeling , November 20, 2002.
对美国房主的调查 。"2002年成本与价值报告",《 改造》 ,2002年11月20日。
completion times : Brent Flyvbjerg, “From Nobel Prize to Project Management: Getting Risks Right,” Project Management Journal 37 (2006): 5–15.
完成时间 。Brent Flyvbjerg,"从诺贝尔奖到项目管理。让风险正确," 项目管理杂志 37(2006)。5-15.
sunk-cost fallacy : Hal R. Arkes and Catherine Blumer, “The Psychology of Sunk Cost,” Organizational Behavior and Human Decision Processes 35 (1985): 124–40. Hal R. Arkes and Peter Ayton, “The Sunk Cost and Concorde Effects: Are Humans Less Rational Than Lower Animals?” Psychological Bulletin 125 (1998): 591–600.
沉没成本谬论 :Hal R. Arkes和Catherine Blumer,"沉没成本的心理学", 组织 行为和人类决策过程 35(1985)。124-40.Hal R. Arkes 和 Peter Ayton, "沉没成本和协和效应。人类是否比低等动物更不理性?"《 心理学通报》 125(1998)。591-600.
24: THE ENGINE OF CAPITALISM
24:资本主义的引擎
you already feel fortunate : Miriam A. Mosing et al., “Genetic and Environmental Influences on Optimism and Its Relationship to Mental and Self-Rated Health: A Study of Aging Twins,” Behavior Genetics 39 (2009): 597–604. David Snowdon, Aging with Grace: What the Nun Study Teaches Us About Leading Longer, Healthier, and More Meaningful Lives (New York: Bantam Books, 2001).
你已经感到很幸运 了。Miriam A. Mosing等人,"乐观主义的遗传和环境影响及其与心理和自我评价健康的关系。 A Study of Aging Twins," Behavior Genetics 39 (2009):597-604.大卫-斯诺登,《 优雅地老去》。Nun研究告诉我们如何过上更长久、更健康和更有意义的生活 (纽约:Bantam Books,2001)。
bright side of everything : Elaine Fox, Anna Ridgewell, and Chris Ashwin, “Looking on the Bright Side: Biased Attention and the Human Serotonin Transporter Gene,” Proceedings of the Royal Society B 276 (2009): 1747–51.
一切的光明面 :Elaine Fox, Anna Ridgewell, and Chris Ashwin, "Looking on the Bright Side:有偏见的注意力和人类血清素转运体基因," 皇家学会B刊 276 (2009)。1747-51.
“triumph of hope over experience” : Manju Puri and David T. Robinson, “Optimism and Economic Choice,” Journal of Financial Economics 86 (2007): 71–99.
"希望对经验的胜利"。 Manju Puri和David T. Robinson,"乐观主义和经济选择", 《金融经济学杂志》 86(2007)。71-99.
more sanguine than midlevel managers : Lowell W. Busenitz and Jay B. Barney, “Differences Between Entrepreneurs and Managers in Large Organizations: Biases and Heuristics in Strategic Decision-Making,” Journal of Business Venturing 12 (1997): 9–30.
比中层管理人员更乐观 。Lowell W. Busenitz和Jay B. Barney,"大型组织中的企业家和经理之间的差异。战略决策中的偏见和启发式方法", 期刊 of Business Venturing 12 (1997):9-30.
admiration of others : Entrepreneurs who have failed are sustained in their confidence by the probably mistaken belief that they have learned a great deal from the experience. Gavin Cassar and Justin Craig, “An Investigation of Hindsight Bias in Nascent Venture Activity,” Journal of Business Venturing 24 (2009): 149–64.
别人的钦佩 。失败的创业者由于可能错误地认为他们已经从经验中吸取了大量的教训而保持自信。Gavin Cassar和Justin Craig,"对新兴创业活动中事后诸葛亮的调查", 《商业冒险杂志 》24(2009)。149-64.
influence on the lives of others : Keith M. Hmieleski and Robert A. Baron, “Entrepreneurs’ Optimism and New Venture Performance: A Social Cognitive Perspective,” Academy of Management Journal 52 (2009): 473–88. Matthew L. A. Hayward, Dean A. Shepherd, and Dale Griffin, “A Hubris Theory of Entrepreneurship,” Management Science 52 (2006): 160–72.
对他人生活的影响 。 Keith M. Hmieleski和Robert A. Baron,"企业家的乐观情绪和新企业的表现。社会认知视角",《 管理学院学报》 52(2009)。473-88.Matthew L. A. Hayward, Dean A. Shepherd, and Dale Griffin, "A Hubris Theory of Entrepreneurship, " Management Science 52 (2006):160-72.
chance of failing was zero : Arnold C. Cooper, Carolyn Y. Woo, and William C. Dunkelberg, “Entrepreneurs’ Perceived Chances for Success,” Journal of Business Venturing 3 (1988): 97–108.
失败的机会是零 。Arnold C. Cooper, Carolyn Y. Woo, and William C. Dunkelberg, "Entrepreneurs' Perceived Chances for Success, " Journal of Business Venturing 3 (1988):97-108.
given the lowest grade : Thomas Åstebro and Samir Elhedhli, “The Effectiveness of Simple Decision Heuristics: Forecasting Commercial Success for Early-Stage Ventures,” Management Science 52 (2006): 395–409.
给予最低分 :Thomas Åstebro和Samir Elhedhli,"简单决策启发式方法的有效性。预测早期风险投资的商业成功," 管理科学 52(2006)。395-409.
widespread, stubborn, and costly : Thomas Åstebro, “The Return to Independent Invention: Evidence of Unrealistic Optimism, Risk Seeking or Skewness Loving?” Economic Journal 113 (2003): 226–39.
广泛的、顽固的、昂贵的 :Thomas Åstebro,"独立发明的回归。证据 :不现实的乐观主义、寻求风险还是喜欢偏斜?" 经济杂志 113(2003)。226-39.
bet small amounts of money : Eleanor F. Williams and Thomas Gilovich, “Do People Really Believe They Are Above Average?” Journal of Experimental Social Psychology 44 (2008): 1121–28.
赌少量的钱 。Eleanor F. Williams和Thomas Gilovich,"人们真的相信他们高于平均水平吗?" 实验社会心理学杂志 》44(2008)。1121-28.
“hubris hypothesis” : Richard Roll, “The Hubris Hypothesis of Corporate Takeovers,” Journal of Business 59 (1986): 197–216, part 1. This remarkable early article presented a behavioral analysis of mergers and acquisitions that abandoned the assumption of rationality, long before such analyses became popular.
"傲慢的假说"。 理查德-罗尔,"公司收购的傲慢假说",《 商业杂志 》59(1986)。197-216, 第一部分。这篇引人注目的早期文章提出了对并购的行为分析,放弃了理性的假设,比这种分析流行的时间还早。
“value-destroying mergers” : Ulrike Malmendier and Geoffrey Tate, “Who Makes Acquisitions? CEO Overconfidence and the Market’s Reaction,” Journal of Financial Economics 89 (2008): 20–43.
"破坏价值的兼并"。 Ulrike Malmendier和Geoffrey Tate,"谁在进行收购?CEO的过度自信和市场的反应》, 《金融经济学杂志 》89(2008)。20-43.
“engage in earnings management” : Ulrike Malmendier and Geoffrey Tate, “Superstar CEOs,” Quarterly Journal of Economics 24 (2009), 1593–1638.
"从事收益管理 " 。Ulrike Malmendier和Geoffrey Tate,"超级明星CEO",《 经济学季刊》 24(2009),1593-1638。
self-aggrandizement to a cognitive bias : Paul D. Windschitl, Jason P. Rose, Michael T. Stalkfleet, and Andrew R. Smith, “Are People Excessive or Judicious in Their Egocentrism? A Modeling Approach to Understanding Bias and Accuracy in People’s Optimism,” Journal of Personality and Social Psychology 95 (2008): 252–73.
自我吹嘘是一种认知偏见 。Paul D. Windschitl, Jason P. Rose, Michael T. Stalkfleet, and Andrew R. Smith, "Are People Excessive or Judicious in Their Egocentrism?理解人们乐观主义中的偏见和准确性的建模方法", 《人格与社会 心理学杂志 》95(2008)。252-73.
average outcome is a loss : A form of competition neglect has also been observed in the time of day at which sellers on eBay choose to end their auctions. The easy question is: At what time is the total number of bidders the highest? Answer: around 7:00 p.m. EST. The question sellers should answer is harder: Considering how many other sellers end their auctions during peak hours, at what time will there be the most bidders looking at my auction? The answer: around noon, when the number of bidders is large relative to the number of sellers. The sellers who remember the competition and avoid prime time get higher prices. Uri Simonsohn, “eBay’s Crowded Evenings: Competition Neglect in Market Entry Decisions,” Management Science 56 (2010): 1060–73.
平均结果是亏损 。在eBay上卖家选择结束拍卖的时间上也观察到了一种被忽视的竞争形式。一个简单的问题是:在什么时间,竞标者的总数是最高的?答案是:美国东部时间晚上7点左右。卖家应该回答的问题更难。考虑到有多少其他卖家在 高峰时段结束他们的拍卖,在什么时间会有最多的竞标者关注我的拍卖?答案是:中午前后,此时竞拍者的数量相对于卖家的数量来说是很大的。那些记住竞争并避开黄金时段的卖家会得到更高的价格。Uri Simonsohn,"eBay的拥挤的晚上。市场进入决策中对竞争的忽视",《 管理科学》 56(2010)。1060-73.
“diagnosis antemortem” : Eta S. Berner and Mark L. Graber, “Overconfidence as a Cause of Diagnostic Error in Medicine,” American Journal of Medicine 121 (2008): S2–S23.
"死前诊断 " 。Eta S. Berner和Mark L. Graber,"过度自信是医学诊断错误的原因", 美国医学杂志 121(2008)。S2-S23。
“disclosing uncertainty to patients” : Pat Croskerry and Geoff Norman, “Overconfidence in Clinical Decision Making,” American Journal of Medicine 121 (2008): S24–S29.
"向病人披露不确定性"。 Pat Croskerry和Geoff Norman,"临床决策中的过度自信", 美国医学杂志 121(2008)。S24-S29。
background of risk taking : Kahneman and Lovallo, “Timid Choices and Bold Forecasts.”
承担风险的背景 。卡尼曼和洛瓦洛,"迟钝的选择和大胆的预测"。
Royal Dutch Shell : J. Edward Russo and Paul J. H. Schoemaker, “Managing Overconfidence,” Sloan Management Review 33 (1992): 7–17.
荷兰皇家壳牌公司 。J. Edward Russo和Paul J. H. Schoemaker,"管理过度自信",《 斯隆管理评论》 33(1992)。7-17.
25: BERNOULLI’S ERRORS
25: 伯努利的错误
Mathematical Psychology : Clyde H. Coombs, Robyn M. Dawes, and Amos Tversky, Mathematical Psychology: An Elementary Introduction (Englewood Cliffs, NJ: Prentice-Hall, 1970).
数学心理学 。Clyde H. Coombs, Robyn M. Dawes, and Amos Tversky, Mathematical Psychology:An Elementary Introduction (Englewood Cliffs, NJ: Prentice-Hall, 1970)。
for the rich and for the poor : This rule applies approximately to many dimensions of sensation and perception. It is known as Weber’s law, after the German physiologist Ernst Heinrich Weber, who discovered it. Fechner drew on Weber’s law to derive the logarithmic psychophysical function.
对于富人和穷人 来说都是如此。这一规则大约适用于感觉和知觉的许多 方面。它被称为韦伯定律,以发现它的德国生理学家恩斯特-海因里希-韦伯的名字命名。费希纳根据韦伯定律推导出对数心理物理学函数。
$10 million from $100 million : Bernoulli’s intuition was correct, and economists still use the log of income or wealth in many contexts. For example, when Angus Deaton plotted the average life satisfaction of residents of many countries against the GDP of these countries, he used the logarithm of GDP as a measure of income. The relationship, it turns out, is extremely close: Residents of high-GDP countries are much more satisfied with the quality of their lives than are residents of poor countries, and a doubling of income yields approximately the same increment of satisfaction in rich and poor countries alike.
1亿美元中的1000 万美元。伯努利的直觉是正确的,经济学家在许多情况下仍然使用收入或财富的对数。例如,当安格斯-迪顿(Angus Deaton)将许多国家居民的平均生活满意度 ,与这些国家的国内生产总值作对比时,他用国内生产总值的对数作为收入的衡量标准。结果发现,两者的关系极为密切。高GDP国家的居民对他们的生活质量的满意度比穷国的居民高得多,收入翻倍在富国和穷国产生的满意度增量大致相同 。
“St. Petersburg paradox” : Nicholas Bernoulli, a cousin of Daniel Bernoulli, asked a question that can be paraphrased as follows: “You are invited to a game in which you toss a coin repeatedly. You receive $2 if it shows heads, and the prize doubles with every successive toss that shows heads. The game ends when the coin first shows tails. How much would you pay for an opportunity to play that game?” People do not think the gamble is worth more than a few dollars, although its expected value is infinite—because the prize keeps growing, the expected value is $1 for each toss, to infinity. However, the utility of the prizes grows much more slowly, which explains why the gamble is not attractive.
"圣彼得堡悖论": 丹尼尔-伯努利的表弟尼古拉斯-伯努利提出了一个问题,可以解读如下。"你被邀请参加一个游戏,在游戏中你反复抛出一枚硬币。如果硬币显示为正面,你可以得到2美元,每连续抛出一次显示为正面,奖金就增加一倍。当硬币第一次显示为反面时,游戏结束。你会花多少钱 ,以获得玩这个游戏的机会?"人们认为这场赌博的价值不超过几美元,尽管它的预期价值是无限的--因为奖金一直在增长,每次抛硬币的预期价值都是1美元,直到无穷大。然而,奖品的效用增长得更慢,这解释了为什么赌博没有吸引力。
“history of one’s wealth” : Other factors contributed to the longevity of Bernoulli’s theory. One is that it is natural to formulate choices between gambles in terms of gains, or mixed gains and losses. Not many people thought about choices in which all options are bad, although we were by no means the first to observe risk seeking. Another fact that favors Bernoulli’s theory is that thinking in terms of final states of wealth and ignoring the past is often a very reasonable thing to do. Economists were traditionally concerned with rational choices, and Bernoulli’s model suited their goal.
"一个人的财富历史"。 其他因素促成了伯努利理论的 寿命。一个是在赌博中以收益,或混合 ,收益和损失来制定选择是很自然的。没有多少人想到所有选项都是坏的选择,尽管我们绝非是第一个观察到风险寻求的人。另一个有利于伯努利理论的事实是,从财富的最终状态来思考并忽略过去,往往是 ,是非常合理的事情。经济学家传统上关注的是理性选择,伯努利的模型适合他们的目标。
26: PROSPECT THEORY
26: 前景理论
subjective value of wealth : Stanley S. Stevens, “To Honor Fechner and Repeal His Law,” Science 133 (1961): 80–86. Stevens, Psychophysics .
财富的主观价值 。Stanley S. Stevens, "To Honor Fechner and Repeal His Law," Science 133 (1961):80-86.史蒂文斯, 心理物理学 。
The three principles : Writing this sentence reminded me that the graph of the value function has already been used as an emblem. Every Nobel laureate receives an individual certificate with a personalized drawing, which is presumably chosen by the committee. My illustration was a stylized rendition of figure 10 .
这 三个原则 。写这句话时提醒我,价值函数的图形已经 ,被用作徽章。每个诺贝尔奖获得者都会收到一份带有个性化图画的个人证书,估计是由委员会选择的。我的插图是 图10 的风格化演绎。
“loss aversion ratio” : The loss aversion ratio is often found to be in the range of 1.5 and 2.5: Nathan Novemsky and Daniel Kahneman, “The Boundaries of Loss Aversion,” Journal of Marketing Research 42 (2005): 119–28.
"损失厌恶率"。 损失厌恶率经常被发现在1.5和2.5之间:Nathan Novemsky和Daniel Kahneman,"损失厌恶的界限", Journal of Marketing Research 42 (2005):119-28.
emotional reaction to losses : Peter Sokol-Hessner et al., “Thinking Like a Trader Selectively Reduces Individuals’ Loss Aversion,” PNAS 106 (2009): 5035–40.
对损失的情绪反应 。Peter Sokol-Hessner等人,"像交易员一样思考,选择性地减少个人的损失厌恶", PNAS 106 (2009)。5035-40.
Rabin’s theorem : For several consecutive years, I gave a guest lecture in the introductory finance class of my colleague Burton Malkiel. I discussed the implausibility of Bernoulli’s theory each year. I noticed a distinct change in my colleague’s attitude when I first mentioned Rabin’s proof. He was now prepared to take the conclusion much more seriously than in the past. Mathematical arguments have a definitive quality that is more compelling than appeals to common sense. Economists are particularly sensitive to this advantage.
拉宾定理 。连续几年,我在我的同事伯顿-马尔基尔的金融学入门课上做客讲课。我每年都会讨论伯努利理论的不可信性。我注意到 ,当我第一次提到拉宾的证明时,我同事的态度发生了明显的变化。他现在准备比过去更认真地对待这个结论。数学论证有一种确定的品质,比对常识的诉求更有说服力。经济学家对这种优势特别敏感。
rejects that gamble : The intuition of the proof can be illustrated by an example. Suppose an individual’s wealth is W, and she rejects a gamble with equal probabilities to win $11 or lose $10. If the utility function for wealth is concave (bent down), the preference implies that the value of $1 has decreased by over 9% over an interval of $21! This is an extraordinarily steep decline and the effect increases steadily as the gambles become more extreme.
拒绝该赌博 。该证明的直觉可以通过一个 的例子来说明。假设一个人的财富是W,她拒绝一个概率相同的赌博,赢11美元或输10美元。如果财富的效用函数是凹的(向下弯曲),那么这个偏好意味着1美元的价值在21美元的区间内下降了9%以上!这是一个特别陡峭的数字。这是一个异常陡峭的下降,而且随着赌博变得更加极端,这种影响也在稳步增加。
“Even a lousy lawyer” : Matthew Rabin, “Risk Aversion and Expected-Utility Theory: A Calibration Theorem,” Econometrica 68 (2000): 1281–92. Matthew Rabin and Richard H. Thaler, “Anomalies: Risk Aversion,” Journal of Economic Perspectives 15 (2001): 219–32.
"即使是一个糟糕的律师 " 。马修-拉宾,"风险厌恶和预期效用理论。A Calibration Theorem," Econometrica 68 (2000):1281-92.马修-拉宾和理查德-H-塞勒,"反常现象。风险规避",《 经济展望杂志 》15(2001)。219-32.
economists and psychologists : Several theorists have proposed versions of regret theories that are built on the idea that people are able to anticipate how their future experiences will be affected by the options that did not materialize and/or by the choices they did not make: David E. Bell, “Regret in Decision Making Under Uncertainty,” Operations Research 30 (1982): 961–81. Graham Loomes and Robert Sugden, “Regret Theory: An Alternative to Rational Choice Under Uncertainty,” Economic Journal 92 (1982): 805–25. Barbara A. Mellers, “Choice and the Relative Pleasure of Consequences,” Psychological Bulletin 126 (2000): 910–24. Barbara A. Mellers, Alan Schwartz, and Ilana Ritov, “Emotion-Based Choice,” Journal of Experimental Psychology—General 128 (1999): 332–45. Decision makers’ choices between gambles depend on whether they expect to know the outcome of the gamble they did not choose. Ilana Ritov, “Probability of Regret: Anticipation of Uncertainty Resolution in Choice,” Organizational Behavior and Human Decision Processes 66 (1966): 228–36.
经济学家和心理学家 。一些理论家提出了各种版本的后悔理论,这些理论建立在人们能够预测他们 ,未来的经历将如何受到那些没有实现的选择和/或他们没有做出的选择的影响。David E. Bell, "Regret in Decision Making Under Uncertainty," Operations Research 30 (1982):961-81.Graham Loomes和Robert Sugden,"后悔理论。不确定条件下理性选择的替代方案》,《 经济杂志》 92(1982)。805-25.Barbara A. Mellers, "选择和后果的相对 乐趣",《 心理学通报》 126(2000)。910-24.Barbara A. Mellers, Alan Schwartz, and Ilana Ritov, "Emotion-Based Choice," Journal of Experimental Psychology-General 128 (1999):332-45.决策者在赌局之间的选择取决于他们是否期望知道他们没有选择的赌局的结果。Ilana Ritov, "后悔的概率。对不确定性的预期 ," 组织行为和人类决策过程 66(1966)。228-36.
27: THE ENDOWMENT EFFECT
27:禀赋 效果
What is missing from the figure : A theoretical analysis that assumes loss aversion predicts a pronounced kink of the indifference curve at the reference point: Amos Tversky and Daniel Kahneman, “Loss Aversion in Riskless Choice: A Reference-Dependent Model,” Quarterly Journal of Economics 106 (1991): 1039–61. Jack Knetsch observed these kinks in an experimental study: “Preferences and Nonreversibility of Indifference Curves,” Journal of Economic Behavior & Organization 17 (1992): 131–39.
图中缺少什么 。假设损失规避的理论分析预示着冷漠曲线在参考点处会出现明显的扭结。Amos Tversky 和 Daniel Kahneman, "无风险选择中的损失规避。A Reference-Dependent Model," Quarterly Journal of Economics 106 (1991):1039-61.Jack Knetsch在一项实验研究中观察到了这些结点。"Preferences and Nonreversibility of Indifference Curves," Journal of Economic Behavior & Organization 17(1992):131-39.
period of one year : Alan B. Krueger and Andreas Mueller, “Job Search and Job Finding in a Period of Mass Unemployment: Evidence from High-Frequency Longitudinal Data,” working paper, Princeton University Industrial Relations Section, January 2011.
一年的时间 。Alan B. Krueger和Andreas Mueller,"大规模失业时期的求职和找工作。Evidence from High-Frequency Longitudinal Data," working paper, Princeton University Industrial Relations Section, January 2011.
did not own the bottle : Technically, the theory allows the buying price to be slightly lower than the selling price because of what economists call an “income effect”: The buyer and the seller are not equally wealthy, because the seller has an extra bottle. However, the effect in this case is negligible since $50 is a minute fraction of the professor’s wealth. The theory would predict that this income effect would not change his willingness to pay by even a penny.
并不拥有这瓶酒 。从技术上讲,该理论允许买价略低于卖价,因为经济学家称之为 "收入效应"。买方和卖方的财富并不相等,因为卖方有一个额外的瓶子。然而,在这种情况下的影响是可以忽略不计的,因为50美元只是教授财富的一小部分 。理论预测,这种收入效应不会使他的支付意愿发生哪怕一分钱的变化。
would be puzzled by it : The economist Alan Krueger reported on a study he conducted on the occasion of taking his father to the Super Bowl: “We asked fans who had won the right to buy a pair of tickets for $325 or $400 each in a lottery whether they would have been willing to pay $3,000 a ticket if they had lost in the lottery and whether they would have sold their tickets if someone had offered them $3,000 apiece. Ninety-four percent said they would not have bought for $3,000, and ninety-two percent said they would not have sold at that price.” He concludes that “rationality was in short supply at the Super Bowl.” Alan B. Krueger, “Supply and Demand: An Economist Goes to the Super Bowl,” Milken Institute Review : A Journal of Economic Policy 3 (2001): 22–29.
会对它感到疑惑 。经济学家艾伦-克鲁格报告了他在带父亲看超级碗时进行的一项研究。 "我们问那些在彩票中赢得了以每张325美元或400美元购买一对门票的权利的球迷,如果他们在彩票中输了,他们是否愿意支付3000美元一张门票,如果有人给他们每张3000美元,他们是否会卖掉他们的门票。94%的人说他们不会以3000美元的价格购买,92%的人说他们不会以这个价格出售"。他的结论是:"在超级碗比赛中,理性是很缺乏的。"艾伦-B-克鲁格,"供应和需求。一个 经济学家去超级碗," 米尔肯研究所评论 。 A Journal of Economic Policy 3 (2001):22-29.
giving up a bottle of nice wine : Strictly speaking, loss aversion refers to the anticipated pleasure and pain, which determine choices. These anticipations could be wrong in some cases. Deborah A. Kermer et al., “Loss Aversion Is an Affective Forecasting Error,” Psychological Science 17 (2006): 649–53.
放弃 一瓶好酒 。严格来说,损失厌恶是指预期的快乐和痛苦,它决定了选择。这些预期在某些情况下可能是错误的。Deborah A. Kermer等人,"损失厌恶是一种情感预测错误",《 心理科学》 17(2006)。 649-53.
market transactions : Novemsky and Kahneman, “The Boundaries of Loss Aversion.”
市场交易 。Novemsky和Kahneman,"损失厌恶的界限"。
half of the tokens will change hands : Imagine that all the participants are ordered in a line by the redemption value assigned to them. Now randomly allocate tokens to half the individuals in the line. Half of the people in the front of the line will not have a token, and half of the people at the end of the line will own one. These people (half of the total) are expected to move by trading places with each other, so that in the end everyone in the first half of the line has a token, and no one behind them does.
一半的代币将易手 。想象一下,所有参与者都按分配给他们的兑换价值排成一列。现在将代币随机分配给队伍中的一半人。排在最前面的一半人将没有代币,而排在 ,最后的一半人将拥有一个。这些人(总数的一半)预计将通过相互交换位置来移动,所以最后队伍前半部分的每个人都有一个令牌,而后面的人都没有。
Brain recordings : Brian Knutson et al., “Neural Antecedents of the Endowment Effect,” Neuron 58 (2008): 814–22. Brian Knutson and Stephanie M. Greer, “Anticipatory Affect: Neural Correlates and Consequences for Choice,” Philosophical Transactions of the Royal Society B 363 (2008): 3771–86.
大脑记录 。布莱恩-克努森等人,"禀赋效应的神经前因",《 神经元》 58(2008)。814-22.Brian Knutson和Stephanie M. Greer, "预期情感。Neural Correlates and Consequences for Choice," Philosophical Transactions of the Royal Society B 363 (2008):3771-86.
riskless and risky decisions : A review of the price of risk, based on “international data from 16 different countries during over 100 years,” yielded an estimate of 2.3, “in striking agreement with estimates obtained in the very different methodology of laboratory experiments of individual decision-making”: Moshe Levy, “Loss Aversion and the Price of Risk,” Quantitative Finance 10 (2010): 1009–22.
无风险和有风险的决定 。根据 "100多年来16个不同国家的国际数据",对风险价格的审查得出了2.3的估计值,"与个人决策的实验室实验的非常不同的方法中得到的估计值惊人地一致"。 Moshe Levy,"损失规避和风险价格", 定量金融 10(2010)。1009-22.
effect of price increases : Miles O. Bidwel, Bruce X. Wang, and J. Douglas Zona, “An Analysis of Asymmetric Demand Response to Price Changes: The Case of Local Telephone Calls,” Journal of Regulatory Economics 8 (1995): 285–98. Bruce G. S. Hardie, Eric J. Johnson, and Peter S. Fader, “Modeling Loss Aversion and Reference Dependence Effects on Brand Choice,” Marketing Science 12 (1993): 378–94.
价格上涨的影响 。Miles O. Bidwel, Bruce X. Wang, and J. Douglas Zona, "An Analysis of Asymmetric Demand Response to Price Changes:本地电话的案例," Journal of Regulatory Economics 8 (1995):285-98.Bruce G. S. Hardie, Eric J. Johnson, and Peter S. Fader, "模拟损失厌恶 和品牌选择的参考依赖效应," Marketing Science 12 (1993):378-94.
illustrate the power of these concepts : Colin Camerer, “Three Cheers—Psychological, Theoretical, Empirical—for Loss Aversion,” Journal of Marketing Research 42 (2005): 129–33. Colin F. Camerer, “Prospect Theory in the Wild: Evidence from the Field,” in Choices, Values, and Frames , ed. Daniel Kahneman and Amos Tversky (New York: Russell Sage Foundation, 2000), 288–300.
说明了这些概念的力量 。Colin Camerer, "Three Cheers-Psychological, Theoretical, Empirical-for Loss Aversion," Journal of Marketing Research 42 (2005):129-33.Colin F. Camerer,"野外的前景理论:来自实地的证据",在《 选择、价值和框架 》中。Daniel Kahneman 和 Amos Tversky (New York: Russell Sage Foundation, 2000), 288-300.
condo apartments in Boston : David Genesove and Christopher Mayer, “Loss Aversion and Seller Behavior: Evidence from the Housing Market,” Quarterly Journal of Economics 116 (2001): 1233–60.
波士顿的共管公寓 。David Genesove 和 Christopher Mayer, "损失规避和卖方行为。来自住房市场的证据",《 经济学季刊 》116(2001)。1233-60.
effect of trading experience : John A. List, “Does Market Experience Eliminate Market Anomalies?” Quarterly Journal of Economics 118 (2003): 47–71.
交易经验的影响 。John A. List,"市场经验是否能消除市场异常?" 经济学季刊》 118(2003)。47-71.
Jack Knetsch also : Jack L. Knetsch, “The Endowment Effect and Evidence of Nonreversible Indifference Curves,” American Economic Review 79 (1989): 1277–84.
Jack Knetsch也是如此 。Jack L. Knetsch,"禀赋效应和不可逆冷漠曲线的证据",《 美国经济评论》 79(1989)。1277-84.
ongoing debate about the endowment effect : Charles R. Plott and Kathryn Zeiler, “The Willingness to Pay–Willingness to Accept Gap, the ‘Endowment Effect,’ Subject Misconceptions, and Experimental Procedures for Eliciting Valuations,” American Economic Review 95 (2005): 530–45. Charles Plott, a leading experimental economist, has been very skeptical of the endowment effect and has attempted to show that it is not a “fundamental aspect of human preference” but rather an outcome of inferior technique. Plott and Zeiler believe that participants who show the endowment effect are under some misconception about what their true values are, and they modified the procedures of the original experiments to eliminate the misconceptions. They devised an elaborate training procedure in which the participants experienced the roles of both buyers and sellers, and were explicitly taught to assess their true values. As expected, the endowment effect disappeared. Plott and Zeiler view their method as an important improvement of technique. Psychologists would consider the method severely deficient, because it communicates to the participants a message of what the experimenters consider appropriate behavior, which happens to coincide with the experimenters’ theory. Plott and Zeiler’s favored version of Knetsch’s exchange experiment is similarly biased: It does not allow the owner of the good to have physical possession of it, which is crucial to the effect. See Charles R. Plott and Kathryn Zeiler, “Exchange Asymmetries Incorrectly Interpreted as Evidence of Endowment Effect Theory and Prospect Theory?” American Economic Review 97 (2007): 1449–66. There may be an impasse here, where each side rejects the methods required by the other.
正在进行的关于禀赋效应的辩论 。Charles R. Plott 和 Kathryn Zeiler, "支付意愿与接受意愿的差距,'禀赋效应',主体误解,以及激发评价的实验程序," 美国 经济评论 95 (2005):530-45.领先的实验经济学家查尔斯-普洛特(Charles Plott)一直对禀赋效应持非常怀疑的态度,并试图证明它不是 "人类偏好的基本方面",而是一种低劣技术的结果。Plott和Zeiler认为,表现出禀赋效应的参与者对他们的真实价值有一些误解, ,他们修改了原始实验的程序以消除误解。他们设计了一个复杂的训练程序,让参与者体验买家和卖家的角色,并明确地教导他们评估自己的真实价值。正如预期的那样,禀赋效应消失了。Plott和Zeiler认为他们的方法是技术上的一个重要改进。心理学家 ,会认为这种方法有严重的缺陷,因为它向参与者传达了实验者认为适当行为的信息,而这恰好与实验者的理论相吻合。Plott和Zeiler对Knetsch的交换实验所青睐的版本也有类似的偏见。它不允许物品的所有者对其进行实际占有,这对效果来说是至关重要的 。见Charles R. Plott和Kathryn Zeiler,"交换不对称性不正确地被解释为禀赋效应理论和前景理论的证据?" 美国经济评论》 97(2007)。1449-66.这里可能有一个僵局,双方都拒绝对方要求的方法。
People who are poor : In their studies of decision making under poverty, Eldar Shafir, Sendhil Mullainathan, and their colleagues have observed other instances in which poverty induces economic behavior that is in some respects more realistic and more rational than that of people who are better off. The poor are more likely to respond to real outcomes than to their description. Marianne Bertrand, Sendhil Mullainathan, and Eldar Shafir, “Behavioral Economics and Marketing in Aid of Decision Making Among the Poor,” Journal of Public Policy & Marketing 25 (2006): 8–23.
穷人 :在对贫困下的决策研究中,Eldar Shafir, Sendhil Mullainathan, 和他们的同事观察到其他一些情况,在这些情况下,贫困诱发的经济行为在某些方面比富裕的人的行为更现实,更理性。穷人更有可能对真实的结果作出反应,而不是对其描述作出反应。Marianne Bertrand, Sendhil Mullainathan, and Eldar Shafir, "Behavioral Economics and Marketing in Aid of Decision Making Among the Poor," Journal of Public Policy & Marketing 25 (2006):8-23.
in the United States and in the UK : The conclusion that money spent on purchases is not experienced as a loss is more likely to be true for people who are relatively well-off. The key may be whether you are aware when you buy one good that you will not be unable to afford another good. Novemsky and Kahneman, “The Boundaries of Loss Aversion.” Ian Bateman et al., “Testing Competing Models of Loss Aversion: An Adversarial Collaboration,” Journal of Public Economics 89 (2005): 1561–80.
在美国和英国 。对于相对富裕的人来说,花在购买上的钱不被视为损失的结论更有可能是真的。关键可能在于,当你购买一种商品时,你是否意识到你将无法负担 另一种商品。Novemsky和Kahneman,"损失厌恶的边界 "。Ian Bateman等人,"测试损失规避的竞争模型。一个对抗性的合作", 《公共经济学杂志 》89(2005)。1561-80.
28: BAD EVENTS
28: BAD EVENTS
heartbeat accelerated : Paul J. Whalen et al., “Human Amygdala Responsivity to Masked Fearful Eye Whites,” Science 306 (2004): 2061. Individuals with focal lesions of the amygdala showed little or no loss aversion in their risky choices: Benedetto De Martino, Colin F. Camerer, and Ralph Adolphs, “Amygdala Damage Eliminates Monetary Loss Aversion,” PNAS 107 (2010): 3788–92.
心脏跳动加快 。Paul J. Whalen等人,"人类杏仁核对掩盖的恐惧眼白的反应", 《科学》 306(2004)。2061.杏仁核局灶性病变的人在 风险选择中很少或没有表现出损失厌恶。Benedetto De Martino, Colin F. Camerer, and Ralph Adolphs, "杏仁核损伤消除了货币损失厌恶", PNAS 107 (2010):3788-92.
bypassing the visual cortex : Joseph LeDoux, The Emotional Brain: The Mysterious Underpinnings of Emotional Life (New York: Touchstone, 1996).
绕过视觉皮层 。约瑟夫-勒杜,《 情绪化的大脑》。情感生活的神秘基础 》(纽约:Touchstone,1996)。
processed faster : Elaine Fox et al., “Facial Expressions of Emotion: Are Angry Faces Detected More Efficiently?” Cognition & Emotion 14 (2000): 61–92.
处理得更快 。Elaine Fox等人,"情绪的面部表达。愤怒的脸被更有效地检测到 ?" 认知与情感 14(2000)。61-92.
“pops out” : Christine Hansen and Ranald Hansen, “Finding the Face in the Crowd: An Anger Superiority Effect,” Journal of Personality and Social Psychology 54 (1988): 917–24.
"跳出来"。 Christine Hansen和Ranald Hansen,"在人群中寻找面孔。愤怒的优势效应", 《人格与社会心理学杂志 》54(1988)。917-24.
“acceptable/unacceptable” : Jos J. A. Van Berkum et al., “Right or Wrong? The Brain’s Fast Response to Morally Objectionable Statements,” Psychological Science 20 (2009): 1092–99.
"可接受/不可接受"。 Jos J. A. Van Berkum等人,"对还是错?大脑对道德上可反对的声明的快速反应》,《 心理科学》 20(2009)。 1092-99.
negativity dominance : Paul Rozin and Edward B. Royzman, “Negativity Bias, Negativity Dominance, and Contagion,” Personality and Social Psychology Review 5 (2001): 296–320.
消极性支配力 。Paul Rozin和Edward B. Royzman,"消极性偏见、消极性支配和传染",《 人格与社会心理学评论》 5(2001)。296-320.
resistant to disconfirmation : Roy F. Baumeister, Ellen Bratslavsky, Catrin Finkenauer, and Kathleen D. Vohs, “Bad Is Stronger Than Good,” Review of General Psychology 5 (2001): 323.
对不确认的抵抗 。Roy F. Baumeister, Ellen Bratslavsky, Catrin Finkenauer, and Kathleen D. Vohs, "Bad Is Stronger Than Good," Review of General Psychology 5 (2001):323.
biologically significant improvement : Michel Cabanac, “Pleasure: The Common Currency,” Journal of Theoretical Biology 155 (1992): 173–200.
生物学上的重大改进 。Michel Cabanac, "Pleasure: The Common Currency," Journal of Theoretical Biology 155 (1992):173-200.
not equally powerful : Chip Heath, Richard P. Larrick, and George Wu, “Goals as Reference Points,” Cognitive Psychology 38 (1999): 79–109.
并非同样强大 。Chip Heath, Richard P. Larrick, and George Wu, "Goals as Reference Points," Cognitive Psychology 38 (1999):79-109.
rain-drenched customers : Colin Camerer, Linda Babcock, George Loewenstein, and Richard Thaler, “Labor Supply of New York City Cabdrivers: One Day at a Time,” Quarterly Journal of Economics 112 (1997): 407–41. The conclusions of this research have been questioned: Henry S. Farber, “Is Tomorrow Another Day? The Labor Supply of New York Cab Drivers,” NBER Working Paper 9706, 2003. A series of studies of bicycle messengers in Zurich provides strong evidence for the effect of goals, in accord with the original study of cabdrivers: Ernst Fehr and Lorenz Goette, “Do Workers Work More if Wages Are High? Evidence from a Randomized Field Experiment,” American Economic Review 97 (2007): 298–317.
被 雨淋湿的顾客 。科林-卡梅勒、琳达-巴布科克、乔治-洛温斯坦和理查德-塞勒,"纽约市出租车司机的劳动供应。一次一天", 经济学季刊 112(1997)。407-41.这项研究的结论受到了质疑。Henry S. Farber, "Is Tomorrow Another Day?纽约出租车司机的劳动供给》,NBER工作文件9706,2003。在苏黎世,对自行车送餐员的一系列研究为目标的影响提供了有力的证据,与最初对出租车司机的研究一致。Ernst Fehr和Lorenz Goette,"如果工资高,工人会更多工作吗?来自随机现场实验的证据," 美国经济评论 97(2007)。298-317.
communicate a reference point : Daniel Kahneman, “Reference Points, Anchors, Norms, and Mixed Feelings,” Organizational Behavior and Human Decision Processes 51 (1992): 296–312.
沟通一个参考点 。丹尼尔-卡尼曼,"参考点、锚、规范和混合感觉",《 组织行为和人类决策过程 》51(1992)。296-312.
“wins the contest” : John Alcock, Animal Behavior: An Evolutionary Approach (Sunderland, MA: Sinauer Associates, 2009), 278–84, cited by Eyal Zamir, “Law and Psychology: The Crucial Role of Reference Points and Loss Aversion,” working paper, Hebrew University, 2011.
"赢得比赛"。 John Alcock, Animal Behavior:An Evolutionary Approach (Sunderland, MA: Sinauer Associates, 2009), 278-84, 转引自Eyal Zamir, "法律与心理学。参考点和损失厌恶的关键作用,"工作文件,希伯来大学,2011年。
merchants, employers, and landlords : Daniel Kahneman, Jack L. Knetsch, and Richard H. Thaler, “Fairness as a Constraint on Profit Seeking: Entitlements in the Market,” The American Economic Review 76 (1986): 728–41.
商人、雇主和房东 。Daniel Kahneman, Jack L. Knetsch, and Richard H. Thaler, "Fairness as a Constraint on Profit Seeking:市场中的权利," 《美国经济评论》 76(1986)。728-41.
fairness concerns are economically significant : Ernst Fehr, Lorenz Goette, and Christian Zehnder, “A Behavioral Account of the Labor Market: The Role of Fairness Concerns,” Annual Review of Economics 1 (2009): 355–84. Eric T. Anderson and Duncan I. Simester, “Price Stickiness and Customer Antagonism,” Quarterly Journal of Economics 125 (2010): 729–65.
公平问题在经济上是有意义的 。Ernst Fehr, Lorenz Goette, and Christian Zehnder, "A Behavioral Account of the Labor Market:公平关注的作用",《 经济学年度评论 》1(2009)。355-84.Eric T. Anderson和Duncan I. Simester, "Price Stickiness and Customer Antagonism," Quarterly Journal of Economics 125(2010)。729-65.
altruistic punishment is accompanied : Dominique de Quervain et al., “The Neural Basis of Altruistic Punishment,” Science 305 (2004): 1254–58.
伴随着利他主义惩罚 。Dominique de Quervain等人," Altruistic Punishment的神经基础", 科学 305(2004)。1254-58.
actual losses and foregone gains : David Cohen and Jack L. Knetsch, “Judicial Choice and Disparities Between Measures of Economic Value,” Osgoode Hall Law Review 30 (1992): 737–70. Russell Korobkin, “The Endowment Effect and Legal Analysis,” Northwestern University Law Review 97 (2003): 1227–93.
实际损失和放弃的收益 。David Cohen 和 Jack L. Knetsch, "Judicial Choice and Disparities Between Measures of Economic Value," Osgoode Hall Law Review 30 (1992):737-70.Russell Korobkin, "捐赠效应和法律分析" 西北大学法律评论 97 (2003):1227-93.
asymmetrical effects on individual well-being : Zamir, “Law and Psychology.”
对个人幸福感的不对称影响 。Zamir,"法律与心理学"。
29: THE FOURFOLD PATTERN
29:四重模式
and other disasters : Including exposure to a “Dutch book,” which is a set of gambles that your incorrect preferences commit you to accept and is guaranteed to end up in a loss.
和其他灾难 。包括接触 "荷兰书",这是你不正确的偏好使你承诺接受的一组赌博,并保证以损失告终。
puzzle that Allais constructed : Readers who are familiar with the Allais paradoxes will recognize that this version is new. It is both much simpler and actually a stronger violation than the original paradox. The left-hand option is preferred in the first problem. The second problem is obtained by adding a more valuable prospect to the left than to the right, but the right-hand option is now preferred.
阿莱构建的难题 。熟悉阿莱悖论的读者会认识到,这个版本是新的。它既简单得多,实际上又比原来的悖论有更强的违反性 。在第一个问题中,左边的选项是首选。第二个问题是通过在左边增加一个比右边更有价值的前景得到的,但现在右边的选项更受欢迎。
sorely disappointed : As the distinguished economist Kenneth Arrow recently described the event, the participants in the meeting paid little attention to what he called “Allais’s little experiment.” Personal conversation, March 16, 2011.
感到 非常失望 。正如杰出的经济学家肯尼思-阿罗最近描述的那样,参加会议的人很少注意 ,他称之为 "Allais的小实验"。个人谈话,2011年3月16日。
estimates for gains : The table shows decision weights for gains. Estimates for losses were very similar.
效益的估计 。该表显示了收益的决策权重。对损失的估计非常相似。
estimated from choices : Ming Hsu, Ian Krajbich, Chen Zhao, and Colin F. Camerer, “Neural Response to Reward Anticipation under Risk Is Nonlinear in Probabilities,” Journal of Neuroscience 29 (2009): 2231–37.
从选择中估计 。Ming Hsu, Ian Krajbich, Chen Zhao, and Colin F. Camerer, "Neural Response to Reward Anticipation under Risk Is Nonlinear in Probabilities, " Journal of Neuroscience 29 (2009):2231-37.
parents of small children : W. Kip Viscusi, Wesley A. Magat, and Joel Huber, “An Investigation of the Rationality of Consumer Valuations of Multiple Health Risks,” RAND Journal of Economics 18 (1987): 465–79.
小孩子的父母 。W. Kip Viscusi, Wesley A. Magat, and Joel Huber, "An Investigation of the Rationality of Consumer Valuations of Multiple Health Risks, " RAND Journal of Economics 18 (1987):465-79.
psychology of worry : In a rational model with diminishing marginal utility, people should pay at least two-thirds as much to reduce the frequency of accidents from 15 to 5 units as they are willing to pay to eliminate the risk. Observed preferences violated this prediction.
忧虑心理 :在一个边际效用递减的理性模型中,人们为将事故频率从15个单位减少到5个单位所付出的代价,至少应该是他们为消除风险所愿意付出的三分之二 。观察到的偏好违反了这一预测。
not made much of it : C. Arthur Williams, “Attitudes Toward Speculative Risks as an Indicator of Attitudes Toward Pure Risks,” Journal of Risk and Insurance 33 (1966): 577–86. Howard Raiffa, Decision Analysis: Introductory Lectures on Choices under Uncertainty (Reading, MA: Addison-Wesley, 1968).
并没有对其进行过多的 阐述。C. Arthur Williams, "对投机性风险的态度是对纯粹风险的态度的指标", 《风险与保险杂志》 33(1966)。577-86.霍华德-拉伊法,《 决策分析》。 决策分析: 关于不确定情况下的选择的介绍性讲座 (Reading, MA: Addison-Wesley, 1968)。
shadow of civil trials : Chris Guthrie, “Prospect Theory, Risk Preference, and the Law,” Northwestern University Law Review 97 (2003): 1115–63. Jeffrey J. Rachlinski, “Gains, Losses and the Psychology of Litigation,” Southern California Law Review 70 (1996): 113–85. Samuel R. Gross and Kent D. Syverud, “Getting to No: A Study of Settlement Negotiations and the Selection of Cases for Trial,” Michigan Law Review 90 (1991): 319–93.
民事审判的阴影 :Chris Guthrie,"前景理论、风险偏好和法律",《 西北大学法律评论》 97(2003)。1115-63.Jeffrey J. Rachlinski, "Gains, Losses and the Psychology of Litigation, " Southern California Law Review 70 (1996):113-85.Samuel R. Gross 和 Kent D. Syverud, "Getting to No: A Study of Settlement Negotiations and the Selection of Cases for Trial," Michigan Law Review 90 (1991): 319-93.
the frivolous claim : Chris Guthrie, “Framing Frivolous Litigation: A Psychological Theory,” University of Chicago Law Review 67 (2000): 163–216.
无意义 的索赔 :Chris Guthrie, "Framing Frivolous Litigation:A Psychological Theory," University of Chicago Law Review 67 (2000):163-216.
30: RARE EVENTS
30:罕见的 事件
wish to avoid it : George F. Loewenstein, Elke U. Weber, Christopher K. Hsee, and Ned Welch, “Risk as Feelings,” Psychological Bulletin 127 (2001): 267–86.
希望避免它 。George F. Loewenstein, Elke U. Weber, Christopher K. Hsee, and Ned Welch, "Risk as Feelings, " Psychological Bulletin 127 (2001):267-86.
vividness in decision making : Ibid. Cass R. Sunstein, “Probability Neglect: Emotions, Worst Cases, and Law,” Yale Law Journal 112 (2002): 61–107. See notes to chapter 13 : Damasio, Descartes’ Error . Slovic, Finucane, Peters, and MacGregor, “The Affect Heuristic.”
决策中的生动性 。同上。Cass R. Sunstein, "Probability Neglect: 情感、最坏的情况和法律," 耶鲁大学法律杂志 112(2002)。61-107.见 第13章 的注释:达马西奥, 笛卡尔的错误 。Slovic, Finucane, Peters, and MacGregor, "The Affect Heuristic".
Amos’s student : Craig R. Fox, “Strength of Evidence, Judged Probability, and Choice Under Uncertainty,” Cognitive Psychology 38 (1999): 167–89.
阿莫斯的学生 。克雷格-R-福克斯,"证据的强度、判断的概率和不确定性下的选择", 认知心理学 38(1999)。167-89.
focal event and its : Judgments of the probabilities of an event and its complement do not always add up to 100%. When people are asked about a topic they know very little about (“What is your probability that the temperature in Bangkok will exceed 100° tomorrow at noon?”), the judged probabilities of the event and its complement add up to less than 100%.
焦点事件和它的 。对一个事件 及其补充的概率的判断不一定加起来是100%。当人们被问及一个他们知之甚少的话题时("你认为明天中午曼谷的温度超过100°的概率是多少?"),事件的判断概率和它的补充部分加起来不到100%。
receiving a dozen roses : In cumulative prospect theory, decision weights for gains and losses are not assumed to be equal, as they were in the original version of prospect theory that I describe.
收到一打玫瑰花 。在累积前景理论中,收益和损失的决策权重不 ,因为它们在我描述的前景理论的原始版本中是相等的。
superficial processing : The question about the two urns was invented by Dale T. Miller, William Turnbull, and Cathy McFarland, “When a Coincidence Is Suspicious: The Role of Mental Simulation,” Journal of Personality and Social Psychology 57 (1989): 581–89. Seymour Epstein and his colleagues argued for an interpretation of it in terms of two systems: Lee A. Kirkpatrick and Seymour Epstein, “Cognitive-Experiential Self-Theory and Subjective Probability: Evidence for Two Conceptual Systems,” Journal of Personality and Social Psychology 63 (1992): 534–44.
肤浅的处理 。关于两个骨灰盒的问题是由戴尔-T-米勒、威廉-特恩布尔和凯西-麦克法兰发明的,"当一个巧合是可疑的:心理模拟的作用", 《人格与社会心理学杂志 》57(1989)。581-89.西摩-爱泼斯坦和他的同事主张 ,从两个系统的角度对其进行解释。Lee A. Kirkpatrick和Seymour Epstein,"认知-体验自我理论和主观概率。两个概念系统的证据", 《人格与社会心理学杂志 》63(1992)。534-44.
judged it as more dangerous : Kimihiko Yamagishi, “When a 12.86% Mortality Is More Dangerous Than 24.14%: Implications for Risk Communication,” Applied Cognitive Psychology 11 (1997): 495–506.
判定为更危险 。Kimihiko Yamagishi,"当12.86%的死亡率比24.14%更危险时。风险交流的含义", 应用认知心理学 11(1997)。495-506.
forensic psychologists : Slovic, Monahan, and MacGregor, “Violence Risk Assessment and Risk Communication.”
法医心理学家 。Slovic、Monahan和MacGregor,"暴力风险评估和风险交流"。
“1 of 1,000 capital cases” : Jonathan J. Koehler, “When Are People Persuaded by DNA Match Statistics?” Law and Human Behavior 25 (2001): 493–513.
"1000个死刑案件中的1个"。 乔纳森-J-科勒,"人们何时被DNA匹配统计数字所说服?" Law and Human Behavior 25 (2001):493-513.
studies of choice from experience : Ralph Hertwig, Greg Barron, Elke U. Weber, and Ido Erev, “Decisions from Experience and the Effect of Rare Events in Risky Choice,” Psychological Science 15 (2004): 534–39. Ralph Hertwig and Ido Erev, “The Description-Experience Gap in Risky Choice,” Trends in Cognitive Sciences 13 (2009): 517–23.
对 来自经验的选择 的 研究 。Ralph Hertwig, Greg Barron, Elke U. Weber, and Ido Erev, "Decisions from Experience and the Effect of Rare Events in Risky Choice, " Psychological Science 15 (2004):534-39.Ralph Hertwig和Ido Erev, "The Description-Experience Gap in Risky Choice," Trends in Cognitive Sciences 13 (2009):517-23.
not yet settled : Liat Hadar and Craig R. Fox, “Information Asymmetry in Decision from Description Versus Decision from Experience,” Judgment and Decision Making 4 (2009): 317–25.
还没有解决 。Liat Hadar 和 Craig R. Fox, "Information Asymmetry in Decision from Description Versus Decision from Experience," Judgment and Decision Making 4 (2009):317-25.
“chances of rare events” : Hertwig and Erev, “The Description-Experience Gap.”
"罕见事件的几率"。 Hertwig和Erev,"描述与经验的差距"。
31: RISK POLICIES
31: 风险政策
inferior option BC : The calculation is straightforward. Each of the two combinations consists of a sure thing and a gamble. Add the sure thing to both options of the gamble and you will find AD and BC.
劣质选项BC :计算很简单。这两个组合中的每一个都由一个确定的东西和一个赌博组成。将确定的事情与赌博的两个选项相加,你会发现AD和BC。
the equivalent of “locking in” : Thomas Langer and Martin Weber, “Myopic Prospect Theory vs. Myopic Loss Aversion: How General Is the Phenomenon?” Journal of Economic Behavior & Organization 56 (2005): 25–38.
这 相当于 "锁定"。 托马斯-朗格和马丁-韦伯,"近视 前景理论与近视损失厌恶。这种现象有多普遍?" 经济行为与组织杂志 》56(2005)。25-38.
32: KEEPING SCORE
32:记分
drive into a blizzard : The intuition was confirmed in a field experiment in which a random selection of students who purchased season tickets to the university theater received their tickets at a much reduced price. A follow-up of attendance revealed that students who had paid the full price for their tickets were more likely to attend, especially during the first half of the season. Missing a show one has paid for involves the unpleasant experience of closing an account in the red. Arkes and Blumer, “The Psychology of Sunk Costs.”
开车进入暴风雪 中。 ,这一直觉在一项实地实验中得到了证实,随机选择购买大学剧院季票的学生,以大大降低的价格获得了他们的门票。对出场率的跟踪调查 ,结果显示,付了全价票的学生更有可能参加演出,尤其是在演出季的前半段。错过一场已付款的演出,就会有关闭账户的不愉快经历。Arkes和Blumer,"沉没成本的心理学"。
the disposition effect : Hersh Shefrin and Meir Statman, “The Disposition to Sell Winners Too Early and Ride Losers Too Long: Theory and Evidence,” Journal of Finance 40 (1985): 777–90. Terrance Odean, “Are Investors Reluctant to Realize Their Losses?” Journal of Finance 53 (1998): 1775–98.
处置效应 :Hersh Shefrin和Meir Statman,"过早卖出赢家的处置 ,过早驾驭输家:理论和证据",《 金融杂志 》40(1985)。777-90.Terrance Odean, "投资者不愿意意识到他们的损失吗?" 金融杂志 》53(1998)。1775-98.
less susceptible : Ravi Dhar and Ning Zhu, “Up Close and Personal: Investor Sophistication and the Disposition Effect,” Management Science 52 (2006): 726–40.
不太容易受到影响 。Ravi Dhar 和 Ning Zhu, "Up Close and Personal: Investor Sophistication and the Disposition Effect," Management Science 52 (2006):726-40.
fallacy can be overcome : Darrin R. Lehman, Richard O. Lempert, and Richard E. Nisbett, “The Effects of Graduate Training on Reasoning: Formal Discipline and Thinking about Everyday-Life Events,” American Psychologist 43 (1988): 431–42.
谬误是可以被克服的 。Darrin R. Lehman, Richard O. Lempert, and Richard E. Nisbett, "The Effects of Graduate Training on Reasoning:正规的学科和对日常生活事件的思考," 美国心理学家 43(1988)。431-42.
“a sinking feeling” : Marcel Zeelenberg and Rik Pieters, “A Theory of Regret Regulation 1.0,” Journal of Consumer Psychology 17 (2007): 3–18.
"一种沉沦的感觉"。 Marcel Zeelenberg和Rik Pieters, "A Theory of Regret Regulation 1.0," Journal of Consumer Psychology 17 (2007):3-18.
regret to normality : Kahneman and Miller, “Norm Theory.”
对常态的遗憾 。卡尼曼和米勒,"规范理论"。
habitually taking unreasonable risks : The hitchhiker question was inspired by a famous example discussed by the legal philosophers Hart and Honoré: “A woman married to a man who suffers from an ulcerated condition of the stomach might identify eating parsnips as the cause of his indigestion. The doctor might identify the ulcerated condition as the cause and the meal as a mere occasion.” Unusual events call for causal explanations and also evoke counterfactual thoughts, and the two are closely related. The same event can be compared to either a personal norm or the norm of other people, leading to different counterfactuals, different causal attributions, and different emotions (regret or blame): Herbert L. A. Hart and Tony Honoré, Causation in the Law (New York: Oxford University Press, 1985), 33.
习惯性地承担不合理的风险 。顺风车问题的灵感来自于法律哲学家哈特和奥诺雷讨论的一个著名的例子:"一个女人嫁给了一个患有胃溃疡的男人,她可能把吃防风草作为他消化不良的原因。医生可能认为溃疡是原因,而这顿饭仅仅是个机会。不寻常的事件要求 ,进行因果解释,也唤起反事实的思考,这两者是密切相关的。同一事件可以与个人的规范或其他人的规范相比较,导致不同的反事实,不同的因果归属,以及不同的情绪(后悔或责备)。Herbert L. A. Hart和Tony Honoré,《 法律中的因果关系 》(纽约:牛津大学出版社,1985),33。
remarkably uniform : Daniel Kahneman and Amos Tversky, “The Simulation Heuristic,” in Judgment Under Uncertainty : Heuristics and Biases , ed. Daniel Kahneman, Paul Slovic, and Amos Tversky (New York: Cambridge University Press, 1982), 160–73.
显著的统一 。Daniel Kahneman和Amos Tversky, "The Simulation Heuristic," in Judgment Under Uncertainty : Heuristics and Biases ,ed.Daniel Kahneman, Paul Slovic, and Amos Tversky (New York: Cambridge University Press, 1982), 160-73.
applies to blame : Janet Landman, “Regret and Elation Following Action and Inaction: Affective Responses to Positive Versus Negative Outcomes,” Personality and Social Psychology Bulletin 13 (1987): 524–36. Faith Gleicher et al., “The Role of Counterfactual Thinking in Judgment of Affect,” Personality and Social Psychology Bulletin 16 (1990): 284–95.
适用于指责 。Janet Landman,"行动和不行动之后的后悔和兴奋。对积极与消极结果的情感反应," 人格 和社会心理学公报 13(1987)。524-36.Faith Gleicher等人,"反事实思维在情感判断中的作用",《 人格与社会心理学公报》 16(1990)。284-95.
actions that deviate from the default : Dale T. Miller and Brian R. Taylor, “Counterfactual Thought, Regret, and Superstition: How to Avoid Kicking Yourself,” in What Might Have Been: The Social Psychology of Counterfactual Thinking , ed. Neal J. Roese and James M. Olson (Hillsdale, NJ: Erlbaum, 1995), 305–31.
偏离默认的行动 。Dale T. Miller和Brian R. Taylor,"反事实思维、后悔和迷信。如何避免踢自己",载于 《可能的事》。反事实思维的社会心理学 ,编辑:Neal J. Roese和James R. Taylor。Neal J. Roese和James M. Olson(Hillsdale, NJ: Erlbaum, 1995),305-31。
produce blame and regret : Marcel Zeelenberg, Kees van den Bos, Eric van Dijk, and Rik Pieters, “The Inaction Effect in the Psychology of Regret,” Journal of Personality and Social Psychology 82 (2002): 314–27.
产生责备和后悔 。Marcel Zeelenberg, Kees van den Bos, Eric van Dijk, and Rik Pieters, "The Inaction Effect in the Psychology of Regret," Journal of Personality and Social Psychology 82 (2002):314-27.
brand names over generics : Itamar Simonson, “The Influence of Anticipating Regret and Responsibility on Purchase Decisions,” Journal of Consumer Research 19 (1992): 105–18.
品牌名称比仿制药更 重要。Itamar Simonson,"预期后悔的影响 和责任对购买决定的影响",《 消费者研究杂志》 19(1992)。105-18.
clean up their portfolios : Lilian Ng and Qinghai Wang, “Institutional Trading and the Turn-of-the-Year Effect,” Journal of Financial Economics 74 (2004): 343–66.
清理他们的投资组合 。Lilian Ng和Qinghai Wang,"机构交易和年终效应",《 金融经济学杂志》 74(2004)。343-66.
loss averse for aspects of your life : Tversky and Kahneman, “Loss Aversion in Riskless Choice.” Eric J. Johnson, Simon Gächter, and Andreas Herrmann, “Exploring the Nature of Loss Aversion,” Centre for Decision Research and Experimental Economics, University of Nottingham, Discussion Paper Series , 2006. Edward J. McCaffery, Daniel Kahneman, and Matthew L. Spitzer, “Framing the Jury: Cognitive Perspectives on Pain and Suffering,” Virginia Law Review 81 (1995): 1341–420.
厌恶损失的生活的各个方面 。Tversky和Kahneman,"无风险选择中的损失规避"。Eric J. Johnson, Simon Gächter, and Andreas Herrmann, "探索损失规避的本质", 诺丁汉大学决策研究和实验经济学中心,讨论文件系列 ,2006。Edward J. McCaffery, Daniel Kahneman, and Matthew L. Spitzer, "Framing the Jury:疼痛和痛苦的认知视角," 弗吉尼亚法律评论 81(1995)。1341-420.
classic on consumer behavior : Richard H. Thaler, “Toward a Positive Theory of Consumer Choice,” Journal of Economic Behavior and Organization 39 (1980): 36–90.
关于消费者行为的经典 。Richard H. Thaler, "Toward a Positive Theory of Consumer Choice," Journal of Economic Behavior and Organization 39 (1980):36-90.
taboo tradeoff : Philip E. Tetlock et al., “The Psychology of the Unthinkable: Taboo TradeOffs, Forbidden Base Rates, and Heretical Counterfactuals,” Journal of Personality and Social Psychology 78 (2000): 853–70.
禁忌权衡 :Philip E. Tetlock等人,"不可想象的心理学。禁忌权衡、禁忌基准率和异端反事实", 《人格与社会心理学杂志 》78(2000)。853-70.
where the precautionary principle : Cass R. Sunstein, The Laws of Fear: Beyond the Precautionary Principle (New York: Cambridge University Press, 2005).
其中预防原则 。Cass R. Sunstein, The Laws of Fear: Beyond the Precautionary Principle (New York: Cambridge University Press, 2005).
“psychological immune system” : Daniel T. Gilbert et al., “Looking Forward to Looking Backward: The Misprediction of Regret,” Psychological Science 15 (2004): 346–50.
"心理免疫系统"。 Daniel T. Gilbert等人,"向前看向后看。后悔的预测错误",《 心理科学》 15(2004)。346-50.
33: REVERSALS
33: 逆转
in the man’s regular store : Dale T. Miller and Cathy McFarland, “Counterfactual Thinking and Victim Compensation: A Test of Norm Theory,” Personality and Social Psychology Bulletin 12 (1986): 513–19.
在该男子的正规商店里 。Dale T. Miller和Cathy McFarland,"反事实思维和受害者赔偿。A Test of Norm Theory," Personality and Social Psychology Bulletin 12 (1986):513-19.
reversals of judgment and choice : The first step toward the current interpretation was taken by Max H. Bazerman, George F. Loewenstein, and Sally B. White, “Reversals of Preference in Allocation Decisions: Judging Alternatives Versus Judging Among Alternatives,” Administrative Science Quarterly 37 (1992): 220–40. Christopher Hsee introduced the terminology of joint and separate evaluation, and formulated the important evaluability hypothesis, which explains reversals by the idea that some attributes become evaluable only in joint evaluation: “Attribute Evaluability: Its Implications for Joint-Separate Evaluation Reversals and Beyond,” in Kahneman and Tversky, Choices, Values, and Frames .
判断和选择的逆转 :Max H. Bazerman、George F. Loewenstein和Sally B. White在《分配决策中的偏好逆转》中迈出了当前解释的第一步。判断替代品与判断替代品之间的关系",《 行政科学季刊》 37(1992)。220-40.Christopher Hsee介绍了联合评价和单独评价的术语 ,并提出了重要的可评价性假说,该假说通过一些属性只有在联合评价中才可评价的想法来解释反转。"属性可评价性。其对联合-分离评价逆转的影响及其他",见Kahneman和Tversky,《 选择、价值和框架》 。
conversation between psychologists and economists : Sarah Lichtenstein and Paul Slovic, “Reversals of Preference Between Bids and Choices in Gambling Decisions,” Journal of Experimental Psychology 89 (1971): 46–55. A similar result was obtained independently by Harold R. Lindman, “Inconsistent Preferences Among Gambles,” Journal of Experimental Psychology 89 (1971): 390–97.
心理学家和经济学家之间的对话 。Sarah Lichtenstein和Paul Slovic, "Reversals of Preference Between Bids and Choices in Gambling Decisions," Journal of Experimental Psychology 89 (1971):46-55.哈罗德-R-林德曼(Harold R. Lindman)也独立获得了类似的结果,"Inconsistent Preferences Among Gambles," Journal of Experimental Psychology 89 (1971):390-97.
bewildered participant : For a transcript of the famous interview, see Sarah Lichtenstein and Paul Slovic, eds., The Construction of Preference (New York: Cambridge University Press, 2006).
茫然的参与者 。关于这次著名的采访记录,见Sarah Lichtenstein和Paul Slovic, eds., The Construction of Preference (纽约:剑桥大学出版社,2006)。
the prestigious American Economic Review : David M. Grether and Charles R. Plott, “Economic Theory of Choice and the Preference Reversals Phenomenon,” American Economic Review 69 (1979): 623–28.
著名的 《 美国经济评论》 。David M. Grether和Charles R. Plott,"选择的经济理论和偏好逆转现象",《 美国经济评论》 69(1979)。623-28.
“context in which the choices are made” : Lichtenstein and Slovic, The Construction of Preference , 96.
"做出选择的背景"。 Lichtenstein and Slovic, The Construction of Preference , 96.
one embarrassing finding : Kuhn famously argued that the same is true of physical sciences as well: Thomas S. Kuhn, “The Function of Measurement in Modern Physical Science,” Isis 52 (1961): 161–93.
一个尴尬的发现 。库恩著名的论点是,物理科学也是如此。托马斯-S-库恩,"现代物理科学中测量的功能", Isis 52(1961)。161-93.
liking of dolphins : There is evidence that questions about the emotional appeal of species and the willingness to contribute to their protection yield the same rankings: Daniel Kahneman and Ilana Ritov, “Determinants of Stated Willingness to Pay for Public Goods: A Study in the Headline Method,” Journal of Risk and Uncertainty 9 (1994): 5–38.
对海豚的喜欢 。有 ,有证据表明,关于物种的情感吸引力的问题和为保护物种做出贡献的意愿会产生相同的排名。Daniel Kahneman和Ilana Ritov,"为公共物品支付意愿的决定因素。A Study in the Headline Method," Journal of Risk and Uncertainty 9 (1994):5-38.
superior on this attribute : Hsee, “Attribute Evaluability.”
在这个属性上的优势 。Hsee,"属性的可评价性"。
“requisite record-keeping” : Cass R. Sunstein, Daniel Kahneman, David Schkade, and Ilana Ritov, “Predictably Incoherent Judgments,” Stanford Law Review 54 (2002): 1190.
"必要的记录保存"。 Cass R. Sunstein, Daniel Kahneman, David Schkade, and Ilana Ritov, "Predictably Incoherent Judgments," Stanford Law Review 54 (2002):1190.
34: FRAMES AND REALITY
34:框架与现实
unjustified influences of formulation : Amos Tversky and Daniel Kahneman, “The Framing of Decisions and the Psychology of Choice,” Science 211 (1981): 453–58.
不合理的影响的表述 。Amos Tversky和Daniel Kahneman, "The Framing of Decisions and the Psychology of Choice," Science 211 (1981):453-58.
paid with cash or on credit : Thaler, “Toward a Positive Theory of Consumer Choice.”
用现金或赊账支付 :塞勒,"迈向消费者选择的积极理论"。
10% mortality is frightening : Barbara McNeil, Stephen G. Pauker, Harold C. Sox Jr., and Amos Tversky, “On the Elicitation of Preferences for Alternative Therapies,” New England Journal of Medicine 306 (1982): 1259–62.
10%的死亡率是令人恐惧的 。Barbara McNeil, Stephen G. Pauker, Harold C. Sox Jr., and Amos Tversky, "On the Elicitation of Preferences for Alternative Therapies, " New England Journal of Medicine 306 (1982):1259-62.
“Asian disease problem” : Some people have commented that the “Asian” label is unnecessary and pejorative. We probably would not use it today, but the example was written in the 1970s, when sensitivity to group labels was less developed than it is today. The word was added to make the example more concrete by reminding respondents of the Asian flu epidemic of 1957.
"亚洲疾病问题"。 有些人评论说,"亚洲人 "的标签是不必要的,是贬义的。我们今天可能不会使用它,但是这个例子是在20世纪70年代写的,当时对群体标签的敏感性还没有今天这么发达。添加这个词是为了使这个例子更加具体,让受访者想起1957年的亚洲流感疫情, 。
Choice and Consequence : Thomas Schelling, Choice and Consequence (Cambridge, MA: Harvard University Press, 1985).
选择与后果 。Thomas Schelling, Choice and Consequence (Cambridge, MA: Harvard University Press, 1985).
misleading frame : Richard P. Larrick and Jack B. Soll, “The MPG Illusion,” Science 320 (2008): 1593–94.
误导性框架 。Richard P. Larrick和Jack B. Soll,"MPG幻觉", 科学 320(2008)。1593-94.
rate of organ donation in European countries : Eric J. Johnson and Daniel Goldstein, “Do Defaults Save Lives?” Science 302 (2003): 1338–39.
欧洲国家的器官捐赠率 。埃里克-J-约翰逊和丹尼尔-戈尔茨坦,"违约金是否能拯救生命?" 科学 》302(2003)。1338-39.
35: TWO SELVES
35: 两个自我
“wantability” : Irving Fisher, “Is ‘Utility’ the Most Suitable Term for the Concept It Is Used to Denote?” American Economic Review 8 (1918): 335.
"可望性"。 欧文-费雪,"'效用'是它用来表示的概念的最合适的术语吗?" 美国经济评论》 8(1918)。335.
at any moment : Francis Edgeworth, Mathematical Psychics (New York: Kelley, 1881).
在任何时候 。弗朗西斯-埃奇沃斯,《 数学通灵学》 (纽约:凯利,1881)。
under which his theory holds : Daniel Kahneman, Peter P. Wakker, and Rakesh Sarin, “Back to Bentham? Explorations of Experienced Utility,” Quarterly Journal of Economics 112 (1997): 375–405. Daniel Kahneman, “Experienced Utility and Objective Happiness: A Moment-Based Approach” and “Evaluation by Moments: Past and Future,” in Kahneman and Tversky, Choices, Values, and Frames , 673–92, 693–708.
在这种情况下,他的理论是成立的 。Daniel Kahneman, Peter P. Wakker, and Rakesh Sarin, "Back to Bentham?经验效用的探索》,《 经济学季刊》 112 (1997):375-405.丹尼尔-卡尼曼,"经验效用和客观幸福感。基于时刻的方法 "和 "通过时刻进行评价。过去和未来,"卡尼曼和特维斯基, 选择、价值和框架 ,673-92,693-708。
a physician and researcher : Donald A. Redelmeier and Daniel Kahneman, “Patients’ Memories of Painful Medical Treatments: Real-time and Retrospective Evaluations of Two Minimally Invasive Procedures,” Pain 66 (1996): 3–8.
一个医生和研究人员 。Donald A. Redelmeier和Daniel Kahneman,"病人对痛苦医疗的记忆。两种微创手术的实时和回顾性评价 ," Pain 66 (1996):3-8.
free to choose : Daniel Kahneman, Barbara L. Frederickson, Charles A. Schreiber, and Donald A. Redelmeier, “When More Pain Is Preferred to Less: Adding a Better End,” Psychological Science 4 (1993): 401–405.
自由选择 。Daniel Kahneman, Barbara L. Frederickson, Charles A. Schreiber, and Donald A. Redelmeier, " When More Pain Is Preferred to Less:增加一个更好的结局," 《心理科学》 4(1993)。401-405.
duration of the shock : Orval H. Mowrer and L. N. Solomon, “Contiguity vs. Drive-Reduction in Conditioned Fear: The Proximity and Abruptness of Drive Reduction,” American Journal of Psychology 67 (1954): 15–25.
冲击的持续时间 。Orval H. Mowrer和L. N. Solomon, "Contiguity vs. Drive-Reduction in Conditioned Fear: The Proximity and Abruptness of Drive Reduction," American Journal of Psychology 67 (1954):15-25.
burst of stimulation : Peter Shizgal, “On the Neural Computation of Utility: Implications from Studies of Brain Stimulation Reward,” in Well-Being: The Foundations of Hedonic Psychology , ed. Daniel Kahneman, Edward Diener, and Norbert Schwarz (New York: Russell Sage Foundation, 1999), 500–24.
刺激的迸发 。Peter Shizgal,"关于效用的神经计算。暗示 从脑刺激奖励的研究,"在 幸福。 幸福: 享乐心理学的基础 》,丹尼尔-卡尼曼、爱德华-迪纳编辑。Daniel Kahneman, Edward Diener, and Norbert Schwarz (New York: Russell Sage Foundation, 1999), 500-24.
36: LIFE AS A STORY
36:生活是一个故事
had a lover : Paul Rozin and Jennifer Stellar, “Posthumous Events Affect Rated Quality and Happiness of Lives,” Judgment and Decision Making 4 (2009): 273–79.
有一个情人 。Paul Rozin和Jennifer Stellar,"遗体事件影响生命的额定质量和幸福," 判断和决策 4(2009)。273-79.
entire lives as well as brief episodes : Ed Diener, Derrick Wirtz, and Shigehiro Oishi, “End Effects of Rated Life Quality: The James Dean Effect,” Psychological Science 12 (2001): 124–28. The same series of experiments also tested for the peak-end rule in an unhappy life and found similar results: Jen was not judged twice as unhappy if she lived miserably for 60 years rather than 30, but she was regarded as considerably happier if 5 mildly miserable years were added just before her death.
整个生命以及短暂的插曲 。Ed Diener, Derrick Wirtz, and Shigehiro Oishi, "End Effects of Rated Life Quality:詹姆斯-迪恩效应",《 心理科学》 12(2001)。124-28.同一系列的实验还测试了不快乐的生活中的峰端规则 ,发现了类似的结果。如果珍悲惨地活了60年而不是30年,她不会被判定为加倍不快乐,但如果在她死前增加5个轻微悲惨的年份,她会被视为相当快乐。
37: EXPERIENCED WELL-BEING
37:经历过的幸福感
life as a whole these days : Another question that has been used frequently is, “Taken all together, how would you say things are these days? Would you say that you are very happy, pretty happy, or not too happy?” This question is included in the General Social Survey in the United States, and its correlations with other variables suggest a mix of satisfaction and experienced happiness. A pure measure of life evaluation used in the Gallup surveys is the Cantril Self-Anchoring Striving Scale, in which the respondent rates his or her current life on a ladder scale in which 0 is “the worst possible life for you” and 10 is “the best possible life for you.” The language suggests that people should anchor on what they consider possible for them, but the evidence shows that people all over the world have a common standard for what a good life is, which accounts for the extraordinarily high correlation ( r = .84) between the GDP of countries and the average ladder score of their citizens. Angus Deaton, “Income, Health, and Well-Being Around the World: Evidence from the Gallup World Poll,” Journal of Economic Perspectives 22 (2008): 53–72.
这些天来,生活作为一个整体 。另一个经常使用的问题是:"综合来看,你认为这些天的情况如何 ?你说你是非常高兴,相当高兴,还是不太高兴?"这个问题包括在美国的一般社会调查中,它与其他变量的相关性表明了满意度和体验到的幸福的混合。盖洛普调查中使用的一个纯粹的生活评价措施是坎特里尔自我调整奋斗量表,受访者在梯形量表上对他的 或她目前的生活进行评价,其中0是 "对你来说可能是最差的生活",10是 "对你来说可能是最好的生活。"这种语言表明,人们应该以他们认为可能的生活为基础,但证据表明,世界各地的人们对什么是好的生活有一个共同的标准,这说明各国的 GDP和其公民的平均阶梯得分之间有极高的相关性 (r = .84)。安格斯-迪顿,"世界范围内的收入、健康和幸福感。来自盖洛普世界民意调查的证据", 《经济展望杂志》 22(2008)。53-72.
“a dream team” : The economist was Alan Krueger of Princeton, noted for his innovative analyses of unusual data. The psychologists were David Schkade, who had methodological expertise; Arthur Stone, an expert on health psychology, experience sampling, and ecological momentary assessment; Norbert Schwarz, a social psychologist who was also an expert on survey method and had contributed experimental critiques of well-being research, including the experiment on which a dime left on a copying machine influenced subsequent reports of life satisfaction.
"一个梦之队"。 经济学家是普林斯顿大学的艾伦-克鲁格,他因对不寻常数据的创新分析而闻名。心理学家是David Schkade,他拥有方法论方面的专业知识; Arthur Stone,健康心理学、经验抽样和生态瞬间评估方面的专家;Norbert Schwarz,社会心理学家,也是调查方法方面的专家,对幸福感研究做出了实验性批评,包括在实验中,留在复印机上的一角钱影响了随后的生活满意度报告。
intensity of various feelings : In some applications, the individual also provides physiological information, such as continuous recordings of heart rate, occasional records of blood pressure, or samples of saliva for chemical analysis. The method is called Ecological Momentary Assessment: Arthur A. Stone, Saul S. Shiffman, and Marten W. DeVries, “Ecological Momentary Assessment Well-Being: The Foundations of Hedonic Psychology,” in Kahneman, Diener, and Schwarz, Well-Being , 26–39.
各种感觉的强度 。在一些应用中,个人还提供生理信息,如连续记录心率,偶尔记录血压,或提供唾液样本进行化学分析。这种方法被称为生态瞬间评估。Arthur A. Stone, Saul S. Shiffman, and Marten W. DeVries, "Ecological Momentary Assessment Well-Being:Hedonic心理学的基础," in Kahneman, Diener, and Schwarz, Well-Being , 26-39.
spend their time : Daniel Kahneman et al., “A Survey Method for Characterizing Daily Life Experience: The Day Reconstruction Method,” Science 306 (2004): 1776–80. Daniel Kahneman and Alan B. Krueger, “Developments in the Measurement of Subjective Well-Being,” Journal of Economic Perspectives 20 (2006): 3–24.
花费他们的时间 。丹尼尔-卡尼曼等人,"描述日常生活经验的调查方法。日子重建法", 科学 306(2004)。1776-80.丹尼尔-卡尼曼和艾伦-B-克鲁格,"主观幸福感测量的发展",《 经济展望》杂志 20(2006)。3-24.
physiological indications of emotion : Previous research had documented that people are able to “relive” feelings they had in a past situation when the situation is retrieved in sufficiently vivid detail. Michael D. Robinson and Gerald L. Clore, “Belief and Feeling: Evidence for an Accessibility Model of Emotional Self-Report,” Psychological Bulletin 128 (2002): 934–60.
情感的生理迹象 。以前的 研究记录显示,当人们以足够生动的细节回顾过去的情况时,能够 "重温 "他们在过去情况下的感受。迈克尔-D-罗宾逊和杰拉尔德-L-克罗尔,"信仰和感觉。情感自我报告的可及性模型的证据",《 心理学通报》 128(2002)。934-60.
state the U-index : Alan B. Krueger, ed., Measuring the Subjective Well-Being of Nations: National Accounts of Time Use and Well-Being (Chicago: University of Chicago Press, 2009).
状态的U指数 。Alan B. Krueger, ed., Measuring the Subjective Well-Being of Nations:时间使用和福祉的国民账户 (芝加哥:芝加哥大学出版社,2009)。
distribution of emotional pain : Ed Diener, “Most People Are Happy,” Psychological Science 7 (1996): 181–85.
情感痛苦的分布 。埃德-迪纳,"大多数人是快乐的",《 心理科学》 7(1996)。181-85.
Gallup World Poll : For a number of years I have been one of several Senior Scientists associated with the efforts of the Gallup Organization in the domain of well-being.
盖洛普世界民意调查 :若干年来,我一直是与盖洛普组织在福祉领域的努力有关的几位高级科学家之一。
more than 450,000 responses : Daniel Kahneman and Angus Deaton, “High Income Improves Evaluation of Life but Not Emotional Well-Being,” Proceedings of the National Academy of Sciences 107 (2010): 16489–93.
超过450,000份答复 。丹尼尔-卡尼曼和安格斯-迪顿,"高收入能提高对生活的评价,但不能提高情感的幸福感", 《美国国家科学院院刊》 107(2010)。16489-93.
worse for the very poor : Dylan M. Smith, Kenneth M. Langa, Mohammed U. Kabeto, and Peter Ubel, “Health, Wealth, and Happiness: Financial Resources Buffer Subjective Well-Being After the Onset of a Disability,” Psychological Science 16 (2005): 663–66.
对于非常贫穷的人来说,情况更糟 。Dylan M. Smith, Kenneth M. Langa, Mohammed U. Kabeto, and Peter Ubel, "Health, Wealth, and Happiness:财务资源在残疾发生后缓冲了主观幸福感," 心理学 Science 16 (2005):663-66.
$75,000 in high-cost areas : In a TED talk I presented in February 2010 I mentioned a preliminary estimate of $60,000, which was later corrected.
在高成本地区为75,000美元 。在我于2010年2月发表的TED演讲中,我提到初步估计为6万美元,后来被纠正了。
eat a bar of chocolate! : Jordi Quoidbach, Elizabeth W. Dunn, K. V. Petrides, and Moïra Mikolajczak, “Money Giveth, Money Taketh Away: The Dual Effect of Wealth on Happiness,” Psychological Science 21 (2010): 759–63.
吃块巧克力吧! : Jordi Quoidbach, Elizabeth W. Dunn, K. V. Petrides, and Moïra Mikolajczak, "金钱给予,金钱带走。财富对幸福的双重影响》,《 心理科学》 21(2010)。759-63.
38: THINKING ABOUT LIFE
38:对生活的思考
German Socio-Economic Panel : Andrew E. Clark, Ed Diener, and Yannis Georgellis, “Lags and Leads in Life Satisfaction: A Test of the Baseline Hypothesis.” Paper presented at the German Socio-Economic Panel Conference, Berlin, Germany, 2001.
德国社会经济小组 。Andrew E. Clark, Ed Diener, and Yannis Georgellis, "生活满意度的滞后和领先。对基线假说的测试"。提交给德国社会经济小组会议的论文,德国柏林,2001年。
affective forecasting : Daniel T. Gilbert and Timothy D. Wilson, “Why the Brain Talks to Itself: Sources of Error in Emotional Prediction,” Philosophical Transactions of the Royal Society B 364 (2009): 1335–41.
情感预测 。Daniel T. Gilbert和Timothy D. Wilson,"为什么大脑会自言自语。情感预测中的错误来源," 哲学 皇家学会B的交易 364(2009)。1335-41.
only significant fact in their life : Strack, Martin, and Schwarz, “Priming and Communication.”
他们生活中唯一重要的事实 。施特拉克、马丁和施瓦茨,"激发和交流"。
questionnaire on life satisfaction : The original study was reported by Norbert Schwarz in his doctoral thesis (in German) “Mood as Information: On the Impact of Moods on the Evaluation of One’s Life” (Heidelberg: Springer Verlag, 1987). It has been described in many places, notably Norbert Schwarz and Fritz Strack, “Reports of Subjective Well-Being: Judgmental Processes and Their Methodological Implications,” in Kahneman, Diener, and Schwarz, Well-Being , 61–84.
关于生活满意度的调查问卷 。最初的研究是由诺伯特-施瓦茨在他的博士论文(德语)《作为信息的情绪。论情绪对一个人的生活评价的影响"(海德堡:施普林格出版社,1987年)。 ,在很多地方都有描述,特别是诺伯特-施瓦茨和弗里茨-施特拉克,"主观幸福感的报告。判断过程及其方法论意义",载于Kahneman, Diener, and Schwarz, Well-Being , 61-84.
goals that young people set : The study was described in William G. Bowen and Derek Curtis Bok, The Shape of the River : Long-Term Consequences of Considering Race in College and University Admissions (Princeton: Princeton University Press, 1998). Some of Bowen and Bok’s findings were reported by Carol Nickerson, Norbert Schwarz, and Ed Diener, “Financial Aspirations, Financial Success, and Overall Life Satisfaction: Who? and How?” Journal of Happiness Studies 8 (2007): 467–515.
年轻人设定的目标 。这项研究在William G. Bowen和Derek Curtis Bok, The Shape of the River :在 大学 和大学录取中考虑种族的长期后果 (普林斯顿:普林斯顿大学出版社,1998)。Carol Nickerson,Norbert Schwarz和Ed Diener报告了Bowen和Bok的一些发现,"财务愿望,财务成功,和总体生活满意度。谁?以及如何?" 幸福研究杂志 》8(2007)。467-515.
“being very well-off financially” : Alexander Astin, M. R. King, and G. T. Richardson, “The American Freshman: National Norms for Fall 1976,” Cooperative Institutional Research Program of the American Council on Education and the University of California at Los Angeles, Graduate School of Education, Laboratory for Research in Higher Education, 1976.
"在经济上非常富裕"。 Alexander Astin, M. R. King, and G. T. Richardson, "The American Freshman: National Norms for Fall 1976," Cooperative Institutional Research Program of American Council on Education and University of California at Los Angeles, Graduate School of Education, Laboratory for Research in Higher Education, 1976.
money was not important : These results were presented in a talk at the American Economic Association annual meeting in 2004. Daniel Kahneman, “Puzzles of Well-Being,” paper presented at the meeting.
金钱并不重要 :这些结果在2004年美国经济学会年会上的演讲中被提出。丹尼尔 卡尼曼,"福祉之谜",在会议上发表的论文。
happiness of Californians : The question of how well people today can forecast the feelings of their descendants a hundred years from now is clearly relevant to the policy response to climate change, but it can be studied only indirectly, which is what we proposed to do.
加利福尼亚人的幸福 。今天的人们能在多大程度上预测他们一百年后的后代的感受,这个问题显然与应对气候变化的政策有关,但它只能被间接地研究,而这正是我们提议做的。
aspects of their lives : In posing the question, I was guilty of a confusion that I now try to avoid: Happiness and life satisfaction are not synonymous. Life satisfaction refers to your thoughts and feelings when you think about your life, which happens occasionally—including in surveys of well-being. Happiness describes the feelings people have as they live their normal life.
他们生活的各个方面 。在提出这个问题时,我犯了一个错误, ,现在我试图避免这种混淆。幸福感和生活满意度并不是同义词。生活满意度指的是当你思考你的生活时的想法和感受,这偶尔会发生--包括在幸福感的调查中。幸福感描述的是人们在正常生活中的感受。
I had won the family argument : However, my wife has never conceded. She claims that only residents of Northern California are happier.
我赢得了这场家庭争论 。然而,我的妻子一直没有认输。她声称 ,只有北加州的居民才更幸福。
students in California and in the Midwest : Asian students generally reported lower satisfaction with their lives, and Asian students made up a much larger proportion of the samples in California than in the Midwest. Allowing for this difference, life satisfaction in the two regions was identical.
加州和中西部的学生 。亚裔学生对自己生活的满意度普遍较低,而且亚裔学生在加州的样本中所占的比例比中西部大得多。考虑到这一差异,两个地区的生活满意度是相同的。
How much pleasure do you get from your car? : Jing Xu and Norbert Schwarz have found that the quality of the car (as measured by Blue Book value) predicts the owners’ answer to a general question about their enjoyment of the car, and also predicts people’s pleasure during joyrides. But the quality of the car has no effect on people’s mood during normal commutes. Norbert Schwarz, Daniel Kahneman, and Jing Xu, “Global and Episodic Reports of Hedonic Experience,” in R. Belli, D. Alwin, and F. Stafford (eds.), Using Calendar and Diary Methods in Life Events Research (Newbury Park, CA: Sage), pp. 157–74.
:徐静和Norbert Schwarz发现,汽车的质量(以蓝皮书的价值衡量)可以预测车主对一个关于他们对汽车的享受的一般问题的回答,也可以预测人们在兜风时的快乐 。 但汽车的质量对人们在正常通勤时的心情没有影响。诺伯特-施瓦茨、丹尼尔-卡尼曼和徐静,"Hedonic体验的全局和偶发报告 ",见R.贝利、D.阿尔温和F.斯塔福德(编),《 在生活事件研究中使用日历和日记方法 》(Newbury Park, CA: Sage),第157-74页。
paraplegics spend in a bad mood? : The study is described in more detail in Kahneman, “Evaluation by Moments.”
瘫痪者花在坏心情上的时间? :这项研究在卡尼曼的 "瞬间评价 "中有更详细的描述。
think about their situation : Camille Wortman and Roxane C. Silver, “Coping with Irrevocable Loss, Cataclysms, Crises, and Catastrophes: Psychology in Action,” American Psychological Association, Master Lecture Series 6 (1987): 189–235.
思考他们的处境 。Camille Wortman和Roxane C. Silver,"应对不可挽回的损失、灾难、危机、 和大灾难。行动中的心理学",美国心理学会,大师讲座系列6(1987)。189-235.
studies of colostomy patients : Dylan Smith et al., “Misremembering Colostomies? Former Patients Give Lower Utility Ratings than Do Current Patients,” Health Psychology 25 (2006): 688–95. George Loewenstein and Peter A. Ubel, “Hedonic Adaptation and the Role of Decision and Experience Utility in Public Policy,” Journal of Public Economics 92 (2008): 1795–1810.
对结肠造口术患者的研究 。Dylan Smith等人,"误记结肠造口?以前的病人比现在的病人给出更低的效用评价", 健康心理学 25(2006)。688-95.George Loewenstein和Peter A. Ubel, "Hedonic适应和公共政策中决策和经验的作用 Utility," Journal of Public Economics 92 (2008):1795-1810.
the word miswanting : Daniel Gilbert and Timothy D. Wilson, “Miswanting: Some Problems in Affective Forecasting,” in Feeling and Thinking: The Role of Affect in Social Cognition , ed. Joseph P. Forgas (New York: Cambridge University Press, 2000), 178–97.
the word miswanting :Daniel Gilbert和Timothy D. Wilson,"Miswanting。情感预测中的一些问题》,载于 《感觉与思考》。情感在社会认知中的作用 》,Joseph P. Forgas编辑。Joseph P. Forgas (New York: Cambridge University Press, 2000), 178-97.
CONCLUSIONS
结论
too important to be ignored : Paul Dolan and Daniel Kahneman, “Interpretations of Utility and Their Implications for the Valuation of Health,” Economic Journal 118 (2008): 215–234. Loewenstein and Ubel, “Hedonic Adaptation and the Role of Decision and Experience Utility in Public Policy.”
太重要了,不能被忽视 。Paul Dolan 和 Daniel Kahneman, "对效用的解释及其对健康估值的影响",《 经济杂志》 118 (2008)。215-234.Loewenstein和Ubel,"Hedonic适应以及公共政策中的决策和经验效用的作用"。
guide government policies : Progress has been especially rapid in the UK, where the use of measures of well-being is now official government policy. These advances were due in good part to the influence of Lord Richard Layard’s book Happiness: Lessons from a New Science , first published in 2005. Layard is among the prominent economists and social scientists who have been drawn into the study of well-being and its implications. Other important sources are: Derek Bok, The Politics of Happiness: What Government Can Learn from the New Research on Well-Being (Princeton: Princeton University Press, 2010). Ed Diener, Richard Lucus, Ulrich Schmimmack, and John F. Helliwell, Well-Being for Public Policy (New York: Oxford University Press, 2009). Alan B. Krueger, ed., Measuring the Subjective Well-Being of Nations: National Account of Time Use and Well-Being (Chicago: University of Chicago Press, 2009). Joseph E. Stiglitz, Amartya Sen, and Jean-Paul Fitoussi, Report of the Commission on the Measurement of Economic Performance and Social Progress . Paul Dolan, Richard Layard, and Robert Metcalfe, Measuring Subjective Well-being for Public Policy: Recommendations on Measures (London: Office for National Statistics, 2011).
指导政府政策 。在英国, ,进展尤其迅速,在那里,使用幸福感的衡量标准现在是政府的官方政策。这些进展 ,很大程度上是由于理查德-莱亚德勋爵的《 幸福 》一书的影响 。来自新科学的教训 》一书的影响,该书于2005年首次出版。莱亚德是被吸引到幸福感及其影响研究中的著名经济学家和社会科学家之一。其他重要的资料来源有。德里克-博克,《 幸福的政治》。What Government Can Learn from the New Research on Well-Being (Princeton: Princeton University Press, 2010).Ed Diener, Richard Lucus, Ulrich Schmimmack, and John F. Helliwell, Well-Being for Public Policy (New York: Oxford University Press, 2009).Alan B. Krueger, ed., Measuring the Subjective Well-Being of Nations:时间使用和福祉的国民账户 》(芝加哥:芝加哥大学出版社,2009年)。Joseph E. Stiglitz, Amartya Sen, and Jean-Paul Fitoussi, Report of the Commission on the Measurement of Economic Performance and Social Progress .Paul Dolan, Richard Layard, and Robert Metcalfe, 《 为公共政策衡量主观幸福感》。关于措施的建议 (伦敦:国家统计局,2011年)。
Irrational is a strong word : The view of the mind that Dan Ariely has presented in Predictably Irrational: The Hidden Forces That Shape Our Decisions (New York: Harper, 2008) is not much different from mine, but we differ in our use of the term.
非理性 是一个强烈的词 :丹-阿里利(Dan Ariely)在《 可预测的非理性》(Predictably Irrational )一书中提出的心灵观点 。塑造我们 决定的隐藏力量 》(纽约:哈珀,2008年)中提出的心智观点与我的没有太大区别,但我们在使用该术语时有所不同。
accept future addiction : Gary S. Becker and Kevin M. Murphy, “A Theory of Rational Addiction,” Journal of Political Economics 96 (1988): 675–700. Nudge: Richard H. Thaler and Cass R. Sunstein, Nudge: Improving Decisions About Health, Wealth, and Happiness (New Haven: Yale University Press, 2008).
接受未来的成瘾性 。加里-S-贝克尔和凯文-M-墨菲,"理性成瘾的理论", 《政治经济学杂志》 96(1988)。675-700.暗示。Richard H. Thaler和Cass R. Sunstein, Nudge : 改善关于健康、财富和幸福的决定 》(纽黑文:耶鲁大学出版社, 2008)。
can institute and enforce : Atul Gawande, The Checklist Manifesto: How to Get Things Right (New York: Holt, 2009). Daniel Kahneman, Dan Lovallo, and Oliver Sibony, “The Big Idea: Before You Make That Big Decision …” Harvard Business Review 89 (2011): 50–60.
可以制定和执行 。Atul Gawande, The Checklist Manifesto:如何把事情做对 》(纽约:霍尔特,2009年)。Daniel Kahneman, Dan Lovallo, and Oliver Sibony, "The Big Idea: Before You Make That Big Decision ..." 哈佛商业评论》 89(2011)。50-60.
distinctive vocabulary : Chip Heath, Richard P. Larrick, and Joshua Klayman, “Cognitive Repairs: How Organizational Practices Can Compensate for Individual Shortcomings,” Research in Organizational Behavior 20 (1998): 1–37.
独特的词汇 。Chip Heath, Richard P. Larrick, and Joshua Klayman, "Cognitive Repairs: How Organizational Practices Can Compensate for Individual Shortcomings," Research in Organizational Behavior 20 (1998):1-37.
Acknowledgments
鸣谢
I am fortunate to have many friends and no shame about asking for help. Every one of my friends has been approached, some of them many times, with requests for information or editorial suggestions. I apologize for not listing them all. A few individuals played a major role in making the book happen. My thanks go first to Jason Zweig, who urged me into the project and patiently tried to work with me until it became clear to both of us that I am impossible to work with. Throughout, he has been generous with his editorial advice and enviable erudition, and sentences that he suggested dot the book. Roger Lewin turned transcripts of a set of lectures into chapter drafts. Mary Himmelstein provided valuable assistance throughout. John Brockman began as an agent and became a trusted friend. Ran Hassin provided advice and encouragement when it was most needed. In the final stages of a long journey I had the indispensable help of Eric Chinski, my editor at Farrar, Straus and Giroux. He knew the book better than I did and the work became an enjoyable collaboration—I had not imagined that an editor could do as much as Eric did. My daughter, Lenore Shoham, rallied round to help me through the hectic final months, providing wisdom, a sharp critical eye, and many of the sentences in the “Speaking of” sections. My wife, Anne Treisman, went through a lot and did a lot—I would have given up long ago without her steady support, wisdom, and endless patience.
我很幸运,有很多朋友,而且对寻求帮助不感到羞耻。我的每一位朋友都曾被要求提供信息或编辑建议,其中有些人还被要求多次。我很抱歉没有把他们全部列出。有几个人在促成这本书的过程中发挥了重要作用。我首先要感谢杰森-茨威格,他敦促我加入这个项目,并耐心地 ,试图与我合作,直到我们俩都清楚,我是不可能合作的。自始至终,他都慷慨地提供了他的编辑建议和令人羡慕的博学,以及他建议的句子来点缀这本书。罗杰-莱文把一套讲座的记录稿变成了章节草稿。玛丽-希梅尔斯坦自始至终提供了宝贵的帮助。约翰-布罗克曼开始是作为代理人,后来成为一个值得信赖的 的朋友。兰-哈辛在最需要的时候提供了建议和鼓励。在漫长旅程的最后阶段,我得到了法拉尔、斯特劳斯和吉鲁公司的编辑埃里克-钦斯基不可或缺的帮助。他比我更了解这本书,这项工作成为一种愉快的合作,我没有想到一个编辑能像埃里克那样做得那么多。我的女儿莱诺尔-肖姆(Lenore Shoham)团结一致, ,帮助我度过了紧张的最后几个月,提供了智慧、敏锐的批评眼光,以及 "说到 "部分的许多句子。我的妻子安妮-特里斯曼经历了很多,也做了很多,如果没有她稳定的支持、智慧和无尽的耐心,我早就放弃了。
THE BEGINNING
开始的时候
Let the conversation begin …
让对话开始吧......。
Follow the Penguin Twitter.com@penguinUKbooks
关注企鹅出版社 Twitter.com@penguinUKbooks
Keep up-to-date with all our stories YouTube.com/penguinbooks
保持最新的 ,了解我们所有的故事 YouTube.com/penguinbooks
Pin ‘Penguin Books’ to your Pinterest
将 "企鹅图书 "钉在你的 Pinterest 上
Like ‘Penguin Books’ on Facebook.com/penguinbooks
在 Facebook.com/penguinbooks 上喜欢 "企鹅图书"。
Listen to Penguin at SoundCloud.com/penguin-books
请在 SoundCloud.com/penguin-books 上收听企鹅的 声音 。
Find out more about the author and discover more stories like this at Penguin.co.uk
了解更多关于作者的信息和 在 Penguin.co.uk 发现更多这样的故事
PENGUIN BOOKS
鹏程书院
UK | USA | Canada | Ireland | Australia
英国|美国|加拿大|爱尔兰|澳大利亚
India | New Zealand | South Africa
印度 | 新西兰 | 南非
Penguin Books is part of the Penguin Random House group of companies whose addresses can be found at global.penguinrandomhouse.com .
企鹅图书公司是企鹅兰登书屋集团的一部分,其地址可以在 global.penguinrandomhouse.com 上找到。
First published in the United States of America by Farrar, Straus and Giroux 2011
在美国由Farrar, Straus and Giroux 2011年首次出版。
First published in Great Britain by Allen Lane 2011
2011年由Allen Lane在英国首次出版
Published in Penguin Books 2012
发表于企鹅出版社2012年
Copyright © Daniel Kahneman, 2011
Copyright © Daniel Kahneman, 2011
Grateful acknowledgment is made for permission to reprint the following previously published material: “ Judgment Under Uncertainty: Heuristics and Biases ” from Science, New Series , Vol. 185, No. 4157, copyright © 1974 by Amos Tversky and Daniel Kahneman. Reprinted by permission of Science . “ Choices, Values, and Frames ” from The American Psychologist , copyright © 1983 by Daniel Kahneman and Amos Tversky. Reprinted by permission of the American Psychological Association .
在此对允许转载以下以前发表的材料表示感谢。" 不确定情况下的判断:启发式和偏见 "来自 《科学》,新系列 ,第185卷, 第4157号,版权归Amos Tversky和Daniel Kahneman所有。经 《科学》 杂志许可转载。" 选择、价值和框架 "来自 《美国心理学家》 ,版权©1983年,作者丹尼尔-卡尼曼和阿莫斯-特维斯基。经 美国心理学会 许可转载。
Grateful acknowledgment is made for permission to reprint the following images: Image here courtesy of Paul Ekman Group, LLC. Image here from “Cues of Being Watched Enhance Cooperation in a Real-World Setting” by Melissa Bateson, Daniel Nettle, and Gilbert Roberts, Biology Letters (2006); reprinted by permission of Biology Letters . Image here from Mind Sights by Roger N. Shepard (New York: W.H. Freeman and Company, 1990); reprinted by permission of Henry Holt and Company. Image here from “Human Amygdala Responsivity to Masked Fearful Eye Whites” by Paul J. Whalen et al., Science 306 (2004). Reprinted by permission of Science .
在此对允许转载以下图片表示感谢。 这里的 图片由Paul Ekman Group, LLC提供。 这里的 图片来自 "被观察的线索在现实世界的环境中增强合作",作者是Melissa Bateson, Daniel Nettle, 和Gilbert Roberts, Biology Letters (2006); Biology Letters 允许转载。 此处 的图片来自Roger N. Shepard的 " Mind Sights "(纽约:W.H. Freeman and Company,1990);经Henry Holt and Company许可转载。 这里的 图片来自于 "人类杏仁核反应性 ,以掩盖恐惧的眼睛白",作者Paul J. Whalen等人, 《科学》 306(2004)。经 《科学》 杂志许可转载。
All rights reserved
保留所有权利
ISBN: 978-0-141-91892-1
书名:978-0-141-91892-1
5: COGNITIVE EASE
5:认知上的轻松
fn1 5, 47.
fn1 5,47。
APPENDIX A: JUDGMENT UNDER UNCERTAINTY: HEURISTICS AND BIASES
附录A:不确定性下的判断:启发式方法和偏见
fn1 This article originally appeared in Science , vol. 185, 1974. The research was supported by the Advanced Research Projects Agency of the Department of Defense and was monitored by the Office of Naval Research under contract N00014-73-C-0438 to the Oregon Research Institute, Eugene. Additional support for this research was provided by the Research and Development Authority of the Hebrew University, Jerusalem, Israel.
fn1 这篇文章最初出现在 《科学》 ,第185卷,1974年。该研究得到了国防部高级研究计划局的支持,并由海军研究办公室根据合同N00014-73-C-0438对尤金的俄勒冈研究所进行监督。本研究的额外支持 ,由以色列耶路撒冷希伯来大学的研究和发展局提供。
APPENDIX B: CHOICES, VALUES, AND FRAMES
附录B:选择、价值和框架
fn1 This article was originally presented as a Distinguished Scientific Contributions Award address at the American Psychological Association meeting, August 1983. This work was supported by grant NR 197-058 from the U.S. Office of Naval Research. Originally published in American Psychologist , vol. 34, 1984.
fn1 本文最初是在1983年8月美国心理学会会议上作为杰出科学贡献奖发言。这项工作得到了美国海军办公室NR 197-058号拨款的支持 研究。原文发表于《 美国心理学家》 ,第34卷,1984年。

机器翻译 + 双语对照版极速定制
即刻上传外文书,定制读本
