Last week Rich Sutton and Andrew Barto were awarded the Turing Award for their pioneering contributions to the field of Reinforcement Learning. I covered their award in a previous post. Here I want to discuss an essay that Rich Sutton wrote pretty recently. In 2019, Rich Sutton wrote an essay titled “The Bitter Lesson”. The basic thesis in his essay is that in the field of Artificial Intelligence, techniques that take advantage of raw computing power will always work better than techniques that aim to capture the processes of the human mind in a computer.
Sutton uses examples of computers playing Chess, Go, and speech and visual recognition applications to drive home his point. In all these examples, early AI efforts tried to teach computers how humans solved these problems. For example, in Chess programs, early efforts tried to find techniques for how human grand masters played chess. Researchers in the field were dismissive of efforts that used brute force to try and find the best chess moves. Ultimately with the growth in computing power, the brute force techniques won. There was a similar pattern in how computer beat humans in Go. As Sutton puts it: “Search and learning are the two most important classes of techniques for utilizing massive amounts of computation in AI research.“
In the field of speech recognition/generation, early efforts aimed at decomposing human speech into phonoemes, etc. Those efforts had limited success and statistical methods outperformed them. Finally, deep learning techniques ended up producing the best results. Similarly in the area of computer vision, early efforts were based around trying to encode how the human mind probably made sense of the world around it: edge detection, texture detection etc. These techniques also plateaued out and finally deep learning produced the breakthroughs.
Sutton’s overall message is that the human mind is extremely complex and we should not try to engineer systems based on “how we think we think”. There is a strong appeal in trying to figure out how the mind human mind works. However all the history of AI points to the fact that statistical and raw computation power outperforms our notions of how the mind works. Sutton’s paper reminds me of Peter Norvig et. al’s paper on “The unreasonable effectiveness of data“, where they also make similar claims about the success of statistical methods over traditional AI techniques.
