We Want Robots to See and Understand the World
Antonio Torralba, member of MIT CSAIL, opens the dialogue by describing the research he performs in the field of computer vision and related artificial intelligence (AI). He also compares the conceptual differences and the context of the early days of artificial intelligence—where hardly any image recording devices existed—with the present situation, in which an enormous amount of data is available. Next, through the use of examples, he talks about the huge complexity faced by research in computer vision to get computers and machines to understand the meanings of what they “see” in the scenes, and the objects they contain, by means of digital cameras. As he explains afterward, the challenge of this complexity for computer vision processing is particularly noticeable in settings involving robots, or driverless cars, where it makes no sense to develop vision systems that can see if they cannot learn. Later he argues why today’s computer systems have to learn “to see” because if there is no learning process, for example machine learning, they will never be able to make autonomous decisions.