Perceptually Grounded Spatial Semantics
David Golland, Daniel Klein and Trevor Darrell
This project explores how to define the meanings of prepositions using visual data. One potential application is commanding a robot to arrange objects in a room. For example, to follow the command "Put the cup there, on the front of the table", the robot must identify the target location for the cup, which it can do only if it understands the meaning of each component of the command. The project focuses on prepositions because their meanings are perceptible in images and can be composed to form higher-level meanings.
Figure 1: Example of compositional spatial semantics
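To make the idea of composition concrete, the following is a minimal sketch of how "on the front of the table" could be built from two simpler meanings. It is not the project's method: the project grounds meanings in visual data, whereas here each preposition is a hand-written rule. The `Box` type, the `front_of` and `on` functions, the `clearance` parameter, and the coordinate convention (smaller y is nearer the viewer, z points up) are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Axis-aligned region. Assumed axes: x left-right, y front-back, z up."""
    x0: float
    x1: float
    y0: float
    y1: float
    z0: float
    z1: float

    def contains(self, x: float, y: float, z: float) -> bool:
        return (self.x0 <= x <= self.x1
                and self.y0 <= y <= self.y1
                and self.z0 <= z <= self.z1)


def front_of(landmark: Box) -> Box:
    """Hypothetical 'front of': the half of the landmark nearest the viewer."""
    mid_y = (landmark.y0 + landmark.y1) / 2
    return Box(landmark.x0, landmark.x1,
               landmark.y0, mid_y,
               landmark.z0, landmark.z1)


def on(landmark: Box, clearance: float = 0.05) -> Box:
    """Hypothetical 'on': a thin slab resting on the landmark's top surface."""
    return Box(landmark.x0, landmark.x1,
               landmark.y0, landmark.y1,
               landmark.z1, landmark.z1 + clearance)


# A table 2 m wide, 1 m deep, and 0.8 m tall (made-up dimensions).
table = Box(0.0, 2.0, 0.0, 1.0, 0.0, 0.8)

# Composition: the meaning of the whole phrase is built from its parts.
target = on(front_of(table))
```

Because each preposition maps a region to a region, the phrase's meaning is simply function composition. A point on the near half of the tabletop, such as `(1.0, 0.25, 0.82)`, falls inside `target`, while a point on the far half, such as `(1.0, 0.75, 0.82)`, does not.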