May 3, 2024
Probe3D: Study examines how well AI models understand the third dimension
Posted by Dan Kummer in category: robotics/AI
A new study examines whether and how well multimodal AI models understand the 3D structure of scenes and objects.
Researchers from the University of Michigan and Google Research investigated the 3D awareness of multimodal models. The goal was to understand how well the representations learned by these models capture the 3D structure of our world.
According to the team, 3D awareness comes down to two key capabilities. First, can a model reconstruct the visible 3D surface from a single image, i.e., infer depth and surface information? Second, are its representations consistent across multiple views of the same object or scene?
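To make the second question concrete, here is a minimal sketch of one way multi-view consistency could be scored. It is not the study's evaluation code. It assumes each view has already been reduced to a set of per-point feature vectors and that ground-truth correspondences between the two views are known. The function name `correspondence_accuracy` and the synthetic data are hypothetical, chosen only for illustration.

```python
import numpy as np


def correspondence_accuracy(feats_a, feats_b, gt_match):
    """Fraction of points in view A whose nearest feature in view B
    (by cosine similarity) is the ground-truth corresponding point.

    feats_a:  (N, D) features for N points in view A
    feats_b:  (M, D) features for M points in view B
    gt_match: (N,) index into feats_b of the true match for each point in A
    """
    # L2-normalise so the dot product equals cosine similarity.
    a = feats_a / np.linalg.norm(feats_a, axis=1, keepdims=True)
    b = feats_b / np.linalg.norm(feats_b, axis=1, keepdims=True)
    sim = a @ b.T                # (N, M) similarity matrix
    pred = sim.argmax(axis=1)    # best match in view B for each point in A
    return float((pred == gt_match).mean())


# Synthetic stand-in for a perfectly view-consistent representation:
# view B holds the same features as view A, only in shuffled order.
rng = np.random.default_rng(0)
feats_a = rng.normal(size=(50, 16))
perm = rng.permutation(50)
feats_b = np.empty_like(feats_a)
feats_b[perm] = feats_a          # point i in A corresponds to point perm[i] in B

score = correspondence_accuracy(feats_a, feats_b, perm)
print(score)
```

With real model features, a score near 1 would suggest that the same physical point gets similar features from different viewpoints, while a low score would point to view-dependent representations.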