Benchmarking Affordance Generalization with BusyBox

Eval & Deploy Workshop at the 9th Conference on Robot Learning (CoRL 2025) |

Robot Foundation Models (RFMs), also referred to as Vision-Language Action models (VLAs), have been attracting the attention of researchers and practitioners with a promise of generalizing robot behaviors across tasks, objects, and environments. The community has extensively studied RFMs’ generalization capabilities in the vision and language space. However, affordance generalization – RFMs’ ability to manipulate new objects with familiar physical features – remains largely unexplored. In the meantime, this meta-skill is plays a critical rule in a person’s ability to quickly figure out how to handle hitherto unseen objects. In fact, basic physical interface elements like buttons and switches are designed to look and function similarly across different devices to facilitate affordance generalization in environments inhabited by people. Whether robots can capitalize on these design aids remains unknown: researchers currently lack a benchmark for systematically studying affordance generalization in RFMs.

BusyBox is a physical 3D-printable kit for systematically evaluating how well RFMs generalize their knowledge of basic affordances (pressing buttons, flipping switches, turning knobs, etc). BusyBox can be assembled into any of a multitude of distinct objects having the same set of affordances. Paired with a carefully design protocols for experiments and data collection that we present in this work, BusyBox can provide valuable insights into RFMs’ ability to recognize and exploit ubiquitous affordance classes. In our experiments, BusyBox hightlights affordance generalization as a major improvement area for RFMs.

Related Tools

BusyBox

BusyBox is a physical 3D-printable device for benchmarking affordance generalization in robot foundation models. It features Modular design with 6 interchangeable modules (buttons, switches, sliders, wires, knob, and display) Open-source CAD files and bill of materials for easy reproduction Optional electronics and Raspberry Pi instrumentation for automated state logging Reconfigurable setups enabling systematic evaluation of generalization A language-annotated dataset of 1000+ demonstration trajectories oof BusyBox affordances Please check out our website for more details. BusyBox assembly instructions For fully building a instrumented BusyBox capable of state logging, see the BOM. First print the BusyBox following Printing Instructions with details on files to print and any details on print settings. Electronic Assembly: TODO: add instructions on how to assemble electronics with pictures Firmware Flashing: Instructions for flashing the Arduino Nano's firmware: Flashing Firmware Data Collection Guide See Data Collection for a look into how we collected our data.