LabOSBench is a GUI agent benchmark suite for scientific instrument web simulators (SEM, FIB, EDS, APT, LFM, SPM, TEM, XRD). Agents control instrument UIs in the browser via Playwright and ...