TPE search over (instruction × demo) cells
Each trial picks a cell, scores it on a noisy minibatch, updates the sampler's belief.
trials 0 best
8 × 8 configuration grid
0.01.0
Score per trial · best-so-far
Click Run 1 to step the search.