TPE search over (instruction × demo) cells
Each trial picks a cell, scores it on a noisy minibatch, and updates the sampler's belief.
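The loop described above can be sketched in a few lines of standard-library Python. This is a simplified, TPE-style sampler for illustration, not the code behind the demo: it treats the instruction index and the demo index as two independent categorical dimensions, splits past trials into a "good" top fraction and a "bad" remainder, and picks the candidate that maximizes the ratio of the two smoothed densities. The names `true_quality`, `noisy_score`, `tpe_suggest`, and `run_search`, the hidden quality surface, the noise level, and the settings `gamma`, `n_startup`, and `n_candidates` are all hypothetical choices made for this sketch.

```python
import random

N_INSTRUCTIONS = 8  # grid rows (assumed: one per candidate instruction)
N_DEMOS = 8         # grid columns (assumed: one per candidate demo set)


def true_quality(i, d):
    """Hypothetical hidden quality of cell (i, d); peaks at (5, 2), stays in [0, 1]."""
    return 1.0 - (abs(i - 5) + abs(d - 2)) / 12.0


def noisy_score(i, d, rng, noise=0.1):
    """Stand-in for minibatch scoring: true quality plus Gaussian noise, clipped to [0, 1]."""
    return min(1.0, max(0.0, true_quality(i, d) + rng.gauss(0.0, noise)))


def smoothed_probs(values, n_choices, prior=1.0):
    """Categorical density estimate: observed counts plus a uniform prior count."""
    counts = [prior] * n_choices
    for v in values:
        counts[v] += 1.0
    total = sum(counts)
    return [c / total for c in counts]


def tpe_suggest(history, rng, gamma=0.25, n_startup=10, n_candidates=24):
    """Pick the next cell given history, a list of ((i, d), score) pairs."""
    if len(history) < n_startup:
        # Too little data to model: explore uniformly at random.
        return rng.randrange(N_INSTRUCTIONS), rng.randrange(N_DEMOS)
    ranked = sorted(history, key=lambda t: t[1], reverse=True)
    n_good = max(1, int(gamma * len(ranked)))
    good, bad = ranked[:n_good], ranked[n_good:]
    cell = []
    for dim, n_choices in enumerate((N_INSTRUCTIONS, N_DEMOS)):
        l = smoothed_probs([c[dim] for c, _ in good], n_choices)  # density of good trials
        g = smoothed_probs([c[dim] for c, _ in bad], n_choices)   # density of the rest
        # Draw candidates from l, then keep the one with the highest l/g ratio.
        candidates = rng.choices(range(n_choices), weights=l, k=n_candidates)
        cell.append(max(candidates, key=lambda k: l[k] / g[k]))
    return tuple(cell)


def run_search(n_trials=60, seed=0):
    """Run the search; return the trial history and the best-so-far curve."""
    rng = random.Random(seed)
    history, best_so_far = [], []
    for _ in range(n_trials):
        cell = tpe_suggest(history, rng)
        score = noisy_score(cell[0], cell[1], rng)
        history.append((cell, score))
        best_so_far.append(max(s for _, s in history))
    return history, best_so_far
```

Calling `run_search()` returns one `((i, d), score)` entry per trial plus the running maximum. Because each score is a single noisy observation, the best-so-far value is the best observed score, which can overstate a cell's underlying quality.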
[Figure: 8 × 8 configuration grid, with a plot of score per trial and best-so-far (score axis from 0.0 to 1.0).]