Why Midscene's Action Space Is a Protocol
How many places do you have to touch to add a new action to Midscene?
This is my rough-and-ready test for whether a framework is well designed. If adding a “pull to refresh” gesture means changing the locating logic, changing the planning prompt, changing the execution dispatch, and then patching in a pile of if-else along the way, then sooner or later the framework gets crushed under its own action space.
Midscene’s answer is: you add it in one place. This post is about how it does that, and about the part it does not do.
