Using existing enterprise apps probably - this solution is scalable for the vendor and it's easier to sell using existing software as-is than to start out by writing new custom tools.
Yes, I can see the usecase for legacy desktop apps etc, but the web? it's a DOM. WebMCP coming now too, no need for screenshotting or DOM querying then either..
After doing few experiments, I think that having Agents work on browser for all tasks wouldn't be best due to many factors like token cost, safety, etc. But browser/computer can be a tool that the agent can be alongside MCPs to complete tasks that requires interaction with such modalities.
mrorigo|2 days ago
shahules|3 days ago