DeepYardDeepYard
V

VISUALSKILL

Hierarchical skill library for GUI automation with reusable multimodal interaction patterns

Open SourceFree

About

Research framework for organizing computer-use agent skills into reusable, application-specific components. Combines visual and textual artifacts in a hierarchical library structure with MCP-compatible interfaces. Enables agents to learn and reuse GUI interaction patterns across different applications, reducing the need to relearn common tasks. Particularly useful for building agents that automate desktop workflows and UI interactions.

Details

Language
Patterns

Tags

frameworkopen-sourcemcpautonomoustool-usepython