microsoft/OmniParser
A simple screen parsing tool towards pure vision based GUI agent
Stars: 24,420
Language: Jupyter Notebook