gui-agent

Star

Here are 19 public repositories matching this topic...

bytedance / UI-TARS-desktop

Star

The Open-sourced Multimodal AI Agent Stack connecting Cutting-edge AI Models and Agent Infra.

agent mcp vision vlm tars multimodal computer-use mcp-server gui-agent browser-use gui-operator ui-tars agent-tars

Updated Aug 25, 2025
TypeScript

showlab / ShowUI

Star

[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

agent vision-language-model vision-language-action computer-use gui-agent

Updated May 29, 2025
Python

trycua / acu

Star

A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.

awesome ai computer ai-research computer-use gui-agent ui-agent

Updated May 15, 2025

zai-org / CogAgent

Star

An open-sourced end-to-end VLM-based GUI Agent

agent glm vlm computer-use gui-agent

Updated Apr 4, 2025
Python

OS-Agent-Survey / OS-Agent-Survey

Star

This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).

agent gui survey operator web-agent browser-agent llms phone-use mllms computer-use gui-agent os-agent os-agent-survey computing-devices computer-using-agent computer-using

Updated Aug 16, 2025

SunzeY / SEAgent

Star

Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"

agent rl vllm gui-agent self-evolving-systems grpo computer-use-agent osworld

Updated Aug 7, 2025
Python

ritzz-ai / GUI-R1

Star

Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents

deep-reinforcement-learning r1 multimodal o1 multimodal-large-language-models large-multimodal-models gui-agent grpo mllm-reasoning

Updated May 5, 2025
Python

lll6gg / UI-R1

Star

Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"

reinforcement-learning r1 multimodal-learning multimodal-large-language-models gui-agent efficient-reasoning

Updated May 26, 2025
Python

showlab / WorldGUI

Star

Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.

gui-application agents large-multimodal-models gui-agent

Updated Jul 27, 2025
Python

wendell0218 / GVA-Survey

Star

Official repository of the paper "Generalist Virtual Agents: A Survey on Autonomous Agents Across Digital Platforms"

survey vlm gva multi-agent-system virtual-agent embodied-agent llm mllm gui-agent generalist-virtual-agent

Updated Jul 11, 2025

Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent with a hierarchical manner across multiple platforms, including Windows, Linux, macOS, iOS, Android and Web.

benchmark-framework vision-language-model computer-use gui-agent

Updated Aug 15, 2025
Python

InfiXAI / InfiGUI-G1

Star

Official repository for InfiGUI-G1. We introduce Adaptive Exploration Policy Optimization (AEPO) to overcome semantic alignment bottlenecks in GUI agents through efficient, guided exploration.

reinforcement-learning computer-vision deep-learning large-language-models multimodal-llm gui-grounding gui-agent

Updated Aug 19, 2025
Python

TurixAI / TuriX-CUA

Star

This is the official website for TuriX Computer-use-Agent

agent mcp cua ai-agents computer-automation computer-use gui-agent browser-use computer-use-agent gui-operator

Updated Aug 15, 2025
Python

TongUI-agent / TongUI-agent

Star

Release of code, datasets and model for our work TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials

agent vision-language-model vision-language-action computer-use gui-agent vision-language-action-model computer-use-agent tongui

Updated Jul 11, 2025
HTML

V-Droid-Agent / V-Droid

Star

Source code of the paper "V-Droid: Advancing Mobile GUI Agent Through Generative Verifiers"

agent mobile mobile-agents llm phone-use computer-use gui-agent

Updated Jul 8, 2025
Python

Magic-Abracadabra / AI-Chinese-Scripting-Language

Star

This is a quick test of Chinese Scripting Language powered by AI. You can use it to open any text file. No illegal use is allowed! Free for commercial use and academic use.

prompt-engineering gui-agent context-engineering

Updated Aug 6, 2025
Python

Magic-Abracadabra / AI-Chinese-Scripting-Language-Linux

Star

Control Group of My Future Paper, without Task Planning, Exceptional Handling, and fully based on LLMs

linux prompt-engineering computer-use gui-agent context-engineering

Updated Aug 6, 2025
Python

shiva129stha / SEAgent

Star

🐙 SEAgent is a self-evolving agent with autonomous learning from experience, providing SEAgent-1.0-7B and World State Model for adaptive decision-making.

agent rl vllm gui-agent self-evolving-systems grpo computer-use-agent osworld

Updated Aug 25, 2025
Python

Yah185 / open-source-operator

Star

Create your self-hosted, open-source Operator model.

training-infra agent-evals gui-agent browseruse native-agent-model

Updated Aug 25, 2025

Improve this page

Add a description, image, and links to the gui-agent topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the gui-agent topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gui-agent

Here are 19 public repositories matching this topic...

bytedance / UI-TARS-desktop

showlab / ShowUI

trycua / acu

zai-org / CogAgent

OS-Agent-Survey / OS-Agent-Survey

SunzeY / SEAgent

ritzz-ai / GUI-R1

lll6gg / UI-R1

showlab / WorldGUI

wendell0218 / GVA-Survey

open-compass / MMBench-GUI

InfiXAI / InfiGUI-G1

TurixAI / TuriX-CUA

TongUI-agent / TongUI-agent

V-Droid-Agent / V-Droid

Magic-Abracadabra / AI-Chinese-Scripting-Language

Magic-Abracadabra / AI-Chinese-Scripting-Language-Linux

shiva129stha / SEAgent

Yah185 / open-source-operator

Improve this page

Add this topic to your repo