AI-powered, vision-driven UI automation for every platform.
-
Updated
May 18, 2026 - TypeScript
AI-powered, vision-driven UI automation for every platform.
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).
Source code of the paper "V-Droid: Advancing Mobile GUI Agent Through Generative Verifiers"
Eliminate distracted driving using AI detection and rewards system.
Scripts for the analysis of EMA and longitudinal data on virtual media use and well-being during the pandemic
Add a description, image, and links to the phone-use topic page so that developers can more easily learn about it.
To associate your repository with the phone-use topic, visit your repo's landing page and select "manage topics."