{"ID":2865218,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2509.22137","arxiv_id":"2509.22137","title":"Log2Plan: An Adaptive GUI Automation Framework Integrated with Task Mining Approach","abstract":"GUI task automation streamlines repetitive tasks, but existing LLM or VLM-based planner-executor agents suffer from brittle generalization, high latency, and limited long-horizon coherence. Their reliance on single-shot reasoning or static plans makes them fragile under UI changes or complex tasks. Log2Plan addresses these limitations by combining a structured two-level planning framework with a task mining approach over user behavior logs, enabling robust and adaptable GUI automation. Log2Plan constructs high-level plans by mapping user commands to a structured task dictionary, enabling consistent and generalizable automation. To support personalization and reuse, it employs a task mining approach from user behavior logs that identifies user-specific patterns. These high-level plans are then grounded into low-level action sequences by interpreting real-time GUI context, ensuring robust execution across varying interfaces. We evaluated Log2Plan on 200 real-world tasks, demonstrating significant improvements in task success rate and execution time. Notably, it maintains over 60.0% success rate even on long-horizon task sequences, highlighting its robustness in complex, multi-step workflows.","short_abstract":"GUI task automation streamlines repetitive tasks, but existing LLM or VLM-based planner-executor agents suffer from brittle generalization, high latency, and limited long-horizon coherence. Their reliance on single-shot reasoning or static plans makes them fragile under UI changes or complex tasks. \nLog2Plan addresses t...","url_abs":"https://arxiv.org/abs/2509.22137","url_pdf":"https://arxiv.org/pdf/2509.22137v1","authors":"[\"Seoyoung Lee\",\"Seonbin Yoon\",\"Seongbeen Lee\",\"Hyesoo Kim\",\"Joo Yong Sim\"]","published":"2025-09-26T09:56:44Z","proceeding":"cs.AI","tasks":"[\"cs.AI\",\"cs.HC\",\"cs.MA\",\"cs.RO\"]","methods":"[\"Large Language Model\"]","has_code":false}
