{"ID":2874008,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2509.05769","arxiv_id":"2509.05769","title":"IoT Miner: Intelligent Extraction of Event Logs from Sensor Data for Process Mining","abstract":"This paper presents IoT Miner, a novel framework for automatically creating high-level event logs from raw industrial sensor data to support process mining. In many real-world settings, such as mining or manufacturing, standard event logs are unavailable, and sensor data lacks the structure and semantics needed for analysis. IoT Miner addresses this gap using a four-stage pipeline: data preprocessing, unsupervised clustering, large language model (LLM)-based labeling, and event log construction. A key innovation is the use of LLMs to generate meaningful activity labels from cluster statistics, guided by domain-specific prompts. We evaluate the approach on sensor data from a Load-Haul-Dump (LHD) mining machine and introduce a new metric, Similarity-Weighted Accuracy, to assess labeling quality. Results show that richer prompts lead to more accurate and consistent labels. By combining AI with domain-aware data processing, IoT Miner offers a scalable and interpretable method for generating event logs from IoT data, enabling process mining in settings where traditional logs are missing.","short_abstract":"This paper presents IoT Miner, a novel framework for automatically creating high-level event logs from raw industrial sensor data to support process mining. In many real-world settings, such as mining or manufacturing, standard event logs are unavailable, and sensor data lacks the structure and semantics needed for ana...","url_abs":"https://arxiv.org/abs/2509.05769","url_pdf":"https://arxiv.org/pdf/2509.05769v1","authors":"[\"Edyta Brzychczy\",\"Urszula Jessen\",\"Krzysztof Kluza\",\"Sridhar Sriram\",\"Manuel Vargas Nettelnstroth\"]","published":"2025-09-06T16:50:33Z","proceeding":"cs.SE","tasks":"[\"cs.SE\"]","methods":"[\"Large Language Model\",\"Language Model\"]","has_code":false}
