{"ID":2892436,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2507.16011","arxiv_id":"2507.16011","title":"mRAKL: Multilingual Retrieval-Augmented Knowledge Graph Construction for Low-Resourced Languages","abstract":"Knowledge Graphs represent real-world entities and the relationships between them. Multilingual Knowledge Graph Construction (mKGC) refers to the task of automatically constructing or predicting missing entities and links for knowledge graphs in a multilingual setting. In this work, we reformulate the mKGC task as a Question Answering (QA) task and introduce mRAKL: a Retrieval-Augmented Generation (RAG) based system to perform mKGC. We achieve this by using the head entity and linking relation in a question, and having our model predict the tail entity as an answer. Our experiments focus primarily on two low-resourced languages: Tigrinya and Amharic. We experiment with using higher-resourced languages Arabic and English for cross-lingual transfer. With a BM25 retriever, we find that the RAG-based approach improves performance over a no-context setting. Further, our ablation studies show that with an idealized retrieval system, mRAKL improves accuracy by 4.92 and 8.79 percentage points for Tigrinya and Amharic, respectively.","short_abstract":"Knowledge Graphs represent real-world entities and the relationships between them. Multilingual Knowledge Graph Construction (mKGC) refers to the task of automatically constructing or predicting missing entities and links for knowledge graphs in a multilingual setting. In this work, we reformulate the mKGC task as a Qu...","url_abs":"https://arxiv.org/abs/2507.16011","url_pdf":"https://arxiv.org/pdf/2507.16011v1","authors":"[\"Hellina Hailu Nigatu\",\"Min Li\",\"Maartje ter Hoeve\",\"Saloni Potdar\",\"Sarah Chasins\"]","published":"2025-07-21T19:11:31Z","proceeding":"cs.CL","tasks":"[\"cs.CL\"]","methods":"[\"RAG\"]","has_code":false}
