{"ID":2826902,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2512.17217","arxiv_id":"2512.17217","title":"LZ78 Substring Compression in Compressed Space","abstract":"The Lempel--Ziv 78 (LZ78) factorization is a well-studied technique for data compression. It and its derivatives are used in compression formats such as \"compress\" or \"gif\". Although most research focuses on the factorization of plain data, not much research has been conducted on indexing the data for fast LZ78 factorization. Here, we study the LZ78 factorization and its derivatives in the substring compression model, where we are allowed to index the data and return the factorization of a substring specified at query time. In that model, we propose an algorithm that works in compressed space, computing the factorization with a logarithmic slowdown compared to the optimal time complexity.","short_abstract":"The Lempel--Ziv 78 (LZ78) factorization is a well-studied technique for data compression. It and its derivatives are used in compression formats such as \"compress\" or \"gif\". Although most research focuses on the factorization of plain data, not much research has been conducted on indexing the data for fast LZ78 factori...","url_abs":"https://arxiv.org/abs/2512.17217","url_pdf":"https://arxiv.org/pdf/2512.17217v1","authors":"[\"Hiroki Shibata\",\"Dominik Köppl\"]","published":"2025-12-19T03:59:08Z","proceeding":"cs.DS","tasks":"[\"cs.DS\"]","methods":"[]","has_code":false}
