{"ID":3053217,"CreatedAt":"2026-06-04T04:41:36.695875263Z","UpdatedAt":"2026-06-05T19:19:17.853951865Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2606.04166","arxiv_id":"2606.04166","title":"End-to-End Text Line Detection and Ordering","abstract":"Practical text-recognition pipelines for historical documents typically decompose layout analysis into line detection followed by a separate reading-order step, with the latter most often handled by a hand-coded geometric heuristic that struggles with marginalia, multiple columns, tables, and source-specific editorial conventions. This article introduces Orli (Ordered Regression of Lines), an end-to-end model that casts both sub-tasks as a single image-to-sequence problem: from a page image, Orli autoregressively generates text-line baselines directly in reading order. Baselines are represented in a chord-frame parameterization that anchors a line's position, orientation, and extent while encoding local geometry through perpendicular offsets; an iterative refinement head and a local visual refiner produce the final curve. Trained on a heterogeneous corpus of 196,691 pages spanning ten writing systems, Orli marginally exceeds the previously reported state of the art for cBAD line detection without dataset-specific training, reaches near perfect coverage and ordering on multiple reading-order benchmarks zero-shot, and adapts to more specialized out-of-domain layouts with limited fine-tuning. The method's source code and model weights are available under an open license at https://github.com/mittagessen/orli.","short_abstract":"Practical text-recognition pipelines for historical documents typically decompose layout analysis into line detection followed by a separate reading-order step, with the latter most often handled by a hand-coded geometric heuristic that struggles with marginalia, multiple columns, tables, and source-specific editorial...","url_abs":"https://arxiv.org/abs/2606.04166","url_pdf":"https://arxiv.org/pdf/2606.04166v1","authors":"[\"Benjamin Kiessling\"]","published":"2026-06-02T19:29:32Z","proceeding":"cs.CV","tasks":"[\"cs.CV\"]","methods":"[]","has_code":false,"code_links":[{"ID":612801,"CreatedAt":"2026-06-04T04:41:36.695875263Z","UpdatedAt":"2026-06-04T04:41:36.695875263Z","DeletedAt":null,"paper_id":3053217,"paper_url":"https://arxiv.org/abs/2606.04166","paper_title":"End-to-End Text Line Detection and Ordering","repo_url":"https://github.com/mittagessen/orli","is_official":false,"mentioned_in_paper":false,"mentioned_in_github":true,"github_stars":0}]}
