{"ID":3004648,"CreatedAt":"2026-06-03T03:09:48.883664427Z","UpdatedAt":"2026-06-05T11:43:53.432517148Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2606.03948","arxiv_id":"2606.03948","title":"A Pocket Offline Model for Simultaneous Speech Translation as CUNI Submission to IWSLT 2026","abstract":"We implement simultaneous translation capability with the offline direct speech-to-text translation model Canary, using the state-of-the-art policy AlignAtt, and submit it to IWSLT 2026 Simultaneous Speech Translation Shared task for Czech to English and English to German and Italian. The strengths of our system are: (1) high translation quality, outperforming similarly sized baselines both in low- and high-latency regimes in computationally unaware simulations; (2) low computational requirements, as the model has only 1B parameters; (3) multilinguality -- support of 25 source and 25 target languages.","short_abstract":"We implement simultaneous translation capability with the offline direct speech-to-text translation model Canary, using the state-of-the-art policy AlignAtt, and submit it to IWSLT 2026 Simultaneous Speech Translation Shared task for Czech to English and English to German and Italian. The strengths of our system are: (...","url_abs":"https://arxiv.org/abs/2606.03948","url_pdf":"https://arxiv.org/pdf/2606.03948v1","authors":"[\"Aziz Sharipov Ortega\",\"Dominik Macháček\"]","published":"2026-06-02T17:37:11Z","proceeding":"cs.CL","tasks":"[\"cs.CL\"]","methods":"[]","has_code":false}
