{"ID":2840559,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2511.13276","arxiv_id":"2511.13276","title":"Recognition of Abnormal Events in Surveillance Videos using Weakly Supervised Dual-Encoder Models","abstract":"We address the challenge of detecting rare and diverse anomalies in surveillance videos using only video-level supervision. Our dual-backbone framework combines convolutional and transformer representations through top-k pooling, achieving 90.7% area under the curve (AUC) on the UCF-Crime dataset.","short_abstract":"We address the challenge of detecting rare and diverse anomalies in surveillance videos using only video-level supervision. Our dual-backbone framework combines convolutional and transformer representations through top-k pooling, achieving 90.7% area under the curve (AUC) on the UCF-Crime dataset.","url_abs":"https://arxiv.org/abs/2511.13276","url_pdf":"https://arxiv.org/pdf/2511.13276v1","authors":"[\"Noam Tsfaty\",\"Avishai Weizman\",\"Liav Cohen\",\"Moshe Tshuva\",\"Yehudit Aperstein\"]","published":"2025-11-17T11:47:28Z","proceeding":"cs.CV","tasks":"[\"cs.CV\"]","methods":"[\"Transformer\"]","has_code":false}