{"ID":2889191,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2507.21783","arxiv_id":"2507.21783","title":"Domain Generalization and Adaptation in Intensive Care with Anchor Regression","abstract":"The performance of predictive models in clinical settings often degrades when deployed in new hospitals due to distribution shifts. This paper presents a large-scale study of causality-inspired domain generalization on heterogeneous multi-center intensive care unit (ICU) data. We apply anchor regression and introduce anchor boosting, a novel, tree-based nonlinear extension, to a large dataset comprising 400,000 patients from nine distinct ICU databases. We find that anchor regularization yields improvements of out-of-distribution performance, particularly for the most dissimilar target domains. The methods appear robust to violations of theoretical assumptions, such as anchor exogeneity. Furthermore, we propose a novel conceptual framework to quantify the utility of large external data datasets. By evaluating performance as a function of available target-domain data, we identify three regimes: (i) a domain generalization regime, where only the external model should be used, (ii) a domain adaptation regime, where refitting the external model is optimal, and (iii) a data-rich regime, where external data provides no additional value.","short_abstract":"The performance of predictive models in clinical settings often degrades when deployed in new hospitals due to distribution shifts. This paper presents a large-scale study of causality-inspired domain generalization on heterogeneous multi-center intensive care unit (ICU) data. We apply anchor regression and introduce a...","url_abs":"https://arxiv.org/abs/2507.21783","url_pdf":"https://arxiv.org/pdf/2507.21783v2","authors":"[\"Malte Londschien\",\"Manuel Burger\",\"Gunnar Rätsch\",\"Peter Bühlmann\"]","published":"2025-07-29T13:09:41Z","proceeding":"stat.AP","tasks":"[\"stat.AP\",\"cs.LG\",\"stat.ME\",\"stat.ML\"]","methods":"[]","has_code":false}
