{"ID":2860796,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2510.02734","arxiv_id":"2510.02734","title":"SAE-RNA: A Sparse Autoencoder Model for Interpreting RNA Language Model Representations","abstract":"Deep learning, particularly with the advancement of Large Language Models, has transformed biomolecular modeling, with protein language models such as ESM inspiring emerging RNA language models such as RiNALMo. Recent work has begun applying sparse autoencoders (SAEs) to protein language model representations, exploring representation-level interpretability in biomolecular models. Here, we explore whether SAEs can provide interpretable feature decompositions of RNA language model representations, while also examining their limitations in this setting. We present SAE-RNA, interpretability model that analyzes RiNALMo representations and maps them to known human-level biological features. Rather than claiming definitive biological concept discovery, our study frames SAE-based analysis as a representation-level probe for characterizing how RNA language models organize biological information internally. More broadly, SAE-RNA provides a feature-level framework for comparing RNA groups and identifying sparse representation components associated with RNA family identity or structural context.","short_abstract":"Deep learning, particularly with the advancement of Large Language Models, has transformed biomolecular modeling, with protein language models such as ESM inspiring emerging RNA language models such as RiNALMo. Recent work has begun applying sparse autoencoders (SAEs) to protein language model representations, explorin...","url_abs":"https://arxiv.org/abs/2510.02734","url_pdf":"https://arxiv.org/pdf/2510.02734v2","authors":"[\"Taehan Kim\",\"Sangdae Nam\"]","published":"2025-10-03T05:34:59Z","proceeding":"q-bio.BM","tasks":"[\"q-bio.BM\",\"cs.AI\",\"q-bio.GN\"]","methods":"[\"Language Model\",\"Generative Adversarial Network\"]","has_code":false}