{"ID":2889817,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2508.00903","arxiv_id":"2508.00903","title":"Universal Neurons in GPT-2: Emergence, Persistence, and Functional Impact","abstract":"We investigate the phenomenon of neuron universality in independently trained GPT-2 Small models, examining these universal neurons-neurons with consistently correlated activations across models-emerge and evolve throughout training. By analyzing five GPT-2 models at five checkpoints, we identify universal neurons through pairwise correlation analysis of activations over a dataset of 5 million tokens. Ablation experiments reveal significant functional impacts of universal neurons on model predictions, measured via cross entropy loss. Additionally, we quantify neuron persistence, demonstrating high stability of universal neurons across training checkpoints, particularly in early and deeper layers. These findings suggest stable and universal representational structures emerge during language model training.","short_abstract":"We investigate the phenomenon of neuron universality in independently trained GPT-2 Small models, examining these universal neurons-neurons with consistently correlated activations across models-emerge and evolve throughout training. By analyzing five GPT-2 models at five checkpoints, we identify universal neurons thro...","url_abs":"https://arxiv.org/abs/2508.00903","url_pdf":"https://arxiv.org/pdf/2508.00903v2","authors":"[\"Advey Nandan\",\"Cheng-Ting Chou\",\"Amrit Kurakula\",\"Cole Blondin\",\"Kevin Zhu\",\"Vasu Sharma\",\"Sean O'Brien\"]","published":"2025-07-28T22:45:33Z","proceeding":"cs.LG","tasks":"[\"cs.LG\",\"cs.AI\",\"cs.NE\"]","methods":"[\"Language Model\"]","has_code":false}
