{"ID":3084653,"CreatedAt":"2026-06-05T06:46:15.197025399Z","UpdatedAt":"2026-06-06T19:31:40.473717466Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2606.05383","arxiv_id":"2606.05383","title":"Can AI Refute Economic Theory? Evidence from Beyond the Knowledge Cutoff","abstract":"Can artificial intelligence (AI) refute economic theory? I document experiments in which I asked several AI models (Gemini, Refine, Claude, and ChatGPT) to check the correctness of four published papers in economic theory, each containing an error that I helped identify or correct. ChatGPT Pro performed best, occasionally constructing counterexamples and corrected proofs, while other models fared worse. However, no model located a true error without substantial human guidance, and data contamination complicates interpretation. I argue that a competent human paired with a frontier model can outperform current peer review, but AI cannot yet refute economic theory on its own.","short_abstract":"Can artificial intelligence (AI) refute economic theory? I document experiments in which I asked several AI models (Gemini, Refine, Claude, and ChatGPT) to check the correctness of four published papers in economic theory, each containing an error that I helped identify or correct. ChatGPT Pro performed best, occasiona...","url_abs":"https://arxiv.org/abs/2606.05383","url_pdf":"https://arxiv.org/pdf/2606.05383v1","authors":"[\"Alexis Akira Toda\"]","published":"2026-06-03T19:36:37Z","proceeding":"econ.GN","tasks":"[\"econ.GN\",\"cs.AI\",\"econ.TH\"]","methods":"[]","has_code":false}
