{"ID":2892297,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2507.15671","arxiv_id":"2507.15671","title":"BugScope: Learn to Find Bugs Like Human","abstract":"Software auditing is an increasingly critical task in the era of rapid code generation. While LLM-based auditors have demonstrated strong potential, their effectiveness remains limited by misalignment with the highly complex, domain-specific nature of bug detection. In this work, we introduce BugScope, a framework that mirrors how human auditors learn specific bug patterns from representative examples and apply this knowledge during code auditing. BugScope structures auditing into three steps: seed identification, context retrieval, and bug detection, and aligns LLMs to each step by analyzing real bug reports and mutated examples, and distilling concise, reusable guidelines. On a curated dataset of 33 real-world bugs from 21 widely used open-source projects, BugScope achieves 86.05\\% precision and 87.88\\% recall, corresponding to an F1 score of 0.87. By comparison, leading industrial tools such as Claude Code (with Claude Opus 4.6) and Cursor BugBot achieve F1 scores of only 0.51 and 0.43, respectively. Beyond benchmarks, large-scale evaluation on real-world projects such as the Linux kernel uncovered 184 previously unknown bugs, of which 78 have already been fixed and 7 explicitly confirmed by developers. Our code is available at https://github.com/jinyaoguo/BugScope","short_abstract":"Software auditing is an increasingly critical task in the era of rapid code generation. While LLM-based auditors have demonstrated strong potential, their effectiveness remains limited by misalignment with the highly complex, domain-specific nature of bug detection. In this work, we introduce BugScope, a framework that...","url_abs":"https://arxiv.org/abs/2507.15671","url_pdf":"https://arxiv.org/pdf/2507.15671v2","authors":"[\"Jinyao Guo\",\"Chengpeng Wang\",\"Dominic Deluca\",\"Jinjie Liu\",\"Zhuo Zhang\",\"Xiangyu Zhang\"]","published":"2025-07-21T14:34:01Z","proceeding":"cs.SE","tasks":"[\"cs.SE\"]","methods":"[\"Large Language Model\"]","has_code":false,"code_links":[{"ID":611981,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_id":2892297,"paper_url":"https://arxiv.org/abs/2507.15671","paper_title":"BugScope: Learn to Find Bugs Like Human","repo_url":"https://github.com/jinyaoguo/BugScope","is_official":false,"mentioned_in_paper":false,"mentioned_in_github":true,"github_stars":0}]}
