{"ID":2839211,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2511.16624","arxiv_id":"2511.16624","title":"SAM 3D: 3Dfy Anything in Images","abstract":"We present SAM 3D, a generative model for visually grounded 3D object reconstruction, predicting geometry, texture, and layout from a single image. SAM 3D excels in natural images, where occlusion and scene clutter are common and visual recognition cues from context play a larger role. We achieve this with a human- and model-in-the-loop pipeline for annotating object shape, texture, and pose, providing visually grounded 3D reconstruction data at unprecedented scale. We learn from this data in a modern, multi-stage training framework that combines synthetic pretraining with real-world alignment, breaking the 3D \"data barrier\". We obtain significant gains over recent work, with at least a 5:1 win rate in human preference tests on real-world objects and scenes. We will release our code and model weights, an online demo, and a new challenging benchmark for in-the-wild 3D object reconstruction.","short_abstract":"We present SAM 3D, a generative model for visually grounded 3D object reconstruction, predicting geometry, texture, and layout from a single image. SAM 3D excels in natural images, where occlusion and scene clutter are common and visual recognition cues from context play a larger role. We achieve this with a human- and...","url_abs":"https://arxiv.org/abs/2511.16624","url_pdf":"https://arxiv.org/pdf/2511.16624v1","authors":"[\"SAM 3D Team\",\"Xingyu Chen\",\"Fu-Jen Chu\",\"Pierre Gleize\",\"Kevin J Liang\",\"Alexander Sax\",\"Hao Tang\",\"Weiyao Wang\",\"Michelle Guo\",\"Thibaut Hardin\",\"Xiang Li\",\"Aohan Lin\",\"Jiawei Liu\",\"Ziqi Ma\",\"Anushka Sagar\",\"Bowen Song\",\"Xiaodong Wang\",\"Jianing Yang\",\"Bowen Zhang\",\"Piotr Dollár\",\"Georgia Gkioxari\",\"Matt Feiszli\",\"Jitendra Malik\"]","published":"2025-11-20T18:31:46Z","proceeding":"cs.CV","tasks":"[\"cs.CV\",\"cs.AI\"]","methods":"[]","has_code":false}
