{"ID":2843999,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2511.07158","arxiv_id":"2511.07158","title":"Guiding Generative Models to Uncover Diverse and Novel Crystals via Reinforcement Learning","abstract":"Discovering functional crystalline materials entails navigating an immense combinatorial design space. While recent advances in generative artificial intelligence have enabled the sampling of chemically plausible compositions and structures, a fundamental challenge remains: the objective misalignment between likelihood-based sampling in generative modelling and targeted focus on underexplored regions where novel compounds reside. Here, we introduce a reinforcement learning framework that guides latent denoising diffusion models toward diverse and novel, yet thermodynamically viable crystalline compounds. Our approach integrates group relative policy optimisation with verifiable, multi-objective rewards that jointly balance creativity, stability, and diversity. Beyond de novo generation, we demonstrate enhanced property-guided design that preserves chemical validity, while targeting desired functional properties. This approach establishes a modular foundation for controllable AI-driven inverse design that addresses the novelty-validity trade-off across scientific discovery applications of generative models.","short_abstract":"Discovering functional crystalline materials entails navigating an immense combinatorial design space. While recent advances in generative artificial intelligence have enabled the sampling of chemically plausible compositions and structures, a fundamental challenge remains: the objective misalignment between likelihood...","url_abs":"https://arxiv.org/abs/2511.07158","url_pdf":"https://arxiv.org/pdf/2511.07158v1","authors":"[\"Hyunsoo Park\",\"Aron Walsh\"]","published":"2025-11-10T14:48:49Z","proceeding":"cs.LG","tasks":"[\"cs.LG\",\"physics.comp-ph\"]","methods":"[\"Reinforcement Learning\",\"Diffusion Model\"]","has_code":false}
