{"ID":2844857,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2511.11626","arxiv_id":"2511.11626","title":"Omics-scale polymer computational database transferable to real-world artificial intelligence applications","abstract":"Developing large-scale foundational datasets is a critical milestone in advancing artificial intelligence (AI)-driven scientific innovation. However, unlike AI-mature fields such as natural language processing, materials science, particularly polymer research, has significantly lagged in developing extensive open datasets. This lag is primarily due to the high costs of polymer synthesis and property measurements, along with the vastness and complexity of the chemical space. This study presents PolyOmics, an omics-scale computational database generated through fully automated molecular dynamics simulation pipelines that provide diverse physical properties for over $10^5$ polymeric materials. The PolyOmics database is collaboratively developed by approximately 260 researchers from 48 institutions to bridge the gap between academia and industry. Machine learning models pretrained on PolyOmics can be efficiently fine-tuned for a wide range of real-world downstream tasks, even when only limited experimental data are available. Notably, the generalisation capability of these simulation-to-real transfer models improves significantly as the size of the PolyOmics database increases, exhibiting power-law scaling. The emergence of scaling laws supports the \"more is better\" principle, highlighting the significance of ultralarge-scale computational materials data for improving real-world prediction performance. 
This unprecedented omics-scale database reveals vast unexplored regions of polymer materials, providing a foundation for AI-driven polymer science.","short_abstract":"Developing large-scale foundational datasets is a critical milestone in advancing artificial intelligence (AI)-driven scientific innovation. However, unlike AI-mature fields such as natural language processing, materials science, particularly polymer research, has significantly lagged in developing extensive open datas...","url_abs":"https://arxiv.org/abs/2511.11626","url_pdf":"https://arxiv.org/pdf/2511.11626v1","authors":"[\"Ryo Yoshida\",\"Yoshihiro Hayashi\",\"Hidemine Furuya\",\"Ryohei Hosoya\",\"Kazuyoshi Kaneko\",\"Hiroki Sugisawa\",\"Yu Kaneko\",\"Aiko Takahashi\",\"Yoh Noguchi\",\"Shun Nanjo\",\"Keiko Shinoda\",\"Tomu Hamakawa\",\"Mitsuru Ohno\",\"Takuya Kitamura\",\"Misaki Yonekawa\",\"Stephen Wu\",\"Masato Ohnishi\",\"Chang Liu\",\"Teruki Tsurimoto\",\"Arifin\",\"Araki Wakiuchi\",\"Kohei Noda\",\"Junko Morikawa\",\"Teruaki Hayakawa\",\"Junichiro Shiomi\",\"Masanobu Naito\",\"Kazuya Shiratori\",\"Tomoki Nagai\",\"Norio Tomotsu\",\"Hiroto Inoue\",\"Ryuichi Sakashita\",\"Masashi Ishii\",\"Isao Kuwajima\",\"Kenji Furuichi\",\"Norihiko Hiroi\",\"Yuki Takemoto\",\"Takahiro Ohkuma\",\"Keita Yamamoto\",\"Naoya Kowatari\",\"Masato Suzuki\",\"Naoya Matsumoto\",\"Seiryu Umetani\",\"Hisaki Ikebata\",\"Yasuyuki Shudo\",\"Mayu Nagao\",\"Shinya Kamada\",\"Kazunori Kamio\",\"Taichi Shomura\",\"Kensaku Nakamura\",\"Yudai Iwamizu\",\"Atsutoshi Abe\",\"Koki Yoshitomi\",\"Yuki Horie\",\"Katsuhiko Koike\",\"Koichi Iwakabe\",\"Shinya Gima\",\"Kota Usui\",\"Gikyo Usuki\",\"Takuro Tsutsumi\",\"Keitaro Matsuoka\",\"Kazuki Sada\",\"Masahiro Kitabata\",\"Takuma Kikutsuji\",\"Akitaka Kamauchi\",\"Yusuke Iijima\",\"Tsubasa Suzuki\",\"Takenori Goda\",\"Yuki Takabayashi\",\"Kazuko Imai\",\"Yuji Mochizuki\",\"Hideo Doi\",\"Koji Okuwaki\",\"Hiroya Nitta\",\"Taku Ozawa\",\"Hitoshi Kamijima\",\"Toshiaki Shintani\",\"Takuma 
Mitamura\",\"Massimiliano Zamengo\",\"Yuitsu Sugami\",\"Seiji Akiyama\",\"Yoshinari Murakami\",\"Atsushi Betto\",\"Naoya Matsuo\",\"Satoru Kagao\",\"Tetsuya Kobayashi\",\"Norie Matsubara\",\"Shosei Kubo\",\"Yuki Ishiyama\",\"Yuri Ichioka\",\"Mamoru Usami\",\"Satoru Yoshizaki\",\"Seigo Mizutani\",\"Yosuke Hanawa\",\"Shogo Kunieda\",\"Mitsuru Yambe\",\"Takeru Nakamura\",\"Hiromori Murashima\",\"Kenji Takahashi\",\"Naoki Wada\",\"Masahiro Kawano\",\"Yosuke Harada\",\"Takehiro Fujita\",\"Erina Fujita\",\"Ryoji Himeno\",\"Hiori Kino\",\"Kenji Fukumizu\"]","published":"2025-11-07T09:03:07Z","proceeding":"physics.chem-ph","tasks":"[\"physics.chem-ph\",\"cond-mat.mtrl-sci\",\"cond-mat.soft\",\"cs.LG\"]","methods":"[]","has_code":false}
