{"ID":2875986,"CreatedAt":"2026-06-01T04:54:23.091178241Z","UpdatedAt":"2026-06-01T04:54:23.091178241Z","DeletedAt":null,"paper_url":"https://arxiv.org/abs/2509.01566","arxiv_id":"2509.01566","title":"CSRM-LLM: Embracing Multilingual LLMs for Cold-Start Relevance Matching in Emerging E-commerce Markets","abstract":"As global e-commerce platforms continue to expand, companies are entering new markets where they encounter cold-start challenges due to limited human labels and user behaviors. In this paper, we share our experiences in Coupang to provide a competitive cold-start performance of relevance matching for emerging e-commerce markets. Specifically, we present a Cold-Start Relevance Matching (CSRM) framework, utilizing a multilingual Large Language Model (LLM) to address three challenges: (1) activating cross-lingual transfer learning abilities of LLMs through machine translation tasks; (2) enhancing query understanding and incorporating e-commerce knowledge by retrieval-based query augmentation; (3) mitigating the impact of training label errors through a multi-round self-distillation training strategy. Our experiments demonstrate the effectiveness of CSRM-LLM and the proposed techniques, resulting in successful real-world deployment and significant online gains, with a 45.8% reduction in defect ratio and a 0.866% uplift in session purchase rate.","short_abstract":"As global e-commerce platforms continue to expand, companies are entering new markets where they encounter cold-start challenges due to limited human labels and user behaviors. In this paper, we share our experiences in Coupang to provide a competitive cold-start performance of relevance matching for emerging e-commerc...","url_abs":"https://arxiv.org/abs/2509.01566","url_pdf":"https://arxiv.org/pdf/2509.01566v2","authors":"[\"Yujing Wang\",\"Yiren Chen\",\"Huoran Li\",\"Chunxu Xu\",\"Yuchong Luo\",\"Xianghui Mao\",\"Cong Li\",\"Lun Du\",\"Chunyang Ma\",\"Qiqi Jiang\",\"Yin Wang\",\"Fan Gao\",\"Wenting Mo\",\"Pei Wen\",\"Shantanu Kumar\",\"Taejin Park\",\"Yiwei Song\",\"Vijay Rajaram\",\"Tao Cheng\",\"Sonu Durgia\",\"Pranam Kolari\"]","published":"2025-09-01T15:51:30Z","proceeding":"cs.IR","tasks":"[\"cs.IR\",\"cs.CL\"]","methods":"[\"Large Language Model\",\"Language Model\"]","has_code":false}
