Previously, I received both my M.S. and B.S. degrees from the College of Computer Science and Technology at Zhejiang University, under the supervision of Prof. Kun Kuang and Shengyu Zhang. Additionally, I was fortunate to conduct research on generative retrieval of recommendation at the Taobao & Tmall Group, Alibaba.
My research focuses on the generalizability and personalization of recommender systems. Recently, I am interested in both recommendations with large language models and the efficient inference of large recommenders when facing much longer user interaction sequences. Meanwhile, I also tackle the distinctive challenges involved in integrating heterogeneous models across diverse computational environments like mobile devices and cloud servers in a seamless and effective manner.
๐ฅ News
- 2026.05: ย ๐๐ One paper has been accepted to KDD 2026.
- 2026.02: ย ๐ฐ๐ฐ One paper has been deployed in Taobao and is available on Arxiv about rank-enhanced generative retrieval with list-wise DPO.
- 2026.01: ย ๐๐ Three papers have been accepted to the research/industry/short paper track of TheWebConf 2026.
- 2026.01: ย ๐ฐ๐ฐ One paper has been available on Arxiv about the efficient inference of large sequential recommendation.
- 2025.09: ย ๐ฐ๐ฐ One paper has been deployed in Taobao and is available on Arxiv about generative retrieval with semantic identifiers.
- 2025.07: ย ๐๐ Two papers have been accepted to MM 2025.
- 2025.07: ย ๐๐ Forward-OFA has been deployed in the Ascend Community of Huawei using NPU.
- 2024.12: ย ๐๐ One paper has been accepted to AAAI 2025.
- 2024.11: ย ๐๐ One paper has been accepted to KDD 2025.
- 2024.08: ย ๐ฅณ๐ฅณ I went to Barcelona, Spain, to attend the KDD conference to deliver an oral presentation of our paper DIET.
- 2024.05: ย ๐๐ One paper has been accepted to KDD 2024.
- 2023.07: ย ๐ฅณ๐ฅณ I went to Fuzhou, China, to attend the CICAI conference to deliver an oral presentation and won the Best Paper Award.
- 2023.06: ย ๐๐ One paper has been accepted to CICAI 2023.
๐ Publications
* denote the authors contributed equally.

FORGE: Forming Semantic Identifiers for Generative Retrieval in Industrial Datasets
Kairui Fu, Tao Zhang, Shuwen Xiao, Ziyang Wang, Xinming Zhang, Chenchi Zhang, Yuliang Yan, Junjun Zheng, Yu Li, Zhihong Chen, Jian Wu, Xiangheng Kong, Shengyu Zhang, Kun Kuang, Yuning Jiang, Bo Zheng
Huggingface Github ็ฅไน Wechat
- The first industrial dataset about generative retrieval with semantic identifiers, which contains 14 billion user interactions and multimodal features of 250 million items sampled from Taobao.
- Subsequent proposed optimizations of data modality and ID collisions are validated with both offline (15\% improvements on HitRate) and online (0.35% improvements on transaction count) experiments in the โGuess You Likeโ Section of Taobao.

Kairui Fu*, Changfa Wu*, Kun Yuan, Binbin Cao, Dunxian Huang, Yuliang Yan, Junjun Zheng, Jianning Zhang, Silu Zhou, Jian Wu, Kun Kuang
- A rank-enhanced generative retrieval framework for recommendation. Equipped with the listwise direct preference optimization in IAP and the extra scoring module in RSP, RankGR breaks the limitations of existing methods, particularly their inability to capture hierarchy preference and rich interactions between SIDs and user sequences
- Deployed on Taobaoโs โGuess You Likeโ section, RankGR achieved a 1.08% increase in online item page views and captured 49.88% of total exposures.

PI2I: A Personalized Item-Based Collaborative Filtering Retrieval Framework
Shaoqing Wang*, Yingcai Ma*, Kairui Fu*, Ziyang Wang, Dunxian Huang, YuliangYan, Jian Wu
- A novel two-stage retrieval framework that enhances the personalization capabilities of traditional collaborative filtering.
- Deployed on Taobaoโs โGuess You Likeโ section, PI2I achieved a 1.05% increase in online transaction rates.

ThinkRec: Thinking-based recommendation via LLM
Qihang Yu*, Kairui Fu*, Shengyu Zhang, Zheqi Lv, Fan Wu, Fei Wu
- Almost the first emphasizes the importance of activating the thinking of LLMs to make recommendations more interpretable and effective.

Tianyu Zhan*, Kairui Fu*, Zheqi Lv, Shengyu Zhang
- An effective strategy to selectively prunes less informative tokens in the input sequence for semantic identifiers based recommendation.

MALLOC: Benchmarking the Memory-aware Long Sequence Compression for Large Sequential Recommendation
Qihang Yu*, Kairui Fu*, Zhaocheng Du*, Yuxuan Si, Kaiyuan Li, Weihao Zhao, Zhicheng Zhang, Jieming Zhu, Quanyu Dai, Zhenhua Dong, Shengyu Zhang, Kun Kuang, Fei Wu
- A benchmark that establishes a rigorous multi-dimensional evaluation protocol that couples standard ranking metrics with system-level constraints for long-sequence compression in large recommender systems.

Tianqi Liu*, Kairui Fu*, Shengyu Zhang, Wenyan Fan, Zhaocheng Du, Jieming Zhu, Fan Wu, Fei Wu
- A framework for device-cloud collaborative personalized mixed-precision quantization that generates lightweight networks for heterogeneous mobile devices.

Tackling Device Data Distribution Real-time Shift via Prototype-based Parameter Editing
Zheqi Lv, Wenqiao Zhang, Kairui Fu, Qi Tian, Shengyu Zhang, Jiajie Su, Jingyuan Chen, Kun Kuang, Fei Wu
- The composition of coarse and fine-grained intersts for tackling the on-device continuous data distribution shift in both vision and recommendation tasks.

Kairui Fu, Zheqi Lv, Shengyu Zhang, Fan Wu, Kun Kuang
- An early attempt to investigate the joint customization of both structure and parameters, analyzing the challenges of interest heterogeneity, network transmission, and on-device inference simultaneously.

MergeNet: Knowledge Migration across Heterogeneous Models, Tasks, and Modalities
Kunxi Li*, Tianyu Zhan*, Kairui Fu*, Shengyu Zhang, Kun Kuang, Jiwei Li, Zhou Zhao, Fan Wu, Fei Wu
- Leverage parameters as the medium to achieve knowledge transfer between heterogeneous models, tasks, and modalities.

DIET: Customized Slimming for Incompatible Networks in Sequential Recommendation
Kairui Fu, Shengyu Zhang, Zheqi Lv, Jingyuan Chen, Jiwei Li
- Tackle both the parameter personalization and the communication efficiency under strict device constraints in device-cloud collaborative recommendation.

Kairui Fu, Qiaowei Miao, Shengyu Zhang, Kun Kuang, Fei Wu
- Investigate the inconsistent distribution of users in recommender system and the difficulty in causal structure learning accompanied by the intervention of recommender system.
๐ Honors and Awards
- 2026.03 Outstanding Graduate of Zhejiang Province
- 2025.10 National Scholarship (Top 1%)
- 2024.12 Huawei Jingying Scholarship (Top 1%)
- 2023.7 Best Paper Award in CICAI 2023 (Top 1)
- 2023.6 Outstanding Graduates of Zhejiang University
- 2022.10 Scholarship of Zhejiang University
- 2021.10 Scholarship of Zhejiang University
- 2020.10 Scholarship of Zhejiang University
๐ Educations
- 2023.09 - 2026.03, Master, Computer Science and Technology, Zhejiang University, Hangzhou.
- 2019.09 - 2023.06, Undergraduate, Turing Class(Chu Kochen Honors College), Computer Science and Technology, Zhejiang University, Hangzhou
๐ป Internships
- 2025.02 - 2025.05, Huawei Noahโs Ark Lab, China.
- 2025.05 - 2026.03, Taobao & Tmall Group of Alibaba, China.