Previously, I received both my M.S. and B.S. degrees from the College of Computer Science and Technology at Zhejiang University, under the supervision of Prof. Kun Kuang and Shengyu Zhang. Additionally, I was fortunate to conduct research on generative retrieval of recommendation at the Taobao & Tmall Group, Alibaba.

My research focuses on the generalizability and personalization of recommender systems. Recently, I am interested in both recommendations with large language models and the efficient inference of large recommenders when facing much longer user interaction sequences. Meanwhile, I also tackle the distinctive challenges involved in integrating heterogeneous models across diverse computational environments like mobile devices and cloud servers in a seamless and effective manner.

๐Ÿ”ฅ News

  • 2026.05: ย ๐ŸŽ‰๐ŸŽ‰ One paper has been accepted to KDD 2026.
  • 2026.02: ย ๐Ÿ“ฐ๐Ÿ“ฐ One paper has been deployed in Taobao and is available on Arxiv about rank-enhanced generative retrieval with list-wise DPO.
  • 2026.01: ย ๐ŸŽ‰๐ŸŽ‰ Three papers have been accepted to the research/industry/short paper track of TheWebConf 2026.
  • 2026.01: ย ๐Ÿ“ฐ๐Ÿ“ฐ One paper has been available on Arxiv about the efficient inference of large sequential recommendation.
  • 2025.09: ย ๐Ÿ“ฐ๐Ÿ“ฐ One paper has been deployed in Taobao and is available on Arxiv about generative retrieval with semantic identifiers.
  • 2025.07: ย ๐ŸŽ‰๐ŸŽ‰ Two papers have been accepted to MM 2025.
  • 2025.07: ย ๐Ÿ“Š๐Ÿ“Š Forward-OFA has been deployed in the Ascend Community of Huawei using NPU.
  • 2024.12: ย ๐ŸŽ‰๐ŸŽ‰ One paper has been accepted to AAAI 2025.
  • 2024.11: ย ๐ŸŽ‰๐ŸŽ‰ One paper has been accepted to KDD 2025.
  • 2024.08: ย ๐Ÿฅณ๐Ÿฅณ I went to Barcelona, Spain, to attend the KDD conference to deliver an oral presentation of our paper DIET.
  • 2024.05: ย ๐ŸŽ‰๐ŸŽ‰ One paper has been accepted to KDD 2024.
  • 2023.07: ย ๐Ÿฅณ๐Ÿฅณ I went to Fuzhou, China, to attend the CICAI conference to deliver an oral presentation and won the Best Paper Award.
  • 2023.06: ย ๐ŸŽ‰๐ŸŽ‰ One paper has been accepted to CICAI 2023.

๐Ÿ“ Publications

* denote the authors contributed equally.

KDD 2026
sym

FORGE: Forming Semantic Identifiers for Generative Retrieval in Industrial Datasets

Kairui Fu, Tao Zhang, Shuwen Xiao, Ziyang Wang, Xinming Zhang, Chenchi Zhang, Yuliang Yan, Junjun Zheng, Yu Li, Zhihong Chen, Jian Wu, Xiangheng Kong, Shengyu Zhang, Kun Kuang, Yuning Jiang, Bo Zheng

Huggingface Github ็ŸฅไนŽ Wechat

  • The first industrial dataset about generative retrieval with semantic identifiers, which contains 14 billion user interactions and multimodal features of 250 million items sampled from Taobao.
  • Subsequent proposed optimizations of data modality and ID collisions are validated with both offline (15\% improvements on HitRate) and online (0.35% improvements on transaction count) experiments in the โ€œGuess You Likeโ€ Section of Taobao.
Arxiv
sym

RankGR: Rank-Enhanced Generative Retrieval with Listwise Direct Preference Optimization in Recommendation

Kairui Fu*, Changfa Wu*, Kun Yuan, Binbin Cao, Dunxian Huang, Yuliang Yan, Junjun Zheng, Jianning Zhang, Silu Zhou, Jian Wu, Kun Kuang

  • A rank-enhanced generative retrieval framework for recommendation. Equipped with the listwise direct preference optimization in IAP and the extra scoring module in RSP, RankGR breaks the limitations of existing methods, particularly their inability to capture hierarchy preference and rich interactions between SIDs and user sequences
  • Deployed on Taobaoโ€™s โ€œGuess You Likeโ€ section, RankGR achieved a 1.08% increase in online item page views and captured 49.88% of total exposures.
TheWebConf 2026
sym

PI2I: A Personalized Item-Based Collaborative Filtering Retrieval Framework

Shaoqing Wang*, Yingcai Ma*, Kairui Fu*, Ziyang Wang, Dunxian Huang, YuliangYan, Jian Wu

Huggingface

  • A novel two-stage retrieval framework that enhances the personalization capabilities of traditional collaborative filtering.
  • Deployed on Taobaoโ€™s โ€œGuess You Likeโ€ section, PI2I achieved a 1.05% increase in online transaction rates.
TheWebConf 2026
sym

ThinkRec: Thinking-based recommendation via LLM

Qihang Yu*, Kairui Fu*, Shengyu Zhang, Zheqi Lv, Fan Wu, Fei Wu

Project

  • Almost the first emphasizes the importance of activating the thinking of LLMs to make recommendations more interpretable and effective.
TheWebConf 2026
sym

RASTP: Representation-Aware Semantic Token Pruning for Generative Recommendation with Semantic Identifiers

Tianyu Zhan*, Kairui Fu*, Zheqi Lv, Shengyu Zhang

Project

  • An effective strategy to selectively prunes less informative tokens in the input sequence for semantic identifiers based recommendation.
Arxiv
sym

MALLOC: Benchmarking the Memory-aware Long Sequence Compression for Large Sequential Recommendation

Qihang Yu*, Kairui Fu*, Zhaocheng Du*, Yuxuan Si, Kaiyuan Li, Weihao Zhao, Zhicheng Zhang, Jieming Zhu, Quanyu Dai, Zhenhua Dong, Shengyu Zhang, Kun Kuang, Fei Wu

Project

  • A benchmark that establishes a rigorous multi-dimensional evaluation protocol that couples standard ranking metrics with system-level constraints for long-sequence compression in large recommender systems.
MM 2025
sym

CHORD: Customizing Hybrid-precision On-device Model for Sequential Recommendation with Device-cloud Collaboration

Tianqi Liu*, Kairui Fu*, Shengyu Zhang, Wenyan Fan, Zhaocheng Du, Jieming Zhu, Fan Wu, Fei Wu

  • A framework for device-cloud collaborative personalized mixed-precision quantization that generates lightweight networks for heterogeneous mobile devices.
MM 2025
sym

Tackling Device Data Distribution Real-time Shift via Prototype-based Parameter Editing

Zheqi Lv, Wenqiao Zhang, Kairui Fu, Qi Tian, Shengyu Zhang, Jiajie Su, Jingyuan Chen, Kun Kuang, Fei Wu

  • The composition of coarse and fine-grained intersts for tackling the on-device continuous data distribution shift in both vision and recommendation tasks.
KDD 2025
sym

Forward Once for All: Structural Parameterized Adaptation for Efficient Cloud-coordinated On-device Recommendation

Kairui Fu, Zheqi Lv, Shengyu Zhang, Fan Wu, Kun Kuang

Project

  • An early attempt to investigate the joint customization of both structure and parameters, analyzing the challenges of interest heterogeneity, network transmission, and on-device inference simultaneously.
AAAI 2025
sym

MergeNet: Knowledge Migration across Heterogeneous Models, Tasks, and Modalities

Kunxi Li*, Tianyu Zhan*, Kairui Fu*, Shengyu Zhang, Kun Kuang, Jiwei Li, Zhou Zhao, Fan Wu, Fei Wu

Project ็ŸฅไนŽ

  • Leverage parameters as the medium to achieve knowledge transfer between heterogeneous models, tasks, and modalities.
KDD 2024
sym

DIET: Customized Slimming for Incompatible Networks in Sequential Recommendation

Kairui Fu, Shengyu Zhang, Zheqi Lv, Jingyuan Chen, Jiwei Li

  • Tackle both the parameter personalization and the communication efficiency under strict device constraints in device-cloud collaborative recommendation.
CICAI 2023 Best Paper
sym

End-to-End Optimization of Quantization-Based Structure Learning and Interventional Next-Item Recommendation

Kairui Fu, Qiaowei Miao, Shengyu Zhang, Kun Kuang, Fei Wu

  • Investigate the inconsistent distribution of users in recommender system and the difficulty in causal structure learning accompanied by the intervention of recommender system.

๐ŸŽ– Honors and Awards

  • 2026.03 Outstanding Graduate of Zhejiang Province
  • 2025.10 National Scholarship (Top 1%)
  • 2024.12 Huawei Jingying Scholarship (Top 1%)
  • 2023.7 Best Paper Award in CICAI 2023 (Top 1)
  • 2023.6 Outstanding Graduates of Zhejiang University
  • 2022.10 Scholarship of Zhejiang University
  • 2021.10 Scholarship of Zhejiang University
  • 2020.10 Scholarship of Zhejiang University

๐Ÿ“– Educations

  • 2023.09 - 2026.03, Master, Computer Science and Technology, Zhejiang University, Hangzhou.
  • 2019.09 - 2023.06, Undergraduate, Turing Class(Chu Kochen Honors College), Computer Science and Technology, Zhejiang University, Hangzhou

๐Ÿ’ป Internships