Skip to main navigation Skip to search Skip to main content

CoMix: Collaborative filtering with mixup for implicit datasets

  • Jaewan Moon
  • , Yoonki Jeong
  • , Dong Kyu Chae
  • , Jaeho Choi
  • , Hyunjung Shim
  • , Jongwuk Lee
  • Sungkyunkwan University
  • NAVER Corporation
  • Hanyang University
  • Korea Advanced Institute of Science and Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Collaborative filtering (CF) is the prevalent solution to mitigate massive information overload in modern recommender systems. However, it usually suffers from data sparsity and popularity bias problems. Existing studies exploit auxiliary information or data augmentation, requiring additional data collection or expensive computational overheads. Inspired by Mixup used in the classification problem, we propose a simple-yet-effective data augmentation method for CF, namely Collaborative Mixup (CoMix), for implicit feedback datasets. The underlying idea of CoMix is to generate virtual users/items by logically combining random users/items. Unlike the original Mixup, we synthesize virtual users/items to complement weak collaborative signals by distinguishing the intersection and non-overlapping parts between two users/items. Despite its simplicity, extensive experimental results show that various CF models equipped with CoMix consistently improve the base models on four benchmark datasets.

Original languageEnglish
Pages (from-to)254-268
Number of pages15
JournalInformation Sciences
Volume628
DOIs
StatePublished - May 2023

Fingerprint

Dive into the research topics of 'CoMix: Collaborative filtering with mixup for implicit datasets'. Together they form a unique fingerprint.

Cite this