Understanding Embedding Scaling in Collaborative Filtering, Domain Knowledge Acquisition for LLMs via RL, and More!
Vol.123 for Sep 22 - Sep 28, 2025
Stay Ahead of the Curve with the Latest Advancements and Discoveries in Information Retrieval.
This week’s newsletter highlights the following research:
Scalable Multimodal Retrieval Through Flexible Late Interaction and Test-Time Budget Control, from Meta
How Interaction Noise Shapes Embedding Scalability in Collaborative Filtering Models, from He et al.
Embedding Domain Expertise in LLMs through Reinforcement Learning from Augmented Generation, from Nie et al.
EmbeddingGemma: Powerful and Lightweight Text Representations, from Google DeepMind
Bringing Context Engineering and Reasoning to Industrial Cascade Ranking Systems, from Shopee
The Role of Vocabularies in Learning Sparse Representations for Ranking, from Naver
Improving Dual Encoders for Multi-Level Document Retrieval, from Google DeepMind
A Production-Ready Generative Framework for Unified Search and Recommendation, from Alibaba
A Unified PyTorch Framework for Large-Scale Sparse-Dense Recommendation Training, from Alibaba
Treating Reranking as Noise Reduction in Multi-Stage Recommender Systems, from Kuaishou
Keep reading with a 7-day free trial
Subscribe to Top Information Retrieval Papers of the Week to keep reading this post and get 7 days of free access to the full post archives.

