Working Notes on Late Interaction Dynamics: Analyzing Targeted Behaviors of Late Interaction Models
Abstract
Late Interaction retrieval models exhibit length bias in multi-vector scoring and efficient similarity exploitation through MaxSim operator, as demonstrated on NanoBEIR benchmark.
While Late Interaction models exhibit strong retrieval performance, many of their underlying dynamics remain understudied, potentially hiding performance bottlenecks. In this work, we focus on two topics in Late Interaction retrieval: a length bias that arises when using multi-vector scoring, and the similarity distribution beyond the best scores pooled by the MaxSim operator. We analyze these behaviors for state-of-the-art models on the NanoBEIR benchmark. Results show that while the theoretical length bias of causal Late Interaction models holds in practice, bi-directional models can also suffer from it in extreme cases. We also note that no significant similarity trend lies beyond the top-1 document token, validating that the MaxSim operator efficiently exploits the token-level similarity scores.
Community
We link the poster as a PNG file 🤗 We were very glad to present it at the 1st Late Interaction Workshop at ECIR 2026!
Get this paper in your agent:
hf papers read 2603.26259 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper