Skip to main content

One post tagged with "dense-retrieval"

View All Tags

Recent Progress on Information Retrieval

Zeyu Yang
PhD student at Rice University

Reading across a selection of papers from ICLR 2026 (plus one arXiv preprint), a single argument emerges: 2026 information retrieval work is converging on a shared strategy of pushing cost off the query and online path. Whether the lever is data (Revela trains with far less of it), query-time compute (LightRetriever shrinks the query encoder), index footprint (MILCO and CSRv2 attack representation cost from opposite ends), or model size (the Transformer-SSM hybrid prunes attention to a retrieval-critical core), the recurring move is to relocate expense to where it is cheapest to absorb (offline indexing, fewer labels, or a smaller serving model) rather than to make retrieval itself fundamentally smarter.

The papers below navigate this shared goal from different angles. The table groups them by theme and names the tradeoff each one accepts in exchange for its efficiency gains.

ThemePapersWhere cost is movedTradeoff accepted
Cheaper supervisionRevelaFrom labeled pairs to an unsupervised language-modeling signalCaps out at unsupervised quality; no annotated relevance signal
Cheaper query-time computeLightRetrieverFrom the online query encoder to offline document encodingRoughly 5% quality gap; out-of-domain behavior less characterized
Cheaper sparse representationsCSRv2, MILCOCSRv2 to ultra-sparse dimensions; MILCO to a shared English lexical spaceCSRv2 degrades at the lowest sparsity; MILCO inherits English-pivot bias
Cheaper indexing unitsSPLARE, DISCoSPLARE to SAE latents (also yielding language-agnostic representations); DISCo to submodular subset coverageSPLARE depends on pretrained SAE quality; DISCo adds formulation complexity
Cheaper serving modelTransformer-SSM hybridFrom full attention to a retrieval-critical head subset plus SSMsValidated only at small scale and short context