Generative retrieval (GR) ranks documents by autoregressively generating document identifiers. Trie-constrained beam search, used by many GR methods, is susceptible to early pruning of relevant prefixes. Planning ahead in generative retrieval (PAG) mitigates such prefix pruning by using simultaneous decoding to compute a document-level look-ahead prior that guides subsequent sequential decoding. We reproduce PAG at inference time and stress-test its decoding behavior. Using the authors' released checkpoint and identifier/trie artifacts in the reported decoding setup, we reproduce the main effectiveness results on MS~MARCO Dev and TREC-DL 2019/2020 and corroborate the reported beam-size and latency trade-offs on our hardware. Beyond reproduction, we introduce plan-drift diagnostics that quantify how intent-preserving query variations, including misspellings, reordering, synonym substitutions, paraphrases, naturality shifts, and translation-based variants, change the planner's top-n candidate set and highest-weight tokens. We find that the planning signal is brittle: intent-preserving typos cause ``plan collapse,'' where the look-ahead bonus effectively vanishes and the model reverts to a weaker unguided search. We further evaluate cross-lingual robustness by querying a fixed English index with non-English \textsc{mMARCO} inputs, and assess inference-time mitigations and query-side adaptation that require no re-indexing. Overall, we confirm PAG's reported effectiveness and the benefit of planning-guided decoding, while showing that the planner's sparse token-level scoring mechanism is sensitive to query surface-form variation, a robustness aspect not systematically evaluated in the original work.