Efficient Large Language Model Based Passage Re-Ranking Using Single Token Representations 


Vol. 52,  No. 5, pp. 395-403, May  2025
10.5626/JOK.2025.52.5.395


PDF

  Abstract

In information retrieval systems, document re-ranking involves reordering a set of candidate documents based on evaluation of their relevance to a given query. Leveraging extensive natural language understanding capabilities of large language models(LLMs), numerous studies on document re-ranking have been conducted, demonstrating groundbreaking performance. However, studies utilizing large language models focus solely on improving reranking performance, resulting in degraded efficiency due to excessively long input sequences and the need for repetitive inference. To address these limitations, we propose ListT5++, a novel model that represents the relevance between a query and a passage using single token embedding and significantly improves the efficiency of LLM-based reranking through a single-step decoding strategy that minimizes the decoding process. Experimental results showed that ListT5++ could maintain accuracy levels comparable to existing methods while reducing inference latency by a factor of 29.4 relative to the baseline. Moreover, our approach demonstrates robust characteristics by being insensitive to th initial ordering of candidate documents, thereby ensuring high practicality in real-time retrieval environments.


  Statistics
Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.


  Cite this article

[IEEE Style]

J. Na, J. Kwon, E. Choi, J. Lee, "Efficient Large Language Model Based Passage Re-Ranking Using Single Token Representations," Journal of KIISE, JOK, vol. 52, no. 5, pp. 395-403, 2025. DOI: 10.5626/JOK.2025.52.5.395.


[ACM Style]

Jeongwoo Na, Jun Kwon, Eunseong Choi, and Jongwuk Lee. 2025. Efficient Large Language Model Based Passage Re-Ranking Using Single Token Representations. Journal of KIISE, JOK, 52, 5, (2025), 395-403. DOI: 10.5626/JOK.2025.52.5.395.


[KCI Style]

나정우, 권준, 최은성, 이종욱, "단일 토큰 표현을 활용한 효율적인 거대 언어 모델 기반 문서 재순위화," 한국정보과학회 논문지, 제52권, 제5호, 395~403쪽, 2025. DOI: 10.5626/JOK.2025.52.5.395.


[Endnote/Zotero/Mendeley (RIS)]  Download


[BibTeX]  Download



Search




Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal

Editorial Office

  • Tel. +82-2-588-9240
  • Fax. +82-2-521-1352
  • E-mail. chwoo@kiise.or.kr