Improving Retrieval Models through Reinforcement Learning with Feedback 


Vol. 51, No. 10, pp. 900-907, Oct. 2024
10.5626/JOK.2024.51.10.900



  Abstract

Open-domain question answering solves problems by first retrieving supporting evidence through search. The retrieval model must supply appropriate evidence, because the quality of what it retrieves directly affects final answer performance. Information retrieval is also a function people use frequently in everyday life. Recognizing the importance of these problems, this paper aims to improve the performance of retrieval models. Just as Reinforcement Learning from Human Feedback (RLHF) has recently been used to adjust the outputs of decoder models, this study uses reinforcement learning to improve retrieval models. Specifically, we define two rewards: the loss of the answer model, and the similarity between the retrieved documents and the correct document. Using these rewards, we apply reinforcement learning to adjust the probability score of the top-ranked document in the retrieval model's probability distribution over documents. With this approach, we confirmed the generality of the reinforcement learning method and its potential for further performance improvements.
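
The abstract names two reward signals and an update to the top-ranked document's probability, but it does not state the exact algorithm. The following is a minimal sketch of one way such an update could look, assuming a REINFORCE-style policy gradient over a softmax of retrieval scores. The function names, the weights `alpha` and `beta`, the `baseline`, the learning rate `lr`, and the use of cosine similarity are all hypothetical choices made for illustration, not details taken from the paper.

```python
import math


def softmax(scores):
    """Turn raw retrieval scores into a probability distribution over documents."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cosine(u, v):
    """Cosine similarity between two embedding vectors (assumed similarity measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def combined_reward(reader_loss, top_doc_vec, gold_doc_vec, alpha=1.0, beta=1.0):
    """Combine the two rewards from the abstract (weights are hypothetical).

    A lower answer-model loss and a higher similarity to the correct
    document both increase the reward.
    """
    return -alpha * reader_loss + beta * cosine(top_doc_vec, gold_doc_vec)


def reinforce_step(scores, reward, baseline=0.0, lr=0.5):
    """One policy-gradient step on the scores, acting on the top-ranked document.

    Uses d log p_top / d score_j = 1[j == top] - p_j, scaled by the advantage.
    """
    probs = softmax(scores)
    top = max(range(len(scores)), key=lambda i: probs[i])
    advantage = reward - baseline
    return [
        s + lr * advantage * ((1.0 if j == top else 0.0) - probs[j])
        for j, s in enumerate(scores)
    ]


# Toy usage: three candidate documents with raw retrieval scores.
scores = [1.0, 0.5, 0.0]
reward = combined_reward(reader_loss=0.2, top_doc_vec=[1.0, 0.0], gold_doc_vec=[1.0, 0.0])
updated = reinforce_step(scores, reward)
```

In this sketch a positive advantage raises the top-ranked document's probability and a negative one lowers it, which matches the abstract's description of adjusting that document's probability score. A real system would backpropagate through the retriever's parameters rather than editing scores directly.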




  Cite this article

[IEEE Style]

M. Seo, J. Lim, T. Kim, H. Ryu, D. Chang, S. Na, "Improving Retrieval Models through Reinforcement Learning with Feedback," Journal of KIISE, JOK, vol. 51, no. 10, pp. 900-907, 2024. DOI: 10.5626/JOK.2024.51.10.900.


[ACM Style]

Min-Taek Seo, Joon-Ho Lim, Tae-Hyeong Kim, Hwi-Jung Ryu, Du-Seong Chang, and Seung-Hoon Na. 2024. Improving Retrieval Models through Reinforcement Learning with Feedback. Journal of KIISE, JOK, 51, 10, (2024), 900-907. DOI: 10.5626/JOK.2024.51.10.900.


[KCI Style]

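Min-Taek Seo, Joon-Ho Lim, Tae-Hyeong Kim, Hwi-Jung Ryu, Du-Seong Chang, Seung-Hoon Na, "Improving Retrieval Models through Reinforcement Learning with Feedback," Journal of KIISE, vol. 51, no. 10, pp. 900-907, 2024. DOI: 10.5626/JOK.2024.51.10.900.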









Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal

Editorial Office

  • Tel. +82-2-588-9240
  • Fax. +82-2-521-1352
  • E-mail. chwoo@kiise.or.kr