AI Tax: Performance Analysis of AI Inference Serving 


Vol. 53, No. 1, pp. 8-14, Jan. 2026
10.5626/JOK.2026.53.1.8



  Abstract

With the rapid advancement of artificial intelligence (AI), smart applications powered by compute- and memory-intensive AI models now make up a significant portion of modern datacenter workloads. To meet the growing demands of these workloads, datacenters increasingly deploy specialized accelerators to improve AI inference efficiency. However, most previous studies on AI inference acceleration have focused on the performance of neural network computations in isolation. Beyond these computations, an AI inference server typically handles other essential infrastructure tasks, such as web serving to receive inference requests and send responses, as well as application-specific pre- and post-processing. In this paper, we refer to these additional operations as the AI Tax. We analyze the AI Tax in a representative modern AI inference server that runs various image classification models using NVIDIA's industry-standard AI serving software stack. Our findings reveal that the AI Tax can degrade end-to-end server performance by up to 55% compared to standalone neural network compute, and that it consumes an average of 25 CPU cores.
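
As a rough illustration of the degradation figure quoted in the abstract, the sketch below computes the relative loss in throughput when serving overheads are added on top of standalone neural network compute. The abstract does not state the exact metric definition, so the throughput-based formula, the function name `ai_tax_degradation`, and the input numbers are all assumptions made for illustration; they are not measurements from the paper.

```python
def ai_tax_degradation(standalone_throughput: float,
                       end_to_end_throughput: float) -> float:
    """Fraction of standalone throughput lost once serving overheads are included.

    Assumed definition (not taken from the paper):
        degradation = 1 - end_to_end / standalone
    """
    if standalone_throughput <= 0:
        raise ValueError("standalone_throughput must be positive")
    return 1.0 - end_to_end_throughput / standalone_throughput


# Hypothetical throughputs in inferences per second, chosen only so the
# arithmetic is easy to follow. They are not results from the paper.
standalone = 1000.0   # neural network compute measured in isolation
end_to_end = 450.0    # same model behind web serving plus pre/post-processing

degradation = ai_tax_degradation(standalone, end_to_end)
print(f"AI Tax degradation: {degradation:.0%}")
```

With these made-up inputs the server delivers 450 of a possible 1000 inferences per second, so the computed degradation is 55%. A latency-based definition would be equally plausible and would give different numbers.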




  Cite this article

[IEEE Style]

H. Jeong and J. Kim, "AI Tax: Performance Analysis of AI Inference Serving," Journal of KIISE, JOK, vol. 53, no. 1, pp. 8-14, 2026. DOI: 10.5626/JOK.2026.53.1.8.


[ACM Style]

Heetaek Jeong and Jangwoo Kim. 2026. AI Tax: Performance Analysis of AI Inference Serving. Journal of KIISE, JOK, 53, 1, (2026), 8-14. DOI: 10.5626/JOK.2026.53.1.8.


[KCI Style]

Heetaek Jeong, Jangwoo Kim, "AI Tax: Performance Analysis of AI Inference Serving," Journal of KIISE, Vol. 53, No. 1, pp. 8-14, 2026. DOI: 10.5626/JOK.2026.53.1.8.






Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal
