CVE-2025-46570

vLLM is an inference and serving engine for large language models (LLMs). Prior to version 0.9.0, when a new prompt is processed, if the PageAttention mechanism finds a matching prefix chunk, the prefill process speeds up, which is reflected in the TTFT (Time to First Token). These timing differences caused by matching chunks are significant enough to be recognized and exploited. This issue has been patched in version 0.9.0.
CVSS

No CVSS.

Configurations

Configuration 1 (hide)

cpe:2.3:a:vllm:vllm:*:*:*:*:*:*:*:*

History

24 Jun 2025, 18:25

Type Values Removed Values Added
First Time Vllm vllm
Vllm
CWE CWE-203
CPE cpe:2.3:a:vllm:vllm:*:*:*:*:*:*:*:*
References () https://github.com/vllm-project/vllm/pull/17045 - () https://github.com/vllm-project/vllm/pull/17045 - Issue Tracking, Vendor Advisory
References () https://github.com/vllm-project/vllm/commit/77073c77bc2006eb80ea6d5128f076f5e6c6f54f - () https://github.com/vllm-project/vllm/commit/77073c77bc2006eb80ea6d5128f076f5e6c6f54f - Patch
References () https://github.com/vllm-project/vllm/security/advisories/GHSA-4qjh-9fv9-r85r - () https://github.com/vllm-project/vllm/security/advisories/GHSA-4qjh-9fv9-r85r - Vendor Advisory

29 May 2025, 17:15

Type Values Removed Values Added
New CVE

Information

Published : 2025-05-29 17:15

Updated : 2025-06-24 18:25


NVD link : CVE-2025-46570

Mitre link : CVE-2025-46570


JSON object : View

Products Affected

vllm

  • vllm
CWE
CWE-203

Observable Discrepancy

CWE-208

Observable Timing Discrepancy