A LLM serving engine extension to reduce TTFT and increase throughput, especially under long-context scenarios.
No known vulnerabilities in the latest version
Based on latest version 0.4.2. If you're running an older version, check OSV for your specific version.
[](/packages/lmcache)
<a href="/packages/lmcache"><img src="/api/badges/lmcache?period=month" alt="PyPI Stats"></a>