SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.
[](/packages/docling)
<a href="/packages/docling"><img src="/api/badges/docling?period=month" alt="PyPI Stats"></a>