SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.
12 similar packages · ranked by shared classifiers & health score
This package contains the AI models used by the Docling PDF conversion package
A library for efficient similarity search and clustering of dense vectors.
spaCy pipelines for pre-trained BERT and other transformers
Docling LangChain integration
Bayesian networks and other Probabilistic Graphical Models.
Industrial-strength Natural Language Processing (NLP) in Python
A complete web automation framework for end-to-end testing.
Google OR-Tools Python libraries and modules
The sweetest config system for Python
Simple package to extract text with coordinates from programmatic PDFs
CUDNN FrontEnd Python library
Python VISA bindings for GPIB, RS232, TCPIP and USB instruments