A library for efficient similarity search and clustering of dense vectors.
12 similar packages · ranked by shared classifiers & health score · View faiss-cpu →
Bayesian networks and other Probabilistic Graphical Models.
Google OR-Tools python libraries and modules
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.
spaCy pipelines for pre-trained BERT and other transformers
Simple package to extract text with coordinates from programmatic PDFs
NVIDIA cuDNN Frontend — Python and C++ Graph API with SOTA attention (SDPA / Flash Attention), MoE grouped GEMM fusions, and FP8/MXFP8 kernels for Hopper and Bl
This package contains the AI models used by the Docling PDF conversion package
A Python wrapper for libjpeg, with a focus on use as a plugin for for pylibjpeg
Docling LangChain integration
Industrial-strength Natural Language Processing (NLP) in Python
SeleniumBase is a framework for web crawling, scraping, and testing. Supports pytest. CDP Mode adds stealth. Includes many tools.
Read and write HDF5 files from Python