PyPI StatsPyPI Stats
DiscoverCompareTrendingDashboard
PyPI Stats — Download analytics for Python packages
APIGitHubPrivacyTerms

Sign in to subscribe to our weekly trending newsletter.

Download data sourced from BigQuery (Google). Counts may include CI/CD and mirror traffic.
Inspired by and built upon the work of pypistats.org
Discover/Text Processing & NLP
📝

Best Python Text Processing & NLP Libraries

Libraries for parsing text, natural language processing, and linguistic analysis.

24 packages · ranked by health score & downloads

Trending this week

↑ lxml↑ regex
#1charset-normalizerv3.4.7
79

The Real First Universal Charset Detector. Open, modern and actively maintained alternative to Chardet.

1.1B/moMIT★ 776
#2Jinja2v3.1.6
78

A very fast and expressive template engine.

494.6M/moBSD License★ 11.7K
#3Jinja2v3.1.6
78

A very fast and expressive template engine.

494.6M/moBSD License★ 11.7K
#4MarkupSafev3.0.3
74

Safely add untrusted strings to HTML/XML markup.

544.7M/moBSD-3-Clause★ 690
#5MarkupSafev3.0.3
74

Safely add untrusted strings to HTML/XML markup.

544.7M/moBSD-3-Clause★ 691
#6lxmlv6.1.1↑
86

Powerful and Pythonic XML processing library combining libxml2/libxslt with the ElementTree API.

275.5M/moBSD-3-Clause★ 3.0K
#7pyparsingv3.3.2
82

pyparsing - Classes and methods to define and execute parsing grammars

301.4M/moMIT★ 2.5K
#8beautifulsoup4v4.15.0
81
← All categoriesSearch text processing & nlp packages →

Screen-scraping library

304.9M/moMIT License
#9regexv2026.6.28↑
77

Alternative regular expression module, to replace re.

291.7M/moApache-2.0 AND CNRI-Python★ 590
#10chardetv7.4.3
84

Universal character encoding detector

148.3M/mo0BSD★ 2.6K
#11docutilsv0.23
81

Docutils -- Python Documentation Utilities

165.2M/moBSD License

19 more packages

Sign in for free to see the full list

Sign in free