pip install jieba==0.42.1

Chinese Words Segmentation Utilities

Source
Among top 1% packages on PyPI.
Over 1.4M downloads in the last 90 days.

Commonly used with jieba

Based on how often these packages appear together in public requirements.txt files on GitHub.

kipp

Python Utils

newspaper

Simplified python article discovery & extraction.

tldextract

Accurately separate the TLD from the registered domain and subdomains of a URL, using the Public Suffix List. By default, this includes the public ICANN TLDs and their exceptions. You can optionally support the Public Suffix List's private domains as well.

feedfinder2

Find the feed URLs for a website.

goose-extractor

Html Content / Article Extractor, web scrapping

newspaper3k

Simplified python article discovery & extraction.

jieba3k

Chinese Words Segementation Utilities

Pysolar

Collection of Python libraries for simulating the irradiation of any point on earth by the sun

readability-lxml

fast html to text parser (article readability tool) with python 3 support

feedparser

Universal feed parser, handles RSS 0.9x, RSS 1.0, RSS 2.0, CDF, Atom 0.3, and Atom 1.0 feeds

defcon

A set of flexible objects for representing UFO data.

nltk

Natural Language Toolkit

ImageHash

Image Hashing library

cleverbot

An unofficial library to access the Cleverbot service

rarfile

RAR archive reader for Python

Pattern

Web mining module for Python.

HTMLParser

Backport of HTMLParser from python 2.7

edt

Multi-Label Anisotropic Euclidean Distance Transform 3D

translate

This is a simple, yet powerful command line translator with google translate behind it. You can also use it as a Python module in your code.

Version usage of jieba

Proportion of downloaded versions in the last 3 months (only versions over 1%).

0.42.1

90.79%

0.39

7.74%