pip install boilerpipe==1.2.0.0

Python interface to Boilerpipe, Boilerplate Removal and Fulltext Extraction from HTML pages

Source
Among fairly niche packages on PyPI.
Over 443 downloads in the last 90 days.

Commonly used with boilerpipe

Based on how often these packages appear together in public requirements.txt files on GitHub.

twitter-text-py

A library for auto-converting URLs, mentions, hashtags, lists, etc. in Twitter text. Also does tweet validation and search term highlighting.

cluster

None

ipythonblocks

Practice Python with colored grids in the IPython Notebook

goose-extractor

Html Content / Article Extractor, web scrapping

notifiers

The easy way to send notifications

rpy2

Python interface to the R language (embedded R)

pygraphviz

Python interface to Graphviz

tldextract

Accurately separate the TLD from the registered domain and subdomains of a URL, using the Public Suffix List. By default, this includes the public ICANN TLDs and their exceptions. You can optionally support the Public Suffix List's private domains as well.

polyaxon

Command Line Interface (CLI) and client to interact with Polyaxon API.

datascience

A Jupyter notebook Python library for introductory data science

scrapy-mongodb

Pipeline to MongoDB for Scrapy. Supports MongoDB replica sets

nltk

Natural Language Toolkit

python-bencode

bencode for humans

newspaper

Simplified python article discovery & extraction.

jieba

Chinese Words Segmentation Utilities

PySimpleSOAP

Python Simple SOAP Library (sable branch)

dota2py

Python tools for Dota 2

reldi

Python library for the ReLDI API

python-rdm

Relational data mining in python

Version usage of boilerpipe

Proportion of downloaded versions in the last 3 months (only versions over 1%).

1.2.0.0

100.00%