pip install goose-extractor==1.0.25

Html Content / Article Extractor, web scrapping

Source
Among top 50% packages on PyPI.
Over 3.6K downloads in the last 90 days.

Commonly used with goose-extractor

Based on how often these packages appear together in public requirements.txt files on GitHub.

kipp

Python Utils

jieba

Chinese Words Segmentation Utilities

overwatch-api

Overwatch API Wrapper using lootbox.eu

wikitextparser

A simple parsing tool for MediaWiki's wikitext markup.

scrapy-mongodb

Pipeline to MongoDB for Scrapy. Supports MongoDB replica sets

translate

This is a simple, yet powerful command line translator with google translate behind it. You can also use it as a Python module in your code.

Shosetsu

Python 3 Aiohttp VNDB Scraper

boilerpipe

Python interface to Boilerpipe, Boilerplate Removal and Fulltext Extraction from HTML pages

cssselect

cssselect parses CSS3 Selectors and translates them to XPath 1.0

sumy

Module for automatic summarization of text documents and HTML pages.

steam

Module for interacting with various Steam features

newspaper

Simplified python article discovery & extraction.

Scaffold

Simple project scaffolding for Python

naiveBayesClassifier

yet another general purpose naive bayesian classifier

tldextract

Accurately separate the TLD from the registered domain and subdomains of a URL, using the Public Suffix List. By default, this includes the public ICANN TLDs and their exceptions. You can optionally support the Public Suffix List's private domains as well.

python-twitch

Library for interaction with the videogame streaming platform twitch

django-socialnetworks

Extends Django with “log in” and “share” functionalities for the most common social networks.

PyFlickrStreamr

PyFlickrStreamr provides a continuous, blocking python interface for streaming Flickr photos in near real-time. It is a wrapper around the Flickr photos.getRecent API.

urlunshort

Tools for detecting and expanding shortened URLs.

Version usage of goose-extractor

Proportion of downloaded versions in the last 3 months (only versions over 1%).

1.0.25

39.34%

1.0.24

3.90%

1.0.23

3.81%

1.0.22

3.45%

1.0.8

3.42%

1.0.20

3.37%

1.0.12

3.37%

1.0.21

3.37%

1.0.11

3.34%

1.0.19

3.31%

1.0.1

3.31%

1.0.15

3.31%

1.0.13

3.28%

1.0.14

3.28%

1.0.9

3.28%

1.0.17

3.26%

1.0.6

3.20%

1.0.7

3.20%

1.0.2

3.20%