pip install scraper==0.1.0

Configurable Python Web Scraper

Source
Among top 50% packages on PyPI.
Over 2.9K downloads in the last 90 days.

Commonly used with scraper

Based on how often these packages appear together in public requirements.txt files on GitHub.

queuelib

Collection of persistent (disk-based) queues

w3lib

Library of web-related functions

Scrapy

A high-level Web Crawling and Web Scraping framework

PyDispatcher

Multi-producer-multi-consumer signal dispatching mechanism

parsel

Parsel is a library to extract data from HTML and XML using XPath and CSS selectors

Twisted

An asynchronous networking framework written in Python

Automat

Self-service finite-state machines for the programmer on the go.

constantly

Symbolic constants in Python

incremental

None

scrapyd

A service for running Scrapy spiders, with an HTTP API

hyperlink

A featureful, immutable, and correct URL for Python.

characteristic

Python attributes without boilerplate.

scrapy-djangoitem

Scrapy extension to write scraped items using Django models

filepath

Object-oriented filesystem path representation.

scrapyd-client

A client for scrapyd

Protego

Pure-Python robots.txt parser with support for modern conventions

extruct

Extract embedded metadata from HTML markup

scrapy-mongodb

Pipeline to MongoDB for Scrapy. Supports MongoDB replica sets

html2markdown

Conservatively convert html to markdown

Version usage of scraper

Proportion of downloaded versions in the last 3 months (only versions over 1%).

0.1.0

100.00%