pip install extruct==0.13.0

Extract embedded metadata from HTML markup

Source
Among top 2% packages on PyPI.
Over 186.7K downloads in the last 90 days.

Commonly used with extruct

Based on how often these packages appear together in public requirements.txt files on GitHub.

scraper

Configurable Python Web Scraper

queuelib

Collection of persistent (disk-based) queues

w3lib

Library of web-related functions

parsel

Parsel is a library to extract data from HTML and XML using XPath and CSS selectors

Scrapy

A high-level Web Crawling and Web Scraping framework

scrapyd

A service for running Scrapy spiders, with an HTTP API

PyDispatcher

Multi-producer-multi-consumer signal dispatching mechanism

characteristic

Python attributes without boilerplate.

scrapy-djangoitem

Scrapy extension to write scraped items using Django models

constantly

Symbolic constants in Python

incremental

None

Automat

Self-service finite-state machines for the programmer on the go.

Twisted

An asynchronous networking framework written in Python

filepath

Object-oriented filesystem path representation.

scrapyd-client

A client for scrapyd

hyperlink

A featureful, immutable, and correct URL for Python.

scrapy-mongodb

Pipeline to MongoDB for Scrapy. Supports MongoDB replica sets

axiom

An in-process object-relational database

epsilon

A set of utility modules used by Divmod projects

Version usage of extruct

Proportion of downloaded versions in the last 3 months (only versions over 1%).

0.13.0

43.58%

0.9.0

30.18%

0.12.0

17.86%

0.10.0

3.29%

0.11.0

1.18%

0.8.0

1.02%