pip install textract==1.6.4

Extract text from any document. No muss. No fuss.
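
A minimal usage sketch, assuming textract is installed with the command above, along with whatever system tools its parsers need for the given file type. `textract.process` is the library's entry point and returns bytes. The `extract_text` helper and the file name in the note below are illustrative and not part of the package.

```python
def extract_text(path, encoding="utf-8"):
    """Return the text content of a document as a str.

    Hypothetical helper: wraps textract.process, which picks a parser
    from the file extension and returns the extracted text as bytes.
    """
    # Imported lazily so this module still loads where textract is absent.
    import textract

    raw = textract.process(str(path))
    return raw.decode(encoding)
```

Calling `extract_text("report.docx")` would return the document's text as a string, assuming that file exists and its format is supported.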

Among the top 2% of packages on PyPI.
Over 211.2K downloads in the last 90 days.

Commonly used with textract

Based on how often these packages appear together in public requirements.txt files on GitHub.

python-pptx

Generate and manipulate Open XML PowerPoint (.pptx) files

EbookLib

Ebook library which can handle EPUB2/EPUB3 and Kindle format

docx2txt

A pure Python utility to extract text and images from docx files.

defcon

A set of flexible objects for representing UFO data.

python-docx

Create and update Microsoft Word .docx files.

strawpoll.py

A Python wrapper for the Strawpoll API.

win10toast

An easy-to-use Python library for displaying Windows 10 Toast Notifications

strawpy

Strawpy is a Python wrapper for the Strawpoll API.

python-wowapi

Python-wowapi is a client library for the World of Warcraft Data and Profile APIs.

pointfree

Pointfree style toolkit for Python

functionally

Simple & extensive functional programming library

font-v

Font version reporting and modification tool

opentype-sanitizer

Python wrapper for the OpenType Sanitizer

ufolint

UFO source file linter

ttfautohint-py

Python wrapper for ttfautohint, a free auto-hinter for TrueType fonts

FBRank

A command-line tool that helps you visualize league rank and other information

pyzmail36

Python easy mail library, to parse, compose and send emails

user_agent

User-Agent generator

docx-mailmerge

Performs a Mail Merge on docx (Microsoft Office Word) files

Version usage of textract

Proportion of downloaded versions in the last 3 months (only versions over 1%).

Version   Share of downloads
1.6.3     54.27%
1.6.4     37.57%
1.5.0     3.95%
1.6.1     2.13%
1.6.0     1.55%
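
The listed shares do not add up to 100% because versions under 1% are left out. A quick check of the arithmetic, using only the figures above (the variable names are illustrative):

```python
# Download shares in percent, copied from the version list above.
shares = {
    "1.6.3": 54.27,
    "1.6.4": 37.57,
    "1.5.0": 3.95,
    "1.6.1": 2.13,
    "1.6.0": 1.55,
}

# Total covered by the listed versions, rounded to two decimals.
listed_total = round(sum(shares.values()), 2)

# Whatever is left belongs to versions that each fall under 1%.
unlisted = round(100 - listed_total, 2)

print(f"listed versions: {listed_total}%")
print(f"versions under 1% each: {unlisted}%")
```

The five listed versions cover 99.47% of downloads, leaving 0.53% spread across versions too small to appear.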