pip install wikiextractor==3.0.6

A tool for extracting plain text from Wikipedia dumps

Source
Among top 50% packages on PyPI.
Over 3.6K downloads in the last 90 days.

Commonly used with wikiextractor

Based on how often these packages appear together in public requirements.txt files on GitHub.

wikiextractor

scripts for parsing the wikimedia xml dumps files

nrepl-python-client

A Python client for the nREPL Clojure networked-REPL server.

tensorflow-estimator

TensorFlow Estimator.

dm-reverb

Reverb is an efficient and easy-to-use data storage and transport system designed for machine learning research.

sentencepiece

SentencePiece python wrapper

bert-tensorflow

BERT

gast

Python AST that abstracts the underlying Python version

google-pasta

pasta is an AST-based Python refactoring library

tensorboard

TensorBoard lets you watch Tensors Flow

syntok

sentence segmentation and word tokenization toolkit

Keras-Preprocessing

Easy data preprocessing and data augmentation for deep learning models

ml-collections

ML Collections is a library of Python collections designed for ML usecases.

astor

Read/rewrite/write Python ASTs

tensorflow-hub

TensorFlow Hub is a library to foster the publication, discovery, and consumption of reusable parts of machine learning models.

autocolorize

Automatic colorizaton of grayscale images using Deep Learning.

vttLib

Compile Visual TrueType assembly with FontTools.

pytorch-nlp

Text utilities and datasets for PyTorch

Keras-Applications

Reference implementations of popular deep learning models

pydgraph

Official Dgraph client implementation for Python

Version usage of wikiextractor

Proportion of downloaded versions in the last 3 months (only versions over 1%).

3.0.4

56.78%

3.0.6

30.07%

0.1

7.90%

3.0.0

5.25%