pip install dedupe==2.0.8
A python library for accurate and scaleable data deduplication and entity-resolution
SourceAmong top 1% packages on PyPI.
Over 633.4K downloads in the last 90 days.
dedupe
Based on how often these packages appear together in public
requirements.txt
files on GitHub.
Case weighted L2 regularized logistic regression |
|
Compare two categorical variables |
|
A Cython implementation of the affine gap string distance |
|
LBFGS and OWL-QN optimization algorithms |
|
Hierarchical Clustering Algorithms (Information Theory) |
|
Simple cosine distance |
|
canonicalize a cluster of records |
|
Address variable type for dedupe |
|
Name variable type for dedupe |
|
Hidden alignment conditional random field, a discriminative string edit distance |
|
Learnable Edit Distance Using PyHacrf |
|
Structured variable type for dedupe |
|
Command line tools for deduplicating and merging csv files |
|
A tiny utility to get application version from pkg_resouces |
|
A hierarchical clustering package for Scipy. |
|
GeoServer REST Configuration |
|
grabber: periodically grabs a picture of your screen |
|
Vulk: Advanced 3D engine |
|
Advanced Recording Format for acoustic, behavioral, and physiological data |
dedupe
Proportion of downloaded versions in the last 3 months (only versions over 1%).
2.0.8 |
50.28% |
1.10.0 |
33.03% |