pip install dedupe-hcluster==0.3.8

Hierarchical Clustering Algorithms (Information Theory)

Among the top 1% of packages on PyPI.
Over 381K downloads in the last 90 days.

Commonly used with dedupe-hcluster

Based on how often these packages appear together in public requirements.txt files on GitHub.
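As a rough illustration of what such a co-occurrence ranking could look like, the sketch below counts how often other packages appear in the same requirements.txt as a target package. It is an assumption about the general approach, not the method this page actually uses; the helper names (`parse_requirement_names`, `co_occurrence`) and the sample file contents are hypothetical, and the parsing is deliberately simplified.

```python
from collections import Counter


def parse_requirement_names(text):
    """Return the set of lower-cased package names found in a requirements.txt body.

    Simplified on purpose: comments, blank lines, and option lines
    (such as "-r other.txt") are skipped, and everything from the first
    version-specifier, extras, or marker character onward is dropped.
    """
    names = set()
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # strip comments
        if not line or line.startswith("-"):
            continue
        for i, ch in enumerate(line):
            if ch in "=<>!~[; ":
                line = line[:i]
                break
        if line:
            names.add(line.lower())
    return names


def co_occurrence(target, requirement_files):
    """Count packages that share a requirements file with `target`."""
    counts = Counter()
    for text in requirement_files:
        names = parse_requirement_names(text)
        if target in names:
            counts.update(names - {target})
    return counts


# Hypothetical sample data standing in for public requirements.txt files.
sample_files = [
    "dedupe-hcluster==0.3.8\naffinegap>=1.10\nrlr\n",
    "dedupe-hcluster\naffinegap\n# a comment line\n",
    "requests==2.0\nrlr\n",
]

ranking = co_occurrence("dedupe-hcluster", sample_files)
for name, count in ranking.most_common():
    print(name, count)
```

Files that do not list the target package contribute nothing to the counts, so a package that is popular in general does not rank highly unless it actually appears alongside the target.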

canonicalize

Canonicalize a cluster of records

PyLBFGS

LBFGS and OWL-QN optimization algorithms

affinegap

A Cython implementation of the affine gap string distance

categorical-distance

Compare two categorical variables

rlr

Case weighted L2 regularized logistic regression

pyhacrf

Hidden alignment conditional random field, a discriminative string edit distance

highered

Learnable Edit Distance Using PyHacrf

parseratorvariable

Structured variable type for dedupe

dedupe

A Python library for accurate and scalable data deduplication and entity resolution

dedupe-variable-address

Address variable type for dedupe

dedupe-variable-name

Name variable type for dedupe

pipe

Module enabling a sh-like infix syntax (using pipes)

semantic3

Common Natural Language Processing Tasks for Python

bpforms

Unambiguous representation of modified DNA, RNA, and proteins

nagioscheck

A Python framework for Nagios plug-in developers

nomenclate

A tool for generating strings based on a preset naming convention.

molnctrl

A simple Python API for Apache CloudStack

pypairix

Pypairix is a Python module for fast querying on a pairix-indexed bgzipped text file that contains a pair of genomic coordinates per line. For more information, see: https://github.com/4dn-dcic/pairix/blob/master/README.md.

Version usage of dedupe-hcluster

Share of downloads by version over the last 3 months (only versions above 1% are shown).

0.3.8

96.61%