7 projects
ftfy
Fixes mojibake and other problems with Unicode, after the fact
langcodes
Tools for labeling human languages with IETF language tags
wordfreq
Look up the frequencies of words in many languages, based on many sources of data.
pack64
A library for representing floating point vectors in a compact, base64-like format
ordered-set
An OrderedSet is a custom MutableSet that remembers its order, so that every
langcodes-py2
Labels and compares human languages in a standardized way -- Python 2 backport
assoc_space
Computes association strength over semantic networks in a dimensionality-reduced form.