Assembling
fuzzy_join(), Joining
tables on non-normalized categories with approximate matching. Example
Joiner,
AggJoiner, transformers for joining multiple tables together. Example
Encoding
TableVectorizer:
turn a pandas dataframe into a numerical array for
machine learning. Example
GapEncoder,
OneHotEncoder but robust to typos or non-normalized categories. Example
Cleaning
deduplicate(): merge
categories of similar morphology (spelling). Example