TalkPython.fm #90 - Data Wrangling with Python
- Katharine on the web: http://kjamistan.com
- Katharine on twitter: @kjam
- Book: Data Wrangling with Python: Tips and Tools to Make Your Life Easier: http://amzn.to/2fGc0Cx
- Pycon 2016: How to Automate your Data Cleanup with Python: http://youtube.com/watch?v=gp-ngPV_ZX8
- Dedupe Python Library: https://github.com/datamade/dedupe
- probablepeople: https://github.com/datamade/probablepeople
- usaddress: https://github.com/datamade/usaddress
- jellyfish: https://github.com/jamesturk/jellyfish
- Fuzzywuzzy: https://github.com/seatgeek/fuzzywuzzy
- scrubadub: https://github.com/datascopeanalytics/scrubadub
- pint: http://pint.readthedocs.io
- arrow: https://github.com/crsmithdev/arrow
- pdftables.six: https://github.com/vnaydionov/pdftables
- Datacleaner: https://github.com/rhiever/datacleaner
- Parserator: https://github.com/datamade/parserator
- Gensim: http://radimrehurek.com/gensim
- Faker: https://github.com/joke2k/faker
- Dask: http://dask.pydata.org
- SpaCy: http://spacy.io
- Airflow: http://airflow.incubator.apache.org
- Luigi: http://luigi.readthedocs.io
- Hypothesis (testing): http://hypothesis.works