Fine-tuning DistilBERT on senator tweets
A guide to fine-tuning DistilBERT on the tweets of American Senators with snscrape, SQLite, and Transformers (PyTorch) on Google Colab.
The ultimate reference for clean Pandas code
A curated collection of clean Pandas methods that I use to preprocess, investigate, aggregate, and analyze text data.
Making the jump from data analyst to data scientist in 2023
The skills and resources you need to transition from a data analyst to data scientist position.
Helping Crowdfight facilitate scientific collaborations
I led a team of five data scientists in collaboration with a team of data engineers in a six-month sprint to set up an automated data pipeline for Crowdfight, a non-profit dedicated to facilitating scientific collaborations, with Correlaid Netherlands.