I've been teaching and writing about data analysis and visualization using Python for a few years now, and an absence of teaching work during the events of this year has finally given me the time to put together a book. There are also articles about various aspects of data processing with the scientific Python stack (pandas/numpy/matplotlib/seaborn) and more to come.
I used thousands of New Yorker cartoon contest caption submissions to classify about 100 cartoons into nineteen categories, and trained a neural network to generate new captions for each category. Some of them are kind of funny. [more inside]
What does critical data science add to our understanding of sexual harassment in academia? [more inside]
Moderators of r/worldnews on reddit worked with me to test an idea: what are the effects of encouraging fact-checking on the response and spread of unreliable news? On average, messages encouraging fact-checking caused a 2x reduction in the reddit score of tabloid submissions, which likely influenced reddit's rankings. [more inside]