This is the new Mac desktop version of my Datasette application, which helps people explore SQLite databases and CSV files and install plugins to visualize them, clean them up and more. [more inside]
This is a link to a blog, which is itself a project that I have worked on, along with other teammates. It highlights work that I and my coworkers at U.S. Geological Survey Water Mission Area have been working on over the past few years. [more inside]
The Juris Lab is a collaborative empirical legal research blog covering a wide range of subjects, including judicial behavior, regulatory activity, computational linguistics, and litigation analytics. New posts most weekdays.
I've been teaching and writing about data analysis and visualization using Python for a few years now, and an absence of teaching work during the events of this year has finally given me the time to put together a book. There are also articles about various aspects of data processing with the scientific Python stack (pandas/numpy/matplotlib/seaborn) and more to come.
I used thousands of New Yorker cartoon contest caption submissions to classify about 100 cartoons into nineteen categories, and trained a neural network to generate new captions for each category. Some of them are kind of funny. [more inside]
What does critical data science add to our understanding of sexual harassment in academia? [more inside]
Moderators of r/worldnews on reddit worked with me to test an idea: what are the effects of encouraging fact-checking on the response and spread of unreliable news? On average, messages encouraging fact-checking caused a 2x reduction in the reddit score of tabloid submissions, which likely influenced reddit's rankings. [more inside]