I used thousands of New Yorker cartoon contest caption submissions to classify about 100 cartoons into nineteen categories, and trained a neural network to generate new captions for each category. Some of them are kind of funny. [more inside]
What does critical data science add to our understanding of sexual harassment in academia? [more inside]
Just finished building a content recommendation engine for MeFi using natural language processing and non-negative matrix factorization techniques! It produces a list of post recommendations based on a user history of posts, comments and favorites. It can also make recommendations based on a piece of text, so for example, you could paste a particular post and it will return a list of other posts that have some similar characteristics. I hope you enjoy playing around with it! Please let me know what you think. Here's more info in case you're interested (: https://github.com/tomasbielskis/metafilterpostrecommender
Monolog is an interactive diary bot that prompts you with interesting questions, which it chooses based on the topics you write about. [more inside]
A twitter bot that uses machine learning to define invented words, posting truncated definitions on Twitter and complete ones on Tumblr. Tweet @lexiconjure a made-up word, and it'll define it for you. [more inside]
Neuralsnap generates an image caption using a model I trained (convolutional and recurrent neural networks), then uses another character-level recurrent neural net that I trained on ~40 MB of poetry to expand the caption into a poem. (In this example, generated from a Rothko painting, the red text is the direct image caption, and the rest is the poetic expansion.) [more inside]
Using Python 2.7 and the Natural Language Toolkit, I created a program called Sonnetizer that generates 14-line rhyming sonnets in (mostly) iambic pentameter from any text corpus. Using Sonnetizer, I generated 10,000 unique sonnets from the sonnets of William Shakespeare, and compiled them into a PDF.
Step inside the "brain" of Jesus, the Pope, Kurt Cobain, or Tupac to come up with your own sentences and stories. Words are suggested based on the language patterns of the brain (and you are limited to their vocabulary). There's a learning curve, but once you get past it, hilarious times can be had. Read my introduction post or check out the tutorial to get started. [more inside]