ZackBot

Date: 2018ish

I swear, I started this project two weeks before Silicon Valley made this exact joke:

I was actually inspired by this post. The author used the GPT-2 walked through the process of using the OpenAI GPT-2 model to generate poetry in the style of various authors and poets. I highly suggest reading it if you’re interested in doing something similar.

The thing that surprised me was how intuitive it seemed to be to get the model to generate text for a particular context, e.g. in the style of a particular author. Essentially, all you have to do is precede segments of text in the training data with a brief tag indicating the context. Then, when you go to generate new text, you prime the model with the context by including that tag. It really feels like cheating, but apparently GPT-2 is really good at identifying these tags and figuring out how to associate them with a particular semantic flavor.

To make a ZackBot, I figured Slack would be my best source for a relatively large number of snappy interactions. I started by making sure I could get the poetry example working. I primed the bot with the line “I am a little bumblebee”.

bumble buzz

Satisfied, I collected the Slack data. I used the Slack API to scrape a couple of channels where I talk with some pals.

I trained the model for a couple of epochs, and the results were looking good:

Share on

Twitter Facebook LinkedIn