Who I Saw, 2020 Edition

The fourth time is the charm, by which I mean I hope I see more celebrities than last year because the third time certainly was not the charm. Jade Chang.  Jade Chang is the author of one of the favorite novels I read in 2018, Wangs vs. the World.  Even better, I actually met her […]

What I Read, 2020 Edition

The Silk Roads: A New History of the World by Peter Frankopan.  After reading this 505 page history, I hereby firmly resolve to not judge a book by its cover.  Had I spent 10-15 minutes skimming chapters, I would have realized that this book is not in fact about “silk roads”, whatever that means to […]

Batch zip files

One of my hard drives is down to its final terabyte, of 8, so its time for me to compress some files. Since I have thousands of files on that drive, it would be inefficient to select them one by one. It turns out its easy to pass a bunch of files to gzip. I […]

Understanding Subnational Variation in Tweets

My primary source of data is tweets I get from Twitter’s POST statuses/filter endpoint, what I believe was called the “Streaming Endpoint” when I started working with Twitter data eons ago.  While it has always been straightforward to use a bounding box to get tweets with geographic information, exactly what Twitter reports and how it […]

Crawling Followers with Intelligent Stopping

Like almost every other academic, I have started a Covid-19 project.  I think my team has a unique angle because of the kind of data I collect.  One dynamic we are interested in is patterns of following, and being able to analyze that across enough accounts required me to work with Twitter endpoints I have […]

My Ongoing Twitter Collections

I recently spent a lot of time reviewing my Twitter data collection infrastructure in order to start some more collections.  In that process, I discovered some tokens and streams I forgot about.  The purpose of this post is to document what data I am collecting as of 04.29.2020 so that I have an easy reference […]