Written by ZacharySTOctober 17, 2020March 5, 2025

Compendium of Evidence for not Using Excel for Data Analysis

I always tell my students not to use Excel for data analysis, which I think means never to use it. Confused eyes stare back at me, understandably since Microsoft is like oxygen: imagining life without it sounds deadly. So this post is going to document examples of where using Excel for data analysis has led […]

Written by ZacharySTSeptember 16, 2020March 5, 2025

Batch zip files

One of my hard drives is down to its final terabyte, of 8, so its time for me to compress some files. Since I have thousands of files on that drive, it would be inefficient to select them one by one. It turns out its easy to pass a bunch of files to gzip. I […]

Written by ZacharySTAugust 18, 2020March 5, 2025

Fixing Page Numbers with Large Floats in Latex

It’s quite common for me to have a figure that takes up an entire page. Latex always has trouble placing that page’s number: it usually puts it in the top right. I finally found a Stack Exchange answer that works for me: https://tex.stackexchange.com/questions/238469/page-numbering-on-bottom-center-on-full-page-figure. What’s also cool is that the overfull messages I would get may […]

Written by ZacharySTJune 1, 2020March 5, 2025

Feel Less Bad: Pay Attention to Social Media to Better Understand the World

My research uses Twitter to understand the dynamics of protest and state violence, which also means I think a lot about media coverage. I have a few thoughts on recent events. TL;DR: There is very little looting going on across the United States. There are lots and lots of peaceful protests and protesters. You would […]

Written by ZacharySTMay 8, 2020March 5, 2025

Understanding Subnational Variation in Tweets

My primary source of data is tweets I get from Twitter’s POST statuses/filter endpoint, what I believe was called the “Streaming Endpoint” when I started working with Twitter data eons ago. While it has always been straightforward to use a bounding box to get tweets with geographic information, exactly what Twitter reports and how it […]

Written by ZacharySTApril 29, 2020March 5, 2025

Crawling Followers with Intelligent Stopping

Like almost every other academic, I have started a Covid-19 project. I think my team has a unique angle because of the kind of data I collect. One dynamic we are interested in is patterns of following, and being able to analyze that across enough accounts required me to work with Twitter endpoints I have […]

Written by ZacharySTApril 29, 2020March 5, 2025

My Ongoing Twitter Collections

I recently spent a lot of time reviewing my Twitter data collection infrastructure in order to start some more collections. In that process, I discovered some tokens and streams I forgot about. The purpose of this post is to document what data I am collecting as of 04.29.2020 so that I have an easy reference […]

Written by ZacharySTDecember 31, 2019January 1, 2020

What I Read, 2019 Version

Starting with the 19th book review (Mao biography), I have decided to add a grade and, to counterbalance what can often seem like negative reviews, one interesting fact learned from each book. I aim for a B- average. The Sellout by Paul Beatty – Wow, what a novel. My wife bought it for me for […]

Written by ZacharySTDecember 30, 2019March 5, 2025

Who I Saw, 2019 Edition

Tim Gunn (April 2019) – My wife always raves about the celebrities she saw while a student in NYC, so I was cautiously optimistic during a weekend we spent there. While we did not see anyone in the Village, we did cross paths with Tim Gunn as we walked from Levain Bakery to the Museum […]