[UPDATE: THE CODE BELOW WAS DESIGNED FOR TWEETS THAT HAD ALREADY BEEN FORMATTED AS .CSV. I HAVE UPDATED THE CODE TO WORK WITH RAW TWEETS. THE GITHUB PAGE HAS LIKEWISE BEEN UPDATED.] When looking at tweets, it is often important to know where the tweet was created. For tweets with GPS coordinates, Twitter is nice […]

I often add text to plots. This text can be a legend or labels for certain points, in which case assigning it x and y coordinates is easy. Often, the text is descriptive, e.g. the slope of a line or the number of observations in a sample. Customizing (x,y) coordinates in that case is simple […]

I was making a plot comparing how five outcomes varied by whether they occurred during Period 1 or Period 2.  I wanted to plot the standard error of each point, which would often cause points to become hidden by the error bars.  The obvious answer is to shift the 10 points (5 outcomes, 2 treatments) […]

In today’s edition of Becoming a Functional Data Analyst, I am writing to remind myself not to create interaction variables during data munging.  That is, when acquiring, cleaning, and aggregating data, I find it easier to not interact variables I will later need.  Instead, it is easier to keep variables as they are and only […]

TL;DR: htmlTable and tableHTML are better than xtable for creating HTML tables in R. Today, I had to make tables for a Word document, meaning I could not convert my R table to LaTeX code. (I know, I know, pandoc. Just another crusty old thing to learn.) In the past, I had used xtable, but […]

[UPDATE: My best use of academic Twitter yet. I tweeted a link to this post and quickly got pwnd by the internet. What I learned: there is a function, ggsave(), that does this, and I did not know about it. So use that instead of the hacky workaround I came up with. Never would’ve learned if […]

Today, I needed to launch a new AWS EC2 instance to collect some more tweets. Since I have done this several times, I used my AWS console to replicate the most recent instance I created that collects tweets. Everything ran smoothly until I tried to install streamR, my go-to library for collecting tweets in […]

This post is a sequel to my wildly popular – ok, not wildly popular, but very useful for me – post documenting my function to create a forest plot.  Recently, I made models using cross-validation via R’s caret package.  The train() function does the heavy lifting, but the problem is that its output is a […]

TL;DR: Just use generic style files for new projects. Overleaf is the best tool I have found for collaborating on LaTeX documents. It is cloud-based, has a ton of templates, offers nice text editing features that go beyond TeXShop and other desktop editors I know, and the rich text editor (using a GUI, like a word processor) […]

On the heels of my book inventory for 2017 is this post, my celebrity sighting inventory for 2017. The New England sophisticate in me hates that I even considered documenting these events, much less put them on my professional website, but the budding Angeleno in me, interacted with my forgetfulness, thinks it’s so cool that […]