How to Use googlesheets to Connect R to Google Sheets

How to Use googlesheets to Connect R to Google Sheets

Often I use R to handle large datasets, analyse the data and filter out the data I don’t need. When all this is done, I usually use write.csv()┬áto print my data off and reopen it in Google Sheets. My workflow […]

Continue Reading

R for Absolute Beginners

  On Tuesday I gave a workshop at the Data Journalism UK conference, run by Paul Bradshaw. This was the worked example for absolute beginners that we went through. If you’ve never looked at R before and want to run […]

Continue Reading

Road accidents in November

Road accidents in November

Recently the British Department for Transport published its latest STATS19 data for the year 2016. We’ve looked at this data before. To recap, each row of the STATS19 data is a traffic accident that caused injury or death, identified by […]

Continue Reading

Vandalism Causing Train Delays

Vandalism Causing Train Delays

Over the past two weeks I’ve been looking at Network Rail’s delays data. The data tells us how many delays there have been to trains thanks to all kinds of problems that affect the railways, from natural causes such as […]

Continue Reading

The Losses in the Final Year of WW1

The Losses in the Final Year of WW1

Back in August 2014, around the 100th anniversary of the outbreak of the First World War, the Data Unit published our analysis of the Commonwealth War Graves Commission‘s records of fallen soldiers, airmen, sailors and other servicemen and women who […]

Continue Reading

Scraping in R: Access to mortgage petition

Scraping in R: Access to mortgage petition

Over the past few years a good source of data has been Parliament’s petitions website. Anyone can start petitions or sign them. MPs have to consider the ones that get to 100,000 signatures for debates. The most popular petitions often […]

Continue Reading

Spring Budget 2017: Circle visualisation

Spring Budget 2017: Circle visualisation

It’s time to branch out into a new area of data visualisation: proportion area plots. These plots use area to show proportion between different related values. A common type of proportional area plots are tree maps. We are going to […]

Continue Reading

Comparing Donald Trump and Hillary Clinton’s Facebook pages during the US presidential election, 2016

Comparing Donald Trump and Hillary Clinton’s Facebook pages during the US presidential election, 2016

R has a lot of packages for users to analyse posts on social media. As an experiment in this field, I decided to start with the biggest one: Facebook. I decided to look at the Facebook activity of Donald Trump […]

Continue Reading

Calculating Distances in R: How Fast is Your Train?

Calculating Distances in R: How Fast is Your Train?

Earlier this month Marie Segger, Carlos Novoa and I had a major new project published about different rail speeds between cities around Britain. We compared the distances between train stations in Britain’s largest cities and found which areas were poorly-served […]

Continue Reading

Internal Migration Part III: Plotting Age Groups

Internal Migration Part III: Plotting Age Groups

Introduction In Part I we looked at overall internal migration local authority by local authority – are more people coming than going? In Part II we looked at where people are moving from and to around the country. Here in […]

Continue Reading