With the holidays approaching, one of the most discussed questions at was whether we’ll have a white Christmas or not. And what better way to get our hopes up, than by taking a look at the historic data on the snow depth on the past ten Christmas Eves?
But how to best visualize spatial data? Other than most data types, spatial data usually calls for a very particular visualization, namely data points overlaying a map. In this way areal data is automatically contextualized by the geographic information intuitively conveyed by a map.
The basic functionality of ggplot2
dosen’t offer the possibility to do so, but there is a package akin to ggplot2
that allows to do so: ggmap
. ggmap
was written by David Kahle and Hadley Wickham and combines the building blocks of ggplot2
, the grammar of graphics as well as the static maps of Google Maps, OpenStreetMap, Stamen Maps or CloudMade Maps. And ggmap
allows to make really fancy visualizations:
Above-average snow depth on Christmas Eve (2008-2017)
First things first: the map
Conveniently, ggmap
employs the same theoretical framework and general syntax as ggplot2
. However, ggmap
requires one additional step: Before we can start plotting, we have to download a map as backdrop for our visualization. This is done with get_stamenmap()
, get_cloudmademap()
, get_googlemap()
or get_openstreetmap()
or the more general get_map()
. We’re going to use get_stamenmap()
.
To determine the depicted map cutout, the left, bottom, right and top coordinates of a bounding box, have to be supplied to the argument bbox
. Conveniently, there is no need to know the exact latitudes and longitudes of each and every bounding box of interest. The function geocode_OSM()
from the package tmaptools
, which whenever possible returns the coordinates of a search query consisting of an address, zip code and/or name of a city or country.
library(scales)
library(tidyverse)
library(tmaptools)
library(ggimage)
library(ggmap)
# get the bounding box
geocode_OSM("Germany")$bbox
# returns:
# xmin ymin xmax ymax
# 5.866315 47.270111 15.041932 55.099161
The zoom level can be set via the zoom argument and can range between 0 (least detailed) and 18 (most detailed, quick disclaimer: this can take a very long time). The zoom level determines the resolution of the image as well as the amount of displayed annotations.
Depending on whether we want to highlight roads, political or administrative boundaries or bodies of water and land different styles of maps excel. The maptype
argument allows to choose from different ready-made styles: "terrain"
, "terrain-background"
, "terrain-labels"
, "terrain-lines"
, "toner"
, "toner-2010"
, "toner-2011"
, "toner-background"
, "toner-hybrid"
, "toner-labels"
, "toner-lines"
, "toner-lite"
or "watercolor"
.
Some further, very handy arguments of get_stamenmap()
are crop
, force
and color
: As implied by the name, color
defines whether a map should be in black-and-white ("bw"
) or when possible in color ("color"
).
Under the hood get_stamenmap()
downloads map tiles, which are joined to the complete map. If the map tiles should be cropped so as to only depict the specified bounding box, the crop
argument can be set to TRUE
.
Unless the force
argument is set to TRUE
, even when arguments changing the style of a map have been altered, once a map of a given location has been downloaded it will not be downloaded again.
When we’ve obtained the map of the right location and style, we can store the “map image” in an object or simply pass it along to ggmap()
to plot it. The labels, ticks etc. of axes can be controlled as usual.
# getting map
plot_map_z7 <- get_stamenmap(as.numeric(geocode_OSM("Germany")$bbox),
zoom = 7,
force = T,
maptype = "terrain")
# saving plotted map alone
plot1 <- ggmap(plot_map_z7) +
theme(axis.title = element_blank(),
axis.ticks = element_blank(),
axis.text = element_blank())
# getting map
plot_map_z5 <- get_stamenmap(as.numeric(geocode_OSM("Germany")$bbox),
zoom = 5,
force = T,
maptype = "terrain")
# saving plotted map alone
plot2 <- ggmap(plot_map_z5) +
theme(axis.title = element_blank(),
axis.ticks = element_blank(),
axis.text = element_blank())
# plotting maps together
plot <- gridExtra::grid.arrange(plot1, plot2, nrow = 1)
Example for maptype = “terrain” with zoom = 7 (left) vs. zoom = 5 (right).
Business as usual: layering geoms on top
We then can layer any ggplot2
geom we’d like on top of that map, with the only requirement being that the variables mapped to the axes are within the same numeric range as the latitudes and longitudes of the depicted map. We also can use many extension packages building on ggplot2
. For example, we can use the very handy package ggimage
by Guangchuang Yu to make our plots extra festive:
# aggregating data per coordinate
df_snow_agg <- df_snow %>%
dplyr::mutate(LATITUDE = plyr::round_any(LATITUDE, accuracy = 1),
LONGITUDE = plyr::round_any(LONGITUDE, accuracy = 1)) %>%
dplyr::group_by(LATITUDE, LONGITUDE) %>%
dplyr::summarise(WERT = mean(WERT, na.rm = TRUE))
# cutting into equal intervals
df_snow_agg$snow <- as.numeric(cut(df_snow_agg$WERT, 12))
# setting below average snow depths to 0
df_snow_agg <- df_snow_agg %>%
mutate(snow = ifelse(WERT <= mean(df_snow_agg$WERT), 0, snow))
# adding directory of image to use
df_snow_agg$image <- "Snowflake_white.png"
# getting map
plot_map <- get_stamenmap(as.numeric(geocode_OSM("Germany")$bbox),
zoom = 7,
force = T,
maptype = "terrain")
# plotting map + aggregated snow data with ggimage
plot <- ggmap(plot_map) +
geom_image(data = df_snow_agg,
aes(x = LONGITUDE,
y = LATITUDE,
image = image,
size = I(snow/20))) + # rescaling to get valid size values
theme(axis.title = element_blank(),
axis.ticks = element_blank(),
axis.text = element_blank())
Above-average snow depth on Christmas Eve (2008-2017)
We can get creative, playing with all sorts of geoms and map styles. Especially visualization strategies to prevent can be quite handy:
For example, a facetted scatter plot with alpha blending:
# getting map
plot_map <- get_stamenmap(as.numeric(geocode_OSM("Germany")$bbox),
zoom = 7,
force = T,
maptype = "watercolor")
# facets per year, scatter plot with alpha blending
plot <- ggmap(plot_map) +
geom_point(data = df_snow,
aes(x = LONGITUDE,
y = LATITUDE,
alpha = WERT,
size = WERT),
color = "white") +
facet_wrap(. ~ DATE, nrow = 2) +
scale_alpha_continuous(range = c(0.0, 0.9)) +
theme(axis.title = element_blank(),
axis.ticks = element_blank(),
axis.text = element_blank(),
legend.position = "none",
strip.background = element_rect(fill = "white"))
Snow depth on Christmas Eve per year (2008-2017)
geom_line()
or geom_polygon()
:# getting map
plot_map <- get_stamenmap(as.numeric(geocode_OSM("Germany")$bbox),
zoom = 7,
crop = FALSE,
force = T,
maptype = "toner-background")
# density plot: lines
plot1 <- ggmap(plot_map) +
geom_density2d(data = df_snow_dens,
aes(x = LONGITUDE,
y = LATITUDE,
color = ..level..),
size = 0.7) +
scale_color_continuous(low = "#17775A", high = "white",
name = "",
breaks = c(0.01, 0.06),
labels = c("little snow", "much snow")) +
theme(axis.title = element_blank(),
axis.ticks = element_blank(),
axis.text = element_blank(),
legend.position = "left")
# density plot: polygon
plot2 <- ggmap(plot_map) +
stat_density_2d(data = df_snow_dens,
aes(x = LONGITUDE, y = LATITUDE,
alpha = ..level..),
fill = "#CFECFF",
geom = "polygon") +
scale_alpha(name = "",
breaks = c(0.012, 0.024, 0.036, 0.048, 0.06),
labels = c("little snow", "","", "", "much snow")) +
theme(axis.title = element_blank(),
axis.ticks = element_blank(),
axis.text = element_blank())
plot <- gridExtra::grid.arrange(plot1, plot2, nrow = 1)
ggsave(plot = plot, filename = "density.png", width = 5.5, height = 3)
ggmap
, alas based on the historic data, there will be no white Christmas for most of us… Against all odds, I still hope that outside there’s a winter wonderland waiting for you just now.From everybody here at STATWORX happy holidays and a happy new year!
References
- D. Kahle and H. Wickham. ggmap: Spatial Visualization with ggplot2. The R Journal, 5(1), 144-161. http://journal.r-project.org/archive/2013-1/kahle-wickham.pdf
- DWD Climate Data Center (CDC): Tägliche Stationsmessungen Schneehöhe in cm für Deutschland, Version v18.3 & recent, abgerufen am 16.12.2018.
- Map tiles by , under Data by , under respectively under .