11.61 Scatter Plot Colour Shape Theme BW

20180603

set.seed(26439)
ds %>%
  sample_n(1000) %>%
  ggplot(aes(x=min_temp, y=max_temp, colour=rain_tomorrow, shape=rain_tomorrow)) +
  geom_point() +
  labs(x      = vnames["min_temp"], 
       y      = vnames["max_temp"], 
       colour = vnames["rain_tomorrow"]) +
  theme_bw() +
  theme(legend.position="bottom")

There are many circumstances where it makes sense to colour the points according to some scheme. For example, in a predictive modelling context the colour might correspond to the prediction made (yes or no). For a cluster analysis the colour might represent the cluster each point is allocated to.

A key variable of interest in the rattle::weatherAUS dataset is rain_tomorrow. By colouring the dots according to the rain_tomorrow we may begin to see relationships in the data.

The colour is added simply by specifying a further aesthetic, colour=rain_tomorrow. Different values of the variable rain_tomorrow will then be coloured differently.



Your donation will support ongoing availability and give you access to the PDF version of this book. Desktop Survival Guides include Data Science, GNU/Linux, and MLHub. Books available on Amazon include Data Mining with Rattle and Essentials of Data Science. Popular open source software includes rattle, wajig, and mlhub. Hosted by Togaware, a pioneer of free and open source software since 1984. Copyright © 1995-2022 Graham.Williams@togaware.com Creative Commons Attribution-ShareAlike 4.0