11.2 Visualisation Data
The rattle::weatherAUS dataset is loaded into the template
ds and further template variables are setup as
introduced by Graham J. Williams (2017). See
Chapter 8 for details.
<- "weatherAUS" dsname <- get(dsname) ds <- nrow(ds) nobs <- names(ds) vnames %<>% clean_names(numerals="right") ds names(vnames) <- names(ds) <- names(ds) vars <- "rain_tomorrow" target <- c(target, vars) %>% unique() %>% rev()vars
We also do a little more to set the data up for demonstrating various approaches to visualisation. As with the model template, a number of template variables are identified here. We also a little data wrangling to remove all missing values by performing a missing value imputation with randomForest::na.roughfix().
<- "risk_mm" risk <- c("date", "location") id <- c(risk, id) ignore <- setdiff(vars, ignore) vars <- setdiff(vars, target) inputs %<>% na.roughfix()ds[vars]
Williams, Graham J. 2017. The Essentials of Data Science: Knowledge Discovery Using r. The r Series. CRC Press.
Your donation will support ongoing availability and give you access to the PDF version of this book. Desktop Survival Guides include Data Science, GNU/Linux, and MLHub. Books available on Amazon include Data Mining with Rattle and Essentials of Data Science. Popular open source software includes rattle, wajig, and mlhub. Hosted by Togaware, a pioneer of free and open source software since 1984. Copyright © 1995-2022 Graham.Williams@togaware.com Creative Commons Attribution-ShareAlike 4.0