21.19 Complexity Parameter

We can print a table of optimal prunings based on a complexity parameter using rpart::printcp(). The data is actually stored as model$cptable.

printcp(model)
## 
## Classification tree:
## rpart(formula = form, data = ds[tr, vars], model = TRUE)
## 
## Variables actually used in tree construction:
## [1] humidity_3pm    wind_gust_speed
## 
## Root node error: 26187/123722 = 0.21166
## 
## n= 123722 
## 
##         CP nsplit rel error  xerror      xstd
## 1 0.151258      0   1.00000 1.00000 0.0054867
## 2 0.033146      1   0.84874 0.84847 0.0051558
## 3 0.010000      3   0.78245 0.78291 0.0049943


Your donation will support ongoing development and give you access to the PDF version of this book. Desktop Survival Guides include Data Science, GNU/Linux, and MLHub. Books available on Amazon include Data Mining with Rattle and Essentials of Data Science. Popular open source software includes rattle, wajig, and mlhub. Hosted by Togaware, a pioneer of free and open source software since 1984.
Copyright © 1995-2021 Graham.Williams@togaware.com Creative Commons Attribution-ShareAlike 4.0.