10.2 rain demo data

20210418

Prepare the Data

The weatherAUS data comes from the Rattle package (https://rattle.togaware.com). It covers some 50 weather stations in Australia with over 10 years of daily observations of some 20 variables. The data is loaded, cleansed and wrangled, and prepared for modelling, as explained in the OnePageR chapter on data templates:

https://onepager.togaware.com/Chapter_Data_Template.html.

A view of the data is shared below.

Rows: 176,747
Columns: 24
$ date            <date> 2008-12-01, 2008-12-02, 2008-12-03, 2008-12-04, 2008…
$ location        <chr> "Albury", "Albury", "Albury", "Albury", "Albury", "Al…
$ min_temp        <dbl> 13.4, 7.4, 12.9, 9.2, 17.5, 14.6, 14.3, 7.7, 9.7, 13.…
$ max_temp        <dbl> 22.9, 25.1, 25.7, 28.0, 32.3, 29.7, 25.0, 26.7, 31.9,…
$ rainfall        <dbl> 0.6, 0.0, 0.0, 0.0, 1.0, 0.2, 0.0, 0.0, 0.0, 1.4, 0.0…
$ evaporation     <dbl> 4.8, 4.8, 4.8, 4.8, 4.8, 4.8, 4.8, 4.8, 4.8, 4.8, 4.8…
$ sunshine        <dbl> 8.5, 8.5, 8.5, 8.5, 8.5, 8.5, 8.5, 8.5, 8.5, 8.5, 8.5…
$ wind_gust_dir   <ord> w, wnw, wsw, ne, w, wnw, w, w, nnw, w, n, nne, w, sw,…
$ wind_gust_speed <dbl> 44, 44, 46, 24, 41, 56, 50, 35, 80, 28, 30, 31, 61, 4…
$ wind_dir_9am    <ord> w, nnw, w, se, ene, w, sw, sse, se, s, sse, ne, nnw, …
$ wind_dir_3pm    <ord> wnw, wsw, wsw, e, nw, w, w, w, nw, sse, ese, ene, nnw…
$ wind_speed_9am  <dbl> 20, 4, 19, 11, 7, 19, 20, 6, 7, 15, 17, 15, 28, 24, 4…
$ wind_speed_3pm  <dbl> 24, 22, 26, 9, 20, 24, 24, 17, 28, 11, 6, 13, 28, 20,…
$ humidity_9am    <dbl> 71, 44, 38, 45, 82, 55, 49, 48, 42, 58, 48, 89, 76, 6…
$ humidity_3pm    <dbl> 22, 25, 30, 16, 33, 23, 19, 19, 9, 27, 22, 91, 93, 43…
$ pressure_9am    <dbl> 1007.7, 1010.6, 1007.6, 1017.6, 1010.8, 1009.2, 1009.…
$ pressure_3pm    <dbl> 1007.1, 1007.8, 1008.7, 1012.8, 1006.0, 1005.4, 1008.…
$ cloud_9am       <dbl> 8, 5, 5, 5, 7, 5, 1, 5, 5, 5, 5, 8, 8, 5, 5, 0, 8, 8,…
$ cloud_3pm       <dbl> 5, 5, 2, 5, 8, 5, 5, 5, 5, 5, 5, 8, 8, 7, 5, 5, 1, 1,…
$ temp_9am        <dbl> 16.9, 17.2, 21.0, 18.1, 17.8, 20.6, 18.1, 16.3, 18.3,…
$ temp_3pm        <dbl> 21.8, 24.3, 23.2, 26.5, 29.7, 28.9, 24.6, 25.5, 30.2,…
$ rain_today      <fct> no, no, no, no, no, no, no, no, no, yes, no, yes, yes…
$ risk_mm         <dbl> 0.0, 0.0, 0.0, 1.0, 0.2, 0.0, 0.0, 0.0, 1.4, 0.0, 2.2…
$ rain_tomorrow   <fct> no, no, no, no, no, no, no, no, yes, no, yes, yes, ye…


Your donation will support ongoing availability and give you access to the PDF version of this book. Desktop Survival Guides include Data Science, GNU/Linux, and MLHub. Books available on Amazon include Data Mining with Rattle and Essentials of Data Science. Popular open source software includes rattle, wajig, and mlhub. Hosted by Togaware, a pioneer of free and open source software since 1984. Copyright © 1995-2022 Graham.Williams@togaware.com Creative Commons Attribution-ShareAlike 4.0