class: center, middle, inverse, title-slide # R for Data Analysis ## Handling date and time ### Ayush Patel ### 29-Jul-2021 --- layout: true --- name: Introduction class: left,middle .pull-left[ ## Find me [__@ayushbipinpatel__](https://twitter.com/ayushbipinpatel) <img src="" width=5%> [__@AyushBipinPatel__](https://github.com/AyushBipinPatel) <img src="" width=5%> [__ayushpatel.netlify.app__](https://ayushpatel.netlify.app/) <img src="" width=5%> [__ayush.ap58@gmail.com__](ayush.ap58@gmail.com)<img src="" width=5%> ] .pull-right[ <img src = "https://images.metmuseum.org/CRDImages/ad/original/57258.jpg"> .small[ Image: [John Biglin in a Single Scull by Thomas Eakins](https://images.metmuseum.org/CRDImages/ad/original/57258.jpg) ] ] --- class: left, middle .pull-left[ # Pre-requisite .big[You....] understand __different types of objects, how to create objects and assign values to objects__. <br> __how to access specific values within an object.__ <br> know __what a function is and how to use a function.__ <br> know __basics of data wrangling__ <br> ] .pull-right[ <img src = "https://images.metmuseum.org/CRDImages/ad/original/DT84.jpg"> .small[ [Image: Lake George by John Frederick Kensett ](https://images.metmuseum.org/CRDImages/ad/original/DT84.jpg) ] ] --- # Before we get to it.. .big[Continue in the Rmarkdown document you used in the last class or create a new one.] <br> Load the following libraries and data ```r library(tidyverse) library(nycflights13) library(lubridate) ``` --- class: middle # Why learn about handling date and time + This is not as simple as it seems. How may seconds make a minute, how many days make a year?? + Messes up up calcualtions if you dont pay attention + TIMEZONEs Need I say more?!! --- class: middle # The date time These objects can be of three types: + A Date + A Time + A Date-Time .big[Use the format which will get the job done.] Meaning if you only need date, no need to use data-time. --- # Creating date/times .big[There are 3 ways to do it] .yscroll[ .panelset[ .panel[.panel-name[From String] ```r ymd("2020-03-23") # what happened on this day? ``` ``` ## [1] "2020-03-23" ``` ```r mdy("January 26st, 1950") ``` ``` ## [1] "1950-01-26" ``` ```r dmy("08-Nov-2016") # the horror of this day!!! ``` ``` ## [1] "2016-11-08" ``` *_hms *_hm <br> <br> <br> <br> <br> <br> ] .panel[.panel-name[From individual Date time components] ``` ## # A tibble: 336,776 x 5 ## year month day hour minute ## <int> <int> <int> <dbl> <dbl> ## 1 2013 1 1 5 15 ## 2 2013 1 1 5 29 ## 3 2013 1 1 5 40 ## 4 2013 1 1 5 45 ## 5 2013 1 1 6 0 ## 6 2013 1 1 5 58 ## 7 2013 1 1 6 0 ## 8 2013 1 1 6 0 ## 9 2013 1 1 6 0 ## 10 2013 1 1 6 0 ## # ... with 336,766 more rows ``` ```r flights %>% select(year, month, day, hour, minute) %>% mutate(departure = make_datetime(year, month, day, hour, minute), departure2 = make_date(year, month, day)) %>% select(departure,departure2,everything()) ``` ``` ## # A tibble: 336,776 x 7 ## departure departure2 year month day hour minute ## <dttm> <date> <int> <int> <int> <dbl> <dbl> ## 1 2013-01-01 05:15:00 2013-01-01 2013 1 1 5 15 ## 2 2013-01-01 05:29:00 2013-01-01 2013 1 1 5 29 ## 3 2013-01-01 05:40:00 2013-01-01 2013 1 1 5 40 ## 4 2013-01-01 05:45:00 2013-01-01 2013 1 1 5 45 ## 5 2013-01-01 06:00:00 2013-01-01 2013 1 1 6 0 ## 6 2013-01-01 05:58:00 2013-01-01 2013 1 1 5 58 ## 7 2013-01-01 06:00:00 2013-01-01 2013 1 1 6 0 ## 8 2013-01-01 06:00:00 2013-01-01 2013 1 1 6 0 ## 9 2013-01-01 06:00:00 2013-01-01 2013 1 1 6 0 ## 10 2013-01-01 06:00:00 2013-01-01 2013 1 1 6 0 ## # ... with 336,766 more rows ``` <br> <br> <br> <br> ] .panel[.panel-name[From date/time object] ``` ## [1] "2021-08-01 UTC" ``` ```r as_date(yesterday) ``` ``` ## [1] "2021-08-01" ``` ] ] ] --- # Activity 1 Copy this code, Use lubridate functions to get following dates ```r d1 <- "January 1, 2010" d2 <- "2015-Mar-07" d3 <- "06-Jun-2017" d4 <- c("August 19 (2015)", "July 1 (2015)") d5 <- "12/30/14" # Dec 30, 2014 ```
05
:
00
--- # Getting components of a date-time .big[What does that mean] Any of the following: .center[ year <br> <br> month <br> <br> month and day <br> <br> day of the year <br> <br> day of the week <br> <br> hour <br> <br> minutes <br> <br> seconds ] --- .yscroll[ ```r datetime <- ymd_hms("2016-07-08 12:34:56") year(datetime) ``` ``` ## [1] 2016 ``` ```r month(datetime) ``` ``` ## [1] 7 ``` ```r mday(datetime) ``` ``` ## [1] 8 ``` ```r yday(datetime) ``` ``` ## [1] 190 ``` ```r wday(datetime) ``` ``` ## [1] 6 ``` ] --- ```r month(datetime, label = TRUE) ``` ``` ## [1] Jul ## 12 Levels: Jan < Feb < Mar < Apr < May < Jun < Jul < Aug < Sep < ... < Dec ``` ```r wday(datetime, label = TRUE, abbr = FALSE) ``` ``` ## [1] Friday ## 7 Levels: Sunday < Monday < Tuesday < Wednesday < Thursday < ... < Saturday ``` --- # Time spans ``` durations, which represent an exact number of seconds. periods, which represent human units like weeks and months. intervals, which represent a starting and ending point. ``` .small[from R4ds] --- ## Durations ```r datedemon <- dmy("08/Nov/2016") today() - datedemon -> since_demon as.duration(since_demon) ``` ``` ## [1] "149212800s (~4.73 years)" ``` --- ### Duration Constructors .yscroll[ .panelset[ .panel[.panel-name[constructor] ```r dseconds(15) ``` ``` ## [1] "15s" ``` ```r dminutes(10) ``` ``` ## [1] "600s (~10 minutes)" ``` ```r dhours(c(12, 24)) ``` ``` ## [1] "43200s (~12 hours)" "86400s (~1 days)" ``` ```r ddays(0:5) ``` ``` ## [1] "0s" "86400s (~1 days)" "172800s (~2 days)" ## [4] "259200s (~3 days)" "345600s (~4 days)" "432000s (~5 days)" ``` ```r dweeks(3) ``` ``` ## [1] "1814400s (~3 weeks)" ``` ```r dyears(1) ``` ``` ## [1] "31557600s (~1 years)" ``` ] .panel[.panel-name[Allows math operations] ```r 2 * dyears(1) ``` ``` ## [1] "63115200s (~2 years)" ``` ```r dyears(1) + dweeks(12) + dhours(15) ``` ``` ## [1] "38869200s (~1.23 years)" ``` BUT, be careful of DST. ] ] ] --- # Periods >"Periods are time spans but don’t have a fixed length in seconds, instead they work with “human” times, like days and months." .small[from R4ds] ```r one_pm <- ymd_hms("2016-03-12 13:00:00", tz = "America/New_York") one_pm ``` ``` ## [1] "2016-03-12 13:00:00 EST" ``` ```r one_pm + ddays(1) ``` ``` ## [1] "2016-03-13 14:00:00 EDT" ``` ```r one_pm + days(1) ``` ``` ## [1] "2016-03-13 13:00:00 EDT" ``` Periods have constructors just like Durations. --- class: center, middle background-image: url("images/background2.jpg") background-size: cover