DATA TRANSFORMATION
R & dplyr
CONTENT
RECOMMENDED LITERATURE
Wickham, Hadley, Mine Çetinkaya-Rundel, and Garrett Grolemund. R for Data Science: Import, Tidy, Transform, Visualize, and Model Data. 2nd edition, O’Reilly, 2023. Available online: https://r4ds.hadley.nz/
Sauer, Sebastian. Moderne Datenanalyse mit R. Springer Gabler, 2019.
→ Chapter 7
STEPS IN EXPLORATORY DATA ANALYSIS
Source: Wickham, Hadley, and Garrett Grolemund. R for Data Science: Import, Tidy, Transform, Visualize, and Model Data. First edition, O’Reilly, 2016. URL: https://r4ds.hadley.nz/diagrams/data-science/base.png
FIRST STEPS WITH R & RStudio
DESKTOP OR CLOUD
Download and install R and RStudio
RStudio walkthrough
alternatively:
Register and log in to RStudio Cloud
CREATE A NEW PROJECT
All code examples for this course are hosted publicly on GitHub
CHECK OUT THE GITHUB REPO
All code examples for this course are hosted publicly on GitHub
OUR TOOLSET
mutate
transmute
group_by
summarize
filter
select
arrange
GROUP & SUMMARIZE
GROUP BY VARIABLE AND SUMMARIZE
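A minimal sketch of the group-then-summarize pattern. The toy table and its column names (customer, total) are assumptions for illustration and are not taken from the course datasets.

```r
library(dplyr)

# Hypothetical toy data: one row per order
orders <- tibble(
  customer = c("a", "b", "a", "c", "b"),
  total    = c(10, 20, 30, 40, 50)
)

# group_by() marks the groups; summarize() collapses each group to one row
revenue_by_customer <- orders %>%
  group_by(customer) %>%
  summarize(
    n_orders = n(),        # number of rows in the group
    revenue  = sum(total)  # sum of order totals in the group
  )

print(revenue_by_customer)
```

The result has one row per distinct value of the grouping variable, here three rows for three customers.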
DATA LOADING
{readr}
EXERCISE DATA
Politicians’ Tweets (JSON / RDS)
Campusbier Sales Orders (CSV)
REWE Online Products (CSV)
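Loading CSV and RDS data with {readr} could look roughly like this. To keep the sketch self-contained, an inline string stands in for a real CSV file, and the column names are made up. Passing literal text via I() assumes readr 2.0 or newer.

```r
library(readr)

# Inline CSV text stands in for a file on disk; columns are hypothetical
csv_text <- "order_id,created_at,total
1,2021-03-01,24.90
2,2021-03-02,12.50"

# read_csv() guesses column types and returns a tibble
orders <- read_csv(I(csv_text), show_col_types = FALSE)
print(orders)

# RDS round trip: write the tibble to a temporary file and read it back
rds_path <- tempfile(fileext = ".rds")
write_rds(orders, rds_path)
orders_again <- read_rds(rds_path)
```

With a real file, the call would simply take the path, for example read_csv("orders.csv").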
DATA MANAGEMENT
{tibble}
HANDLE THE DATA
Tibbles or data frames? Both are like tables in a spreadsheet… just in R
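A short sketch of one practical difference between the two, using invented example values: a tibble is still a data frame, but single-bracket subsetting always returns a tibble, whereas a base data frame may drop to a plain vector.

```r
library(tibble)

# Hypothetical example values
df  <- data.frame(product = c("Helles", "Pils"), price = c(1.8, 1.9))
tbl <- as_tibble(df)

class(df)   # "data.frame"
class(tbl)  # tibble classes plus "data.frame"

# Selecting a single column with [ , ]
price_from_df  <- df[, "price"]   # plain numeric vector
price_from_tbl <- tbl[, "price"]  # still a one-column tibble

print(tbl)  # tibbles also print column types and only the first rows
```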
DATA TRANSFORMATION
{dplyr}
SELECT COLUMNS
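A few common ways to pick columns with select(), sketched on a small invented product table (the column names are assumptions, not those of the REWE dataset).

```r
library(dplyr)

# Hypothetical toy data
products <- tibble(
  name     = c("Milk", "Bread"),
  brand    = c("A", "B"),
  price    = c(1.09, 2.49),
  weight_g = c(1000, 750)
)

by_name     <- products %>% select(name, price)            # keep listed columns
without     <- products %>% select(-brand)                 # drop a column
by_prefix   <- products %>% select(starts_with("p"))       # helper: name prefix
with_rename <- products %>% select(product = name, price)  # rename while selecting
```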
FILTER ROWS
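Row filtering with filter() might be sketched as follows. The data are invented; conditions separated by commas are combined with AND, while | expresses OR.

```r
library(dplyr)

# Hypothetical toy data
products <- tibble(
  name     = c("Milk", "Bread", "Butter"),
  brand    = c("A", "B", "A"),
  price    = c(1.09, 2.49, 1.99),
  weight_g = c(1000, 750, 250)
)

cheap         <- products %>% filter(price < 2)                       # single condition
brand_a_cheap <- products %>% filter(brand == "A", price < 1.5)       # AND
b_or_heavy    <- products %>% filter(brand == "B" | weight_g >= 1000) # OR
picked        <- products %>% filter(name %in% c("Milk", "Butter"))   # set membership
```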
SORT ROWS
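A brief sketch of sorting with arrange(), again on invented data: ascending order is the default, desc() reverses it, and additional columns break ties.

```r
library(dplyr)

# Hypothetical toy data
products <- tibble(
  name  = c("Milk", "Bread", "Butter"),
  brand = c("A", "B", "A"),
  price = c(1.09, 2.49, 1.99)
)

ascending  <- products %>% arrange(price)               # cheapest first
descending <- products %>% arrange(desc(price))         # most expensive first
nested     <- products %>% arrange(brand, desc(price))  # by brand, then price descending
```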
ADD OR CHANGE COLUMNS
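The difference between mutate() and transmute() can be sketched like this (invented columns): mutate() keeps all existing columns and adds or overwrites the named ones, while transmute() keeps only the columns listed in the call. Recent dplyr releases mark transmute() as superseded by mutate(.keep = "none"), but it still works.

```r
library(dplyr)

# Hypothetical toy data: one row per order line
line_items <- tibble(
  item       = c("x", "y"),
  quantity   = c(2, 3),
  unit_price = c(1.5, 2.0)
)

# Add a new column, keep everything else
with_total <- line_items %>%
  mutate(line_total = quantity * unit_price)

# Overwrite an existing column in place
in_cents <- line_items %>%
  mutate(unit_price = unit_price * 100)

# Keep only the columns named in the call
only_total <- line_items %>%
  transmute(item, line_total = quantity * unit_price)
```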
SUMMARIZE ROWS
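Without a preceding group_by(), summarize() collapses the whole table into a single row. A minimal sketch on invented order totals:

```r
library(dplyr)

# Hypothetical toy data
orders <- tibble(total = c(10, 20, 30, 40))

overview <- orders %>%
  summarize(
    n_orders  = n(),          # row count
    revenue   = sum(total),   # total revenue
    avg_order = mean(total),  # average order value
    max_order = max(total)    # largest single order
  )

print(overview)
```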
EXERCISE
CAMPUSBIER SALES ORDERS
AD-HOC EXERCISE
You are the new managing director of the Campusbier project and need to get a first impression of the business. All you have are two datasets: orders.csv and line_items.csv.
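One possible way to approach such a first look, sketched on tiny invented tables. The real column names in orders.csv and line_items.csv are not given here, so order_id, created_at, product, quantity and price are purely hypothetical stand-ins.

```r
library(dplyr)

# Hypothetical stand-ins for the two datasets
orders <- tibble(
  order_id   = c(1, 2, 3),
  created_at = as.Date(c("2021-03-01", "2021-03-01", "2021-03-02"))
)
line_items <- tibble(
  order_id = c(1, 1, 2, 3),
  product  = c("Beer", "Glass", "Beer", "Beer"),
  quantity = c(6, 1, 12, 6),
  price    = c(1.5, 4, 1.5, 1.5)
)

# Step 1: inspect structure and column types
glimpse(orders)
glimpse(line_items)

# Step 2: which products bring in the most revenue?
top_products <- line_items %>%
  mutate(line_total = quantity * price) %>%
  group_by(product) %>%
  summarize(units = sum(quantity), revenue = sum(line_total)) %>%
  arrange(desc(revenue))

# Step 3: revenue per day, joining order dates onto the line items
daily_revenue <- orders %>%
  inner_join(line_items, by = "order_id") %>%
  group_by(created_at) %>%
  summarize(revenue = sum(quantity * price))
```

With the real files, the two tibble() calls would be replaced by read_csv() calls and the column names adapted to what glimpse() shows.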