03. Large data

Billions.

Irene Iodice https://ioire.github.io , Julian Hinz https://julianhinz.com
2022-05-11
A port with ships and containers. Beautiful.

This week is a first look at real “big” data: conventional data with many observations and variables that are not always as nicely formatted as you would like them to be. [1]

Lecture slides

Morning session slides

View full screen | Download as PDF

Afternoon session slides

View full screen | Download as PDF

Code

To be added.

Further recommended resources

tidyverse

For more information on the tidyverse, check out the following links:

data.table

For more information on data.table, check out the following links:

ggplot2

We have only just started with ggplot2, but if you already want to know more, check out the following links:


  1. Obviously, the exact definition of “big” is highly contested. This is what Wikipedia has to say about it.


Corrections

If you see mistakes or want to suggest changes, please create an issue on the source repository.