This repository was archived by the owner on Apr 11, 2023. It is now read-only.

Description
The description in Setup:
The datasets you will download (most of them compressed) have a combined size of only ~ 3.5 GB.
The description in Downloading Data from S3:
The size of the dataset is approximately 20 GB.
They are all data downloaded by running script/setup. Why not the same amount of data? Which one is right? Does 3.5G refer only to the size of the dataset per programming language?