Dataset Download and Usage

After having created and prepared the dataset, and adding our desired augmentations, we download the dataset.

To download the dataset, first choose the version you would like to download (see Data Augmentation on creating versions). There are multiple ways of downloading your desired version, depending on your use-case.

Direct .zip Download:

  1. Click on the version to download under Versions

  2. Click on Export Dataset (left)

  3. Export:

    a) Format: YOLOv8 (currently used)

    b) Select download zip to computer

    c) Uncheck Also train a model etc.

    d) Click Continue

  4. Download the zip file

Downloading Using Comand-Line Interface:

Use this when testing on a Jupyter Notebook and/or using Cloud Computing services.

See https://docs.roboflow.com/datasets/exporting-data  for information.

Also see https://docs.roboflow.com/roboflow-cli/download-dataset  for alternative download methods.

 

Data Recombination

Extract all zip files to the same location. It is fine to have the README and .yaml files overwrite each other, as the READMEs can be ignored and the .yaml are almost identical (the differences are in the Roboflow section).

Ideally, the image order is randomized. However, labels are required to correspond to the same image.

Roboflow provides hashes at the end of the file names, which is sufficient and can be used as the prefix. Bulk Rename Utility is used to rename files: https://www.bulkrenameutility.co.uk/.

Bulk Rename Utility settings:

  • Move/Copy Parts (6): Copy last n, 32, To start, 1, Sep.: . (dot)

Apply the rename to all files (test, train, valid).

The 3 directories now contain the dataset and are ready for training.

Â