Getting Started

Assuming that a compatible annotated dataset is available (see Dataset Annotation), the general GinJinn2 workflow consists of

train(-validation)-test split of the dataset
GinJinn2 project intialization
Project configuration
Model training
Model evaluation
Model application (prediction)

In the following sections this workflow will be illustrated using a simulated dataset.

Preparation

First, make sure that GinJinn2 is installed (install instructions) and can be called from the terminal:

ginjinn -h

Simulation

GinJinn2 ships with a simple dataset simulation utility. The ginjinn simulate shapes command generates a dataset in COCO or PASCAL VOC format, comprising images with several triangles and circles. As usual, you can get the list of possible arguments by executing ginjinn simulate shapes -h.

For testing purposes, we will simulate a dataset called “shapes_ds” (-o shapes_ds) with 200 (-n 200) COCO-annotated (-a COCO) images:

ginjinn simulate shapes \
    -o shapes_ds \
    -n 200 \
    -a COCO

The previous command will create a new folder “shapes_ds” in your current working directory. Inside this folder, there is an “images” directory and an “annotations.json” file. The “images” folder contains the simulated images; “annotations.json” stores the corresponding instance-segmentation annotations in COCO format.

You can use ginjinn info -I shapes_ds to display dataset statistics like the number of images and instances per category.

optional: You can visualize annotations using the ginjinn vis command. The following command generates instance-segmentation visualizations (-v segmentation) for the simulated dataset (-I shapes_ds) in a new folder “shapes_ds_vis” (-o shapes_ds_vis).

ginjinn vis \
    -I shapes_ds \
    -o shapes_ds_vis \
    -v segmentation

The visualization of the simulated data will look similar to this:

Exemplary shapes simulation visualization

Now that we have an annotated dataset, we are ready to start with the GinJinn2 workflow.

1. Train-Validation-Test Split

Splitting datasets into training, validation (sometimes also called “development”), and test datasets is a common practice when working with predictive machine learning models. The training set, as implied by its name, is used to train a model, while the validation (“development”) set is used to assess its generalization capability for tuning hyperparameters like, for example, learning rate, batch size, and so on. Finally, the test set is used to get an unbiased measure of the model performance, since it has neither been used for training nor for hyperparameter tuning.

GinJinn2 provides the ginjinn split command for splitting datasets in COCO or PASCAL VOC format. We will split the simulated data (-I shapes_ds) into sub-datasets such that 60% of the images are used for training, 20% for validation (-v 0.2), and 20% for testing (-t 0.2) of an instance-segmentation model (-d instance-segmentation). The output datasets will be written to a new folder “shapes_ds_split” (-o shapes_ds_split). GinJinn2 implements a randomized heuristic for splitting the data, which tries to evenly distribute instances from different categories. Thus, when executing the following command, you will be asked whether you want to accept the proposed split or try again.

ginjinn split \
    -I shapes_ds \
    -o shapes_ds_split \
    -d instance-segmentation \
    -v 0.2 \
    -t 0.2

After executing the above command, a new folder “shapes_ds_split” will be created, containing the three subfolders “train”, “val”, and “test”. The subfolders will each contain a subset of the images along with corresponding annotations from the original dataset.

2. GinJinn2 Project Initialization

A GinJinn2 project is simply a folder containing a “ginjinn_config.yaml” file and an “outputs” folder. “ginjinn_config.yaml” specifies the project configuration including data, model, training, and augmentation settings. The “outputs” folder will be used to store intermediary outputs that are generated while training the model. Those include, for example, training and validation metrics, model checkpoints and TensorBoard-compatible outputs.

The ginjinn new command takes care of initializing a new GinJinn2 project. It expects the name of the project directory to be generated and, optionally, the path to a dataset folder (-d), and the name of a model template (-t). Available options for the latter are listed on the help page (ginjinn new -h). We will use ginjinn new to generate a new project “shapes_project” for instance segmentation with an Mask R-CNN (-t mask_rcnn_R_50_FPN_1x.yaml) using the split shapes dataset (-d shapes_ds_split).

ginjinn new shapes_project -t mask_rcnn_R_50_FPN_1x.yaml -d shapes_ds_split

After running the above command, there will a new folder “shapes_project”. This folder contains the configuration file “ginjinn_config.yaml” and the empty “ouputs” folder.

3. GinJinn2 Project Configuration

In this section, we will only very briefly touch the project configuration options. For a more in-depth discussion of the available options please refer to the project configuration document.

When opening the “ginjinn_config.yaml” file with a text editor (we recommend one with syntax highlighting for YAML files, e.g. VSCode), you can see that the input section is already filled with the paths of the datasets in “shapes_ds_split”, and the model is set to “mask_rcnn_R_50_FPN_1x”. For demonstration purposes, we will only modify some training options:

max_iter: total number of training steps
eval_period: number of iterations between evaluations of the validation dataset
checkpoint_period: number of iterations between saving model checkpoints

We will set those values to max_iter: 1000, eval_period: 100, checkpoint_period: 500. The training section of your “ginjinn_config.yaml” should now look like this:

training:
    learning_rate: 0.00125
    batch_size: 1
    max_iter: 1000
    eval_period: 100
    checkpoint_period: 500

The GinJinn2 project is now ready for training.

4. Model Training

The model can now be trained by simply running ginjinn train with the corresponding GinJinn2 project directory. For our “shapes_project” that is

ginjinn train shapes_project

After calling the above command, you will see commandline output describing the model, dataset, and a little bit later the training progress and the evaluation of the validation dataset. Additionally, the outputs folder will start becoming populated by several files.

“metrics.pdf”, “metrics.json”, and “events.out.*” are probably the most informative files while the model is training. “metrics.pdf” contains plots of several performance metrics considering the training and validation datasets. “metrics.json” contains the same information in JSON format. “events.out.*” files can be read by the TensorBoard application for a similar purpose. Below you can see an example of how “metrics.pdf” might look like after training:

After training, the “model_final.pth” file contains the final model weights, i.e. the trained model. Additionally, there are model checkpoint files, identified by the “model_” prefix and “.pth” suffix (e.g. “model_0000499.pth”), storing the model state at certain numbers of training iterations.

5. Model Evaluation

Once the model is trained, it can be evaluated using the test dataset. For this purpose, GinJinn2 provides the ginjinn evaluate command. We evaluate our shape detection model using:

ginjinn evaluate shapes_project

This will print the evaluation metrics to the console and write an “evaluation.csv” file to the project directory. Finally, you should compare the evaluation metrics of the validation set (see “metrics.pdf” or “metrics.json”) with those of test set to check for overfitting. In the case of our shapes_project, “segm/AP” in the last line of “metrics.json” should be around 90; the same should be the case for the “segm”-“AP” entry in “evaluation.csv”.

For our shapes project everything should look fine and we can start applying the trained model to new data.

6. Model Application

A model can be applied using the ginjinn predict command. This command requires a GinJinn2 project with a trained model and a folder containing the images (-i) to be used as input. By default, the predictions are written to the folder “predictions” inside the project directory; an alternative output folder can be specified using the -o option.

Let’s predict instance segmentations for the test dataset. By default, a COCO annotation file (JSON) containing the segmentation and/or bounding-box predictions will be generated. For this example application, we will also use the visualization (-v) and cropping (-c) output options.

The following command predicts instance segmentations for the test dataset and writes outputs to “shapes_prediction”.

ginjinn predict \
    shapes_project \
    -i shapes_ds_split/test/images \
    -o shapes_prediction \
    -c \
    -v

Visualizations of the predictions and cropped segmentation masks will look similar to this:

Real Data

Of course, we can not only predict on images from the test dataset, but on any kind of image. Here is an example with an input image of shapes drawn on a whiteboard, captured with a smartphone camera:

The command to generate the above predication was

ginjinn predict \
    shapes_project \
    -i test_images \
    -o test_images_pred \
    -c \
    -v \
    -r

Conclusion

We have applied GinJinn2 for instance segmentation using simulated data. If you want to see how GinJinn2 can be used for object detection and instance segmentation with empirical data, have a look at the Empirical Applications document.

For information on GinJinn2 project configurations see Project Configuration.