detrex.data

class detrex.data.DetrDatasetMapper(augmentation, augmentation_with_crop, is_train=True, mask_on=False, img_format='RGB')[source]

A callable which takes a dataset dict in Detectron2 Dataset format, and map it into the format used by DETR.

The callable currently does the following:

Read the image from “file_name”
Applies geometric transforms to the image and annotation
Find and applies suitable cropping to the image and annotation
Prepare image and annotation to Tensors

Parameters

augmentation (list[detectron.data.Transforms]) – The geometric transforms for the input raw image and annotations.
augmentation_with_crop (list[detectron.data.Transforms]) – The geometric transforms with crop.
is_train (bool) – Whether to load train set or val set. Default: True.
mask_on (bool) – Whether to return the mask annotations. Default: False.
img_format (str) – The format of the input raw images. Default: RGB.

Because detectron2 did not implement RandomSelect augmentation. So we provide both augmentation and augmentation_with_crop here and randomly apply one of them to the input raw images.