F. Ragusa1,2, D. DiMauro1, A. Palermo1, A. Furnari1, G. M. Farinella1
We consider the problem of object segmentation in cultural sites. Since collecting and labeling large datasets of real images is challenging, we investigate whether the use of synthetic images can be useful to achieve good segmentation performance on real data. To perform the study, we collected a new dataset comprising both real and synthetic images of 24 artworks in a cultural site. The experimental results point out that the use of synthetic data helps to improve the performances of segmentation algorithms when tested on real images. Satisfactory performance is achieved exploiting semantic segmentation together with image-to-image translation and including a small amount of real data during training. The constributions of this work are the following:
We consider the cultural site Palazzo Bellomo of the EGO-CH dataset. The dataset has been acquired using a head-mounted Microsoft HoloLens device.
We have manual annotated 24 objects from 11 environments with semantic masks. In particular, we have annotated 4740 images from the training set of EGO-CH dataset and 848 images from the test set.
2. Synthetic domain
Real Training Data | Accuracy% | Class Accuracy% | Mean IoU% | FWAVACC% | |
---|---|---|---|---|---|
PSPNet_R |
5% | 70.94 | 44.59 | 29.34 | 56.65 |
10% | 76.46 | 50.10 | 33.53 | 63.48 | |
25% | 80.22 | 60.57 | 43.76 | 68.19 | |
50% | 83.51 | 63.71 | 50.21 | 72.41 | |
100% | 83.72 | 64.47 | 49.49 | 72.62 | |
PSPNet_S+R |
0% | 58.32 | 8.45 | 5.50 | 35.60 |
5% | 71.41 | 47.50 | 31.41 | 56.06 | |
10% | 80.39 | 64.92 | 43.76 | 68.67 | |
25% | 83.02 | 62.04 | 47.61 | 71.34 | |
50% | 84.23 | 69.84 | 51.66 | 74.31 | |
100% | 84.78 | 63.37 | 50.53 | 73.78 | |
PSPNet+CycleGAN |
0% | 55.99 | 7.65 | 3.77 | 31.90 |
5% | 84.17 | 75.54 | 55.91 | 74.78 | |
10% | 89.07 | 78.26 | 63.61 | 81.17 | |
25% | 89.87 | 79.94 | 62.57 | 82.30 | |
50% | 89.95 | 76.68 | 67.14 | 82.06 | |
100% | 91.06 | 81.07 | 66.93 | 83.98 |
F. Ragusa, D. DiMauro, A. Palermo, A. Furnari, G. M. Farinella. Synthetic vs Real. Objects Segmentation in Cultural Heritage. In International Conference on Pattern Recognition (ICPR), 2020. Download the paper.