Santi Andrea Orlando1, 2, **, Antonino Furnari1, ** and Giovanni Maria Farinella1, 3, **
We propose an unpaired domain adaptation benchmark for virtual-to-real image-based localization in cultural sites. We focus on using image-to-image translation and mid-level representations to bridge the domain gap and localize First Person Vision navigation in the Galleria Regionale Palazzo Bellomo, located in Siracusa, Italy. The contributions of this work are the following:
Table 1: Room-based localization results considering different combinations of RGB images and mid-level representations. Results are reported with and without image-to-image translation.
Table 2: Accuracy and F1 scores obtained by the compared methods on each class.
Table 3: 3DOF localization results considering different combinations of RGB images and mid-level representations. Results are reported with and without image-to-image translation.
Table 4: Summary of 3DOF localization results.
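As an illustration of the kind of per-class scores summarized in Table 2, the following is a minimal sketch of how overall accuracy and per-class F1 could be computed from room predictions. It is not the authors' evaluation code: the function names, the one-vs-rest definition of precision and recall, and the use of integer room labels are all assumptions made for the example.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of frames whose predicted room matches the ground truth."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def per_class_scores(y_true, y_pred, classes):
    """Return {class: (precision, recall, f1)}, computed one-vs-rest.

    Assumption: labels are hashable room identifiers (e.g. integers).
    """
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted room p, but it was wrong
            fn[t] += 1  # true room t was missed
    scores = {}
    for c in classes:
        precision = tp[c] / (tp[c] + fp[c]) if tp[c] + fp[c] else 0.0
        recall = tp[c] / (tp[c] + fn[c]) if tp[c] + fn[c] else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        scores[c] = (precision, recall, f1)
    return scores


# Toy example with three hypothetical rooms.
y_true = [1, 1, 2, 2, 3]
y_pred = [1, 2, 2, 2, 3]
scores = per_class_scores(y_true, y_pred, classes=[1, 2, 3])
overall = accuracy(y_true, y_pred)
```

Classes that never appear in the predictions or in the ground truth receive a score of 0.0 here rather than raising a division error; a real evaluation might prefer to report them as undefined.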
For the virtual domain, we used a dataset collected from the 3D model of the cultural site Palazzo Bellomo, acquired with a Matterport 3D scanner.
The dataset comprises 4 simulated navigations, generated by simulating visits inside the museum. The videos were acquired at 5 frames
per second. We used the 3DOF camera pose and the current room (Context) to study image-based localization and classification of the
11 contexts of the museum.
For more details about this dataset, go to this page.
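To make the 3DOF localization setting concrete, the sketch below shows one common way to measure the error between a predicted and a ground-truth pose. It assumes a pose is stored as a tuple `(x, z, theta_deg)`, i.e. a 2D position on the floor plane plus a heading angle in degrees; this layout, the units, and the function names are assumptions for illustration and are not taken from the dataset documentation.

```python
import math


def pose_error(pred, gt):
    """Return (position_error, angular_error_deg) between two 3DOF poses.

    Assumption: each pose is (x, z, theta_deg), with theta in degrees.
    """
    position_error = math.hypot(pred[0] - gt[0], pred[1] - gt[1])
    # Wrap the heading difference so that 350 deg vs 10 deg counts as 20 deg.
    diff = abs(pred[2] - gt[2]) % 360.0
    angular_error = min(diff, 360.0 - diff)
    return position_error, angular_error


def mean_errors(preds, gts):
    """Average position and angular errors over a sequence of frames."""
    errors = [pose_error(p, g) for p, g in zip(preds, gts)]
    n = len(errors)
    return (sum(e[0] for e in errors) / n, sum(e[1] for e in errors) / n)


# Toy example with made-up poses.
pos_err, ang_err = pose_error((3.0, 4.0, 350.0), (0.0, 0.0, 10.0))
mean_pos, mean_ang = mean_errors(
    [(3.0, 4.0, 350.0), (0.0, 0.0, 90.0)],
    [(0.0, 0.0, 10.0), (0.0, 0.0, 90.0)],
)
```

Wrapping the angular difference matters because headings near 0 and 360 degrees describe almost the same orientation; without it, small heading errors across that boundary would be reported as very large.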
The real images of the same cultural site come from the EGO-CH dataset. This data was collected by visitors wearing a Microsoft HoloLens device. We used 10 video
sequences labeled with the room location. The frames were extracted at 5 frames per second, selecting only the frames related to the 1st floor of the
museum.
For more details about this dataset, go to this page.
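The frame extraction step described above can be sketched as a simple index selection. The example assumes the original frame rate of each video is known (the source frame rate of the HoloLens recordings is not stated here, so the values below are placeholders) and that frames are addressed by integer index; `subsample_indices` is a hypothetical helper, not part of any dataset tooling.

```python
def subsample_indices(num_frames, source_fps, target_fps=5.0):
    """Return the indices of the frames to keep when reducing the frame rate.

    Assumption: frames are evenly spaced in time at `source_fps`.
    """
    if source_fps <= 0 or target_fps <= 0:
        raise ValueError("frame rates must be positive")
    # Never upsample: keep every frame if the video is already at or below
    # the target rate.
    step = max(source_fps / target_fps, 1.0)
    indices = []
    k = 0
    while True:
        idx = int(k * step)
        if idx >= num_frames:
            break
        indices.append(idx)
        k += 1
    return indices


# Placeholder example: one second of video at 30 fps reduced to 5 fps.
kept = subsample_indices(num_frames=30, source_fps=30.0, target_fps=5.0)
```

Restricting the result to the 1st floor would then be a matter of filtering the kept frames by their room label, using whatever room-to-floor mapping the annotations provide.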
Fig. 1: Examples of mid-level representations together with the corresponding RGB images.
Fig. 2: Illustration of the proposed pipeline for domain adaptation.
This research is supported by XENIA Progetti - DWORD, by the project VALUE - Visual Analysis for Localization and Understanding of Environments (N. 08CT6209090207, CUP G69J18001060007) granted by PO FESR 2014/2020 - Azione 1.1.5 - "Sostegno all’avanzamento tecnologico delle imprese attraverso il finanziamento di linee pilota e azioni di validazione precoce dei prodotti e di dimostrazioni su larga scala", and by Piano della Ricerca 2016-2018 linea di Intervento 2 of DMI, University of Catania. The authors would like to thank Regione Siciliana Assessorato dei Beni Culturali dell'Identità Siciliana - Dipartimento dei Beni Culturali e dell'Identità Siciliana and Polo regionale di Siracusa per i siti culturali - Galleria Regionale di Palazzo Bellomo.