Unsupervised representation learning by discovering reliable image relations

Learning robust representations that allow to reliably establish relations between images is of paramount importance for virtually all of computer vision. Annotating the quadratic number of pairwise relations between training images is simply not feasible, while unsupervised inference is prone to no...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Milbich, Timo (VerfasserIn) , Ghori, Omair (VerfasserIn) , Diego, Ferran (VerfasserIn) , Ommer, Björn (VerfasserIn)
Dokumenttyp: Article (Journal)
Sprache:Englisch
Veröffentlicht: 17 January 2020
In: Pattern recognition
Year: 2020, Jahrgang: 102
DOI:10.1016/j.patcog.2019.107107
Online-Zugang:Verlag, lizenzpflichtig, Volltext: https://doi.org/10.1016/j.patcog.2019.107107
Verlag, lizenzpflichtig, Volltext: http://www.sciencedirect.com/science/article/pii/S003132031930408X
Volltext
Verfasserangaben:Timo Milbich, Omair Ghori, Ferran Diego, Björn Ommer
Beschreibung
Zusammenfassung:Learning robust representations that allow to reliably establish relations between images is of paramount importance for virtually all of computer vision. Annotating the quadratic number of pairwise relations between training images is simply not feasible, while unsupervised inference is prone to noise, thus leaving the vast majority of these relations to be unreliable. To nevertheless find those relations which can be reliably utilized for learning, we follow a divide-and-conquer strategy: We find reliable similarities by extracting compact groups of images and reliable dissimilarities by partitioning these groups into subsets, converting the complicated overall problem into few reliable local subproblems. For each of the subsets we obtain a representation by learning a mapping to a target feature space so that their reliable relations are kept. Transitivity relations between the subsets are then exploited to consolidate the local solutions into a concerted global representation. While iterating between grouping, partitioning, and learning, we can successively use more and more reliable relations which, in turn, improves our image representation. In experiments, our approach shows state-of-the-art performance on unsupervised classification on ImageNet with 46.0% and competes favorably on different transfer learning tasks on PASCAL VOC.
Beschreibung:Gesehen am 16.12.2020
Beschreibung:Online Resource
DOI:10.1016/j.patcog.2019.107107