Visual camera re-localization from RGB and RGB-D images using DSAC

We describe a learning-based system that estimates the camera position and orientation from a single input image relative to a known environment. The system is flexible w.r.t. the amount of information available at test and at training time, catering to different applications. Input images can be RG...

Full description

Saved in:
Bibliographic Details
Main Authors: Brachmann, Eric (Author) , Rother, Carsten (Author)
Format: Article (Journal) Chapter/Article
Language:English
Published: 31 Aug 2020
In: Arxiv

Online Access:Verlag, lizenzpflichtig, Volltext: http://arxiv.org/abs/2002.12324
Get full text
Author Notes:Eric Brachmann and Carsten Rother

MARC

LEADER 00000caa a2200000 c 4500
001 1731780621
003 DE-627
005 20220818195101.0
007 cr uuu---uuuuu
008 200914s2020 xx |||||o 00| ||eng c
035 |a (DE-627)1731780621 
035 |a (DE-599)KXP1731780621 
035 |a (OCoLC)1341358936 
040 |a DE-627  |b ger  |c DE-627  |e rda 
041 |a eng 
084 |a 28  |2 sdnb 
100 1 |a Brachmann, Eric  |d 1987-  |e VerfasserIn  |0 (DE-588)1179206088  |0 (DE-627)1066600457  |0 (DE-576)518117634  |4 aut 
245 1 0 |a Visual camera re-localization from RGB and RGB-D images using DSAC  |c Eric Brachmann and Carsten Rother 
264 1 |c 31 Aug 2020 
300 |a 18 
336 |a Text  |b txt  |2 rdacontent 
337 |a Computermedien  |b c  |2 rdamedia 
338 |a Online-Ressource  |b cr  |2 rdacarrier 
500 |a Gesehen am 14.09.2020 
520 |a We describe a learning-based system that estimates the camera position and orientation from a single input image relative to a known environment. The system is flexible w.r.t. the amount of information available at test and at training time, catering to different applications. Input images can be RGB-D or RGB, and a 3D model of the environment can be utilized for training but is not necessary. In the minimal case, our system requires only RGB images and ground truth poses at training time, and it requires only a single RGB image at test time. The framework consists of a deep neural network and fully differentiable pose optimization. The neural network predicts so called scene coordinates, i.e. dense correspondences between the input image and 3D scene space of the environment. The pose optimization implements robust fitting of pose parameters using differentiable RANSAC (DSAC) to facilitate end-to-end training. The system, an extension of DSAC and referred to as DSAC*, achieves state-of-the-art accuracy an various public datasets for RGB-based re-localization, and competitive accuracy for RGB-D-based re-localization. 
650 4 |a Computer Science - Computer Vision and Pattern Recognition 
650 4 |a Computer Science - Machine Learning 
700 1 |a Rother, Carsten  |e VerfasserIn  |0 (DE-588)1181464692  |0 (DE-627)1662676883  |4 aut 
773 0 8 |i Enthalten in  |t Arxiv  |d Ithaca, NY : Cornell University, 1991  |g (2020) Artikel-Nummer 2002.12324, 18 Seiten  |h Online-Ressource  |w (DE-627)509006531  |w (DE-600)2225896-6  |w (DE-576)28130436X  |7 nnas  |a Visual camera re-localization from RGB and RGB-D images using DSAC 
773 1 8 |g year:2020  |g extent:18  |a Visual camera re-localization from RGB and RGB-D images using DSAC 
787 0 8 |i Forschungsdaten  |a Brachmann, Eric, 1987 -   |t DSAC* visual re-localization [data]  |d Heidelberg : Universität, 2020  |h 1 Online-Ressource (5 Files)  |w (DE-627)1731779909 
856 4 0 |u http://arxiv.org/abs/2002.12324  |x Verlag  |z lizenzpflichtig  |3 Volltext 
951 |a AR 
992 |a 20200914 
993 |a Article 
994 |a 2020 
998 |g 1181464692  |a Rother, Carsten  |m 1181464692:Rother, Carsten  |d 700000  |d 708070  |d 700000  |d 728500  |e 700000PR1181464692  |e 708070PR1181464692  |e 700000PR1181464692  |e 728500PR1181464692  |k 0/700000/  |k 1/700000/708070/  |k 0/700000/  |k 1/700000/728500/  |p 2  |y j 
998 |g 1179206088  |a Brachmann, Eric  |m 1179206088:Brachmann, Eric  |p 1  |x j 
999 |a KXP-PPN1731780621  |e 3752859679 
BIB |a Y 
JSO |a {"name":{"displayForm":["Eric Brachmann and Carsten Rother"]},"origin":[{"dateIssuedDisp":"31 Aug 2020","dateIssuedKey":"2020"}],"id":{"eki":["1731780621"]},"physDesc":[{"extent":"18 S."}],"relHost":[{"id":{"zdb":["2225896-6"],"eki":["509006531"]},"origin":[{"dateIssuedKey":"1991","publisher":"Cornell University ; Arxiv.org","dateIssuedDisp":"1991-","publisherPlace":"Ithaca, NY ; [Erscheinungsort nicht ermittelbar]"}],"physDesc":[{"extent":"Online-Ressource"}],"title":[{"title_sort":"Arxiv","title":"Arxiv"}],"recId":"509006531","language":["eng"],"disp":"Visual camera re-localization from RGB and RGB-D images using DSACArxiv","type":{"bibl":"edited-book","media":"Online-Ressource"},"note":["Gesehen am 28.05.2024"],"titleAlt":[{"title":"Arxiv.org"},{"title":"Arxiv.org e-print archive"},{"title":"Arxiv e-print archive"},{"title":"De.arxiv.org"}],"part":{"year":"2020","extent":"18","text":"(2020) Artikel-Nummer 2002.12324, 18 Seiten"},"pubHistory":["1991 -"]}],"person":[{"role":"aut","roleDisplay":"VerfasserIn","display":"Brachmann, Eric","given":"Eric","family":"Brachmann"},{"role":"aut","roleDisplay":"VerfasserIn","display":"Rother, Carsten","given":"Carsten","family":"Rother"}],"title":[{"title":"Visual camera re-localization from RGB and RGB-D images using DSAC","title_sort":"Visual camera re-localization from RGB and RGB-D images using DSAC"}],"note":["Gesehen am 14.09.2020"],"type":{"media":"Online-Ressource","bibl":"chapter"},"language":["eng"],"recId":"1731780621"} 
SRT |a BRACHMANNEVISUALCAME3120