Return of the features: Efficient feature selection and interpretation for photometric redshifts

<i>Context<i/>. The explosion of data in recent years has generated an increasing need for new analysis techniques in order to extract knowledge from massive data-sets. Machine learning has proved particularly useful to perform this task. Fully automatized methods (e.g. deep neural netwo...

Full description

Saved in:
Bibliographic Details
Main Author: D'Isanto, Antonio (Author)
Format: Article (Journal)
Language:English
Published: 28 August 2018
In: Astronomy and astrophysics
Year: 2018, Volume: 616, Pages: A97
ISSN:1432-0746
DOI:10.1051/0004-6361/201833103
Online Access:Verlag, Volltext: http://dx.doi.org/10.1051/0004-6361/201833103
Get full text
Author Notes:A. D’Isanto, S. Cavuoti, F. Gieseke, and K.L. Polsterer

MARC

LEADER 00000caa a2200000 c 4500
001 1588165914
003 DE-627
005 20220815112805.0
007 cr uuu---uuuuu
008 190227s2018 xx |||||o 00| ||eng c
024 7 |a 10.1051/0004-6361/201833103  |2 doi 
035 |a (DE-627)1588165914 
035 |a (DE-576)518165914 
035 |a (DE-599)BSZ518165914 
035 |a (OCoLC)1341040473 
040 |a DE-627  |b ger  |c DE-627  |e rda 
041 |a eng 
084 |a 29  |2 sdnb 
100 1 |a D'Isanto, Antonio  |d 1985-  |e VerfasserIn  |0 (DE-588)1179200764  |0 (DE-627)1066594090  |0 (DE-576)518046990  |4 aut 
245 1 0 |a Return of the features  |b Efficient feature selection and interpretation for photometric redshifts  |c A. D’Isanto, S. Cavuoti, F. Gieseke, and K.L. Polsterer 
264 1 |c 28 August 2018 
300 |a 21 
336 |a Text  |b txt  |2 rdacontent 
337 |a Computermedien  |b c  |2 rdamedia 
338 |a Online-Ressource  |b cr  |2 rdacarrier 
500 |a Gesehen am 27.02.2019 
520 |a <i>Context<i/>. The explosion of data in recent years has generated an increasing need for new analysis techniques in order to extract knowledge from massive data-sets. Machine learning has proved particularly useful to perform this task. Fully automatized methods (e.g. deep neural networks) have recently gathered great popularity, even though those methods often lack physical interpretability. In contrast, feature based approaches can provide both well-performing models and understandable causalities with respect to the correlations found between features and physical processes.<i>Aims<i/>. Efficient feature selection is an essential tool to boost the performance of machine learning models. In this work, we propose a forward selection method in order to compute, evaluate, and characterize better performing features for regression and classification problems. Given the importance of photometric redshift estimation, we adopt it as our case study.<i>Methods<i/>. We synthetically created 4520 features by combining magnitudes, errors, radii, and ellipticities of quasars, taken from the Sloan Digital Sky Survey (SDSS). We apply a forward selection process, a recursive method in which a huge number of feature sets is tested through a k-Nearest-Neighbours algorithm, leading to a tree of feature sets. The branches of the feature tree are then used to perform experiments with the random forest, in order to validate the best set with an alternative model.<i>Results<i/>. We demonstrate that the sets of features determined with our approach improve the performances of the regression models significantly when compared to the performance of the classic features from the literature. The found features are unexpected and surprising, being very different from the classic features. Therefore, a method to interpret some of the found features in a physical context is presented.<i>Conclusions<i/>. The feature selection methodology described here is very general and can be used to improve the performance of machine learning models for any regression or classification task. 
773 0 8 |i Enthalten in  |t Astronomy and astrophysics  |d Les Ulis : EDP Sciences, 1969  |g 616(2018) Artikel-Nummer A97, 21 Seiten  |h Online-Ressource  |w (DE-627)253390222  |w (DE-600)1458466-9  |w (DE-576)072283351  |x 1432-0746  |7 nnas  |a Return of the features Efficient feature selection and interpretation for photometric redshifts 
773 1 8 |g volume:616  |g year:2018  |g pages:A97  |g extent:21  |a Return of the features Efficient feature selection and interpretation for photometric redshifts 
856 4 0 |u http://dx.doi.org/10.1051/0004-6361/201833103  |x Verlag  |x Resolving-System  |3 Volltext 
951 |a AR 
992 |a 20190227 
993 |a Article 
994 |a 2018 
998 |g 1179200764  |a D'Isanto, Antonio  |m 1179200764:D'Isanto, Antonio  |d 130000  |d 130001  |e 130000PD1179200764  |e 130001PD1179200764  |k 0/130000/  |k 1/130000/130001/  |p 1  |x j 
999 |a KXP-PPN1588165914  |e 3056820308 
BIB |a Y 
SER |a journal 
JSO |a {"language":["eng"],"recId":"1588165914","type":{"media":"Online-Ressource","bibl":"article-journal"},"note":["Gesehen am 27.02.2019"],"title":[{"title":"Return of the features","subtitle":"Efficient feature selection and interpretation for photometric redshifts","title_sort":"Return of the features"}],"person":[{"roleDisplay":"VerfasserIn","display":"D'Isanto, Antonio","role":"aut","family":"D'Isanto","given":"Antonio"}],"relHost":[{"title":[{"title_sort":"Astronomy and astrophysics","title":"Astronomy and astrophysics","subtitle":"an international weekly journal"}],"pubHistory":["1.1969 -"],"titleAlt":[{"title":"Astronomy & astrophysics"},{"title":"a European journal"}],"part":{"pages":"A97","year":"2018","extent":"21","text":"616(2018) Artikel-Nummer A97, 21 Seiten","volume":"616"},"type":{"media":"Online-Ressource","bibl":"periodical"},"disp":"Return of the features Efficient feature selection and interpretation for photometric redshiftsAstronomy and astrophysics","note":["Gesehen am 21.06.2024","Erscheint 36mal jährlich in 12 Bänden zu je 3 Ausgaben","Fortsetzung der Druck-Ausgabe"],"corporate":[{"role":"isb","roleDisplay":"Herausgebendes Organ","display":"European Southern Observatory"}],"language":["eng"],"recId":"253390222","origin":[{"publisherPlace":"Les Ulis ; Berlin ; Heidelberg","dateIssuedKey":"1969","publisher":"EDP Sciences ; Springer","dateIssuedDisp":"1969-"}],"id":{"issn":["1432-0746"],"eki":["253390222"],"zdb":["1458466-9"]},"name":{"displayForm":["European Southern Observatory (ESO)"]},"physDesc":[{"extent":"Online-Ressource"}]}],"physDesc":[{"extent":"21 S."}],"id":{"doi":["10.1051/0004-6361/201833103"],"eki":["1588165914"]},"origin":[{"dateIssuedDisp":"28 August 2018","dateIssuedKey":"2018"}],"name":{"displayForm":["A. D’Isanto, S. Cavuoti, F. Gieseke, and K.L. Polsterer"]}} 
SRT |a DISANTOANTRETURNOFTH2820