Interpretable machine learning for finding intermediate-mass black holes

Definitive evidence that globular clusters (GCs) host intermediate-mass black holes (IMBHs) is elusive. Machine-learning (ML) models trained on GC simulations can in principle predict IMBH host candidates based on observable features. This approach has two limitations: first, an accurate ML model is...

Full description

Saved in:
Bibliographic Details
Main Authors: Pasquato, Mario (Author) , Trevisan, Piero (Author) , Askar, Abbas (Author) , Lemos, Pablo (Author) , Carenini, Gaia (Author) , Mapelli, Michela (Author) , Hezaveh, Yashar (Author)
Format: Article (Journal)
Language:English
Published: 2024 April 10
In: The astrophysical journal
Year: 2024, Volume: 965, Issue: 1, Pages: 1-15
ISSN:1538-4357
DOI:10.3847/1538-4357/ad2261
Online Access:Verlag, kostenfrei, Volltext: https://doi.org/10.3847/1538-4357/ad2261
Verlag, kostenfrei, Volltext: https://www.webofscience.com/api/gateway?GWVersion=2&SrcAuth=DOISource&SrcApp=WOS&KeyAID=10.3847%2F1538-4357%2Fad2261&DestApp=DOI&SrcAppSID=EUW1ED0DCErOOKADzOXlXGFKvvplk&SrcJTitle=ASTROPHYSICAL+JOURNAL&DestDOIRegistrantName=American+Astronomical+Society
Get full text
Author Notes:Mario Pasquato, Piero Trevisan, Abbas Askar, Pablo Lemos, Gaia Carenini, Michela Mapelli, and Yashar Hezaveh
Description
Summary:Definitive evidence that globular clusters (GCs) host intermediate-mass black holes (IMBHs) is elusive. Machine-learning (ML) models trained on GC simulations can in principle predict IMBH host candidates based on observable features. This approach has two limitations: first, an accurate ML model is expected to be a black box due to complexity; second, despite our efforts to simulate GCs realistically, the simulation physics or initial conditions may fail to reflect reality fully. Therefore our training data may be biased, leading to a failure in generalization to observational data. Both the first issue-explainability/interpretability-and the second-out of distribution generalization and fairness-are active areas of research in ML. Here we employ techniques from these fields to address them: we use the anchors method to explain an Extreme Gradient Boosting (XGBoost) classifier; we also independently train a natively interpretable model using Certifiably Optimal RulE ListS (CORELS). The resulting model has a clear physical meaning, but loses some performance with respect to XGBoost. We evaluate potential candidates in real data based not only on classifier predictions but also on their similarity to the training data, measured by the likelihood of a kernel density estimation model. This measures the realism of our simulated data and mitigates the risk that our models may produce biased predictions by working in extrapolation. We apply our classifiers to real GCs, obtaining a predicted classification, a measure of the confidence of the prediction, an out-of-distribution flag, a local rule explaining the prediction of XGBoost, and a global rule from CORELS.
Item Description:Gesehen am 23.09.2024
Physical Description:Online Resource
ISSN:1538-4357
DOI:10.3847/1538-4357/ad2261