Quark-versus-gluon tagging in CMS Open Data with CWoLa and TopicFlow

We use the CMS Open Data to examine the performance of weakly-supervised learning for tagging quark and gluon jets at the LHC. We target Z+jet and dijet events as respective quark- and gluon-enriched mixtures and derive samples both from data taken in 2011 at 7 TeV, and from Monte Carlo. CWoLa and T...

Bibliographic details
Main authors: Dolan, Matthew J. (Author), Gargalionis, John (Author), Ore, Ayodele (Author)
Document type: Article (Journal)
Language: English
Published: August 4, 2025
In: Journal of high energy physics
Year: 2025, Issue: 8, Pages: 1-37
ISSN: 1029-8479
DOI: 10.1007/JHEP08(2025)024
Online access: Publisher, free of charge, full text: https://doi.org/10.1007/JHEP08(2025)024
Publisher, free of charge, full text: https://link.springer.com/article/10.1007/JHEP08(2025)024
Statement of responsibility: Matthew J. Dolan, John Gargalionis and Ayodele Ore

MARC

LEADER 00000naa a2200000 c 4500
001 1945028475
003 DE-627
005 20251208125108.0
007 cr uuu---uuuuu
008 251208s2025 xx |||||o 00| ||eng c
024 7 |a 10.1007/JHEP08(2025)024  |2 doi 
035 |a (DE-627)1945028475 
035 |a (DE-599)KXP1945028475 
040 |a DE-627  |b ger  |c DE-627  |e rda 
041 |a eng 
084 |a 29  |2 sdnb 
100 1 |a Dolan, Matthew J.  |e VerfasserIn  |0 (DE-588)1205385843  |0 (DE-627)1691022144  |4 aut 
245 1 0 |a Quark-versus-gluon tagging in CMS Open Data with CWoLa and TopicFlow  |c Matthew J. Dolan, John Gargalionis and Ayodele Ore 
264 1 |c August 4, 2025 
300 |b Illustrationen 
300 |a 37 
336 |a Text  |b txt  |2 rdacontent 
337 |a Computermedien  |b c  |2 rdamedia 
338 |a Online-Ressource  |b cr  |2 rdacarrier 
500 |a Gesehen am 08.12.2025 
520 |a We use the CMS Open Data to examine the performance of weakly-supervised learning for tagging quark and gluon jets at the LHC. We target Z+jet and dijet events as respective quark- and gluon-enriched mixtures and derive samples both from data taken in 2011 at 7 TeV, and from Monte Carlo. CWoLa and TopicFlow models are trained on real data and compared to fully-supervised classifiers trained on simulation. In order to obtain estimates for the discrimination power in real data, we consider three different estimates of the quark/gluon mixture fractions in the data. Compared to when the models are evaluated on simulation, we find reversed rankings for the fully- and weakly-supervised approaches. Further, these rankings based on data are robust to the estimate of the mixture fraction in the test set. Finally, we use TopicFlow to smooth statistical fluctuations in the small testing set, and to provide uncertainty on the performance in real data. 
650 4 |a Jets and Jet Substructure 
650 4 |a Parton Shower 
700 1 |a Gargalionis, John  |e VerfasserIn  |0 (DE-588)1383734356  |0 (DE-627)1945029269  |4 aut 
700 1 |a Ore, Ayodele  |e VerfasserIn  |0 (DE-588)1373325461  |0 (DE-627)1932808256  |4 aut 
773 0 8 |i Enthalten in  |t Journal of high energy physics  |d Berlin : Springer, 1997  |g (2025), 8 vom: Aug., Artikel-ID 24, Seite 1-37  |h Online-Ressource  |w (DE-627)320910571  |w (DE-600)2027350-2  |w (DE-576)095428305  |x 1029-8479  |7 nnas  |a Quark-versus-gluon tagging in CMS Open Data with CWoLa and TopicFlow 
773 1 8 |g year:2025  |g number:8  |g month:08  |g elocationid:24  |g pages:1-37  |g extent:37  |a Quark-versus-gluon tagging in CMS Open Data with CWoLa and TopicFlow 
856 4 0 |u https://doi.org/10.1007/JHEP08(2025)024  |x Verlag  |x Resolving-System  |z kostenfrei  |3 Volltext  |7 0 
856 4 0 |u https://link.springer.com/article/10.1007/JHEP08(2025)024  |x Verlag  |z kostenfrei  |3 Volltext  |7 0 
951 |a AR 
992 |a 20251208 
993 |a Article 
994 |a 2025 
998 |g 1373325461  |a Ore, Ayodele  |m 1373325461:Ore, Ayodele  |d 130000  |d 130300  |e 130000PO1373325461  |e 130300PO1373325461  |k 0/130000/  |k 1/130000/130300/  |p 3  |y j 
999 |a KXP-PPN1945028475  |e 4824367131 
BIB |a Y 
SER |a journal 
JSO |a {"person":[{"display":"Dolan, Matthew J.","given":"Matthew J.","role":"aut","family":"Dolan"},{"display":"Gargalionis, John","family":"Gargalionis","role":"aut","given":"John"},{"display":"Ore, Ayodele","role":"aut","given":"Ayodele","family":"Ore"}],"relHost":[{"part":{"issue":"8","extent":"37","text":"(2025), 8 vom: Aug., Artikel-ID 24, Seite 1-37","year":"2025","pages":"1-37"},"id":{"zdb":["2027350-2"],"eki":["320910571"],"issn":["1029-8479"]},"corporate":[{"role":"isb","display":"Institute of Physics"}],"pubHistory":["Nachgewiesen 1997 -"],"titleAlt":[{"title":"JHEP"}],"note":["Gesehen am 02.12.20"],"type":{"bibl":"periodical","media":"Online-Ressource"},"language":["eng"],"title":[{"subtitle":"JHEP ; a refereed journal written, run, and distributed by electronic means","title_sort":"Journal of high energy physics","title":"Journal of high energy physics"}],"origin":[{"publisher":"Springer ; SISSA ; IOP Publ.","publisherPlace":"Berlin ; Heidelberg ; [Trieste] ; Bristol","dateIssuedKey":"1997","dateIssuedDisp":"1997-"}],"disp":"Quark-versus-gluon tagging in CMS Open Data with CWoLa and TopicFlowJournal of high energy physics","physDesc":[{"extent":"Online-Ressource"}],"recId":"320910571"}],"origin":[{"dateIssuedDisp":"August 4, 2025","dateIssuedKey":"2025"}],"title":[{"title":"Quark-versus-gluon tagging in CMS Open Data with CWoLa and TopicFlow","title_sort":"Quark-versus-gluon tagging in CMS Open Data with CWoLa and TopicFlow"}],"note":["Gesehen am 08.12.2025"],"type":{"media":"Online-Ressource","bibl":"article-journal"},"language":["eng"],"recId":"1945028475","physDesc":[{"noteIll":"Illustrationen","extent":"37 S."}],"name":{"displayForm":["Matthew J. Dolan, John Gargalionis and Ayodele Ore"]},"id":{"doi":["10.1007/JHEP08(2025)024"],"eki":["1945028475"]}} 
SRT |a DOLANMATTHQUARKVERSU4202