Classify the Higgs decays with the PFN and ParticleNet at electron–positron colliders

Gang Li; Libo Liao; Xinchou Lou; Peixun Shen; Weimin Song; Shudong Wang; Zhaoling Zhang

doi:10.1088/1674-1137/ac7f21

Abstract：

Various Higgs factories are proposed to study the Higgs boson precisely and systematically in a model- independent way. In this study, the Particle Flow Network and ParticleNet techniques are used to classify the Higgs decays into multicategories, and the ultimate goal is to realize an "end-to-end" analysis. A Monte Carlo simulation study is performed to demonstrate the feasibility, and the performance looks rather promising. This result could be the basis of a "one-stop" analysis to measure all the branching fractions of the Higgs decays simultaneously.

HTML

I. INTRODUCTION

The historic observation of the Higgs boson in 2012 at the Large Hadron Collider (LHC) [1, 2] declared the discovery of the last missing piece of the most fundamental building blocks in the Standard Model (SM). The SM has been remarkably successful in describing experimental phenomena. However, a precision Higgs physics program would be critically important given that the SM does not predict the parameters in the Higgs potential, nor does it involve particle candidates for dark matter. The precision determination of the Higgs couplings to the SM particles, gauge bosons and leptons/quarks, are the agents probing the Higgs mechanism for generating masses [3]. In particular, potential observable deviations of the Higgs couplings from the SM expectations would indicate new physics [4]. Therefore, the Higgs discovery marks the beginning of a new era of both theoretical and experimental exploration. Various $ e^+e^- $ colliders were proposed as Higgs factories by the high energy physics community, such as ILC [5], CLIC [6], FCC-ee [7], and CEPC [8, 9].

The most important advantages of a Higgs factory are that the center of mass (CM) energy is precisely defined and that they could perform absolute measurements of the Higgs boson. Neglecting Z fusion production, in an $ e^+e^-\to ZH $ event, where the Z decays into a pair of visible fermions or their stable decay final states ($Z\rightarrow e^+e^-,\; \mu^+\mu^-,\; \tau^+\tau^-,\; \rm{or}\; q\bar{q}$), the Higgs boson can be identified from the kinematics of those fermion pairs or their stable daughters independent of the Higgs decays. For example, the $ Z\to e^+e^- $ and $ \mu^+\mu^- $ modes are studied systematically in Refs. [10, 11]. The production cross-section and most of the decay branching fractions of the Higgs could be measured model-independently by the counting method. For example, CEPC could measure the cross-section of $ e^+e^-\rightarrow ZH $, $ \sigma(ZH) $, at 240 GeV, to a precision of 0.5% and the branching fractions of the Higgs boson to a few percent, respectively, by combining the four decay modes of the Z boson [11, 9].

The physics goal of a Higgs factory must be accomplished by optimizing the detector design and making use of the latest developments in data science. Recently, various Machine Learning (ML) techniques have already shown very promising performance in data analysis for high energy physics [12], in particular for jet studies. For instance, jets are treated as images [13–18], as sequences [19–22], as trees [23, 24], as graphs [25], or sets [26, 27] of particles, and ML techniques, most notably deep neural networks (DNNs), are used to build new jet tagging algorithms automatically from (labeled) simulated samples and even data [28–31]. While the above ML techniques are used at jet-level for case studies, they naturally can be applied for the event level in $ e^+e^- $ collisions, which have much simpler topologies and are pile-up free.

In this article, two ML approaches are used to study the classification problem of Higgs events. The classification results can serve as the basis of an "end-to-end" (E2E) analysis, which enables the simultaneous analysis of almost all the Higgs decays modes with the state-of-the-art ML techniques, starting with particle-level information and ending with physics observables. The approach also is a "one-stop" analysis to support extracting all Higgs couplings and taking into account the correlations and commonalities of the same detector for the experiment. Throughout this paper, the term "one-stop" analysis refers to an analysis method used to extract multiple observables of the same type at once. It differs from a conventional analysis in several ways. First, because many physics observables are measured using modern ML techniques at the same time, "one-stop" analysis is more efficient. Second, ML techniques usually deploy more information. Instead of only some limited number of selection criteria being used in conventional analysis, four-momenta and impact parameters (only charged tracks) of all particles in an event will be used by the ML techniques. Third, "one-stop" analysis could take into account the correlations and commonalities of the same detector for the experiment. Because all the measurements and their correlations are obtained in a consistent way, creating a combination based on these measurements will be easy.

The rest of this paper is organized as follows. The ML methods used in this study are introduced in Sec. II, followed by the implementation of the ML methods with a Monte Carlo (MC) simulation in Sec. III. Finally, a summary is presented.

II. MACHINE LEARNING METHODS

Recently, various ML techniques were proposed for jet tagging studies. Among them, PFN [26] and ParticleNet [27] achieved superior performance.

In the original publication of PFN [26], the authors applied the Deep Sets concept [32] to the jet-tagging problem. They proposed two elegant model architectures, named EnergyFlow Network (EFN) and ParticleFlow Network (PFN), with provable physics properties, such as infrared and colinear safety. In these two architectures, the features of each particle are encoded into a latent space of Φ [32] and the category, F, is extracted from the summed representation in that latent space. Both Φ and F are approximated by neural networks. The key mathematical fact is that a generic function of a set of particles can be decomposed into an arbitrarily good approximation according to the Deep Set Theorem [32]. The performance of these models in classification problems is comparable with other more complicated models. The authors also tried to interpret and visualize what the model has learned [26].

Motivated by the success of CNNs, the ParticleNet [27] approach based on the Dynamic Graph Convolutional Neural Network (DGCNN) is proposed for learning on particle cloud data. The edge convolution ("EdgeConv") operation, a convolution-like operation for point clouds, is used instead of the regular convolution operation. One important feature of the EdgeConv operation is that it can be easily stacked, just like regular convolutions. Therefore, another EdgeConv operation can be applied subsequently, which makes it possible to learn features of point clouds hierarchically. Another important feature is that the proximity of points can be dynamically learned with EdgeConv operations. The study shows that the graph describing the point clouds is dynamically updated to reflect the changes in the edges, i.e., the neighbors of each point. Reference [27] shows that this leads to better performance than keeping the graph static.

As suggested by the authors [26] and according to the performances of EFN and PFN, we choose PFN and ParticleNet to classify the Higgs decays. This ML attempt contains some distinct features in contrast to conventional data analysis. First, the ML approach is used to classify many physics processes at the same time. If some tiny decays are neglected, there are about 9 branching fractions of the Higgs decays to be measured. The number of classes is greater than 9 when the SM backgrounds are included. In addition, the classification results could be the basis of an E2E analysis, which means that all the particle-level information, such as four-momenta, PID, and impact parameters of charged particles, is used as input directly, and the network calculates the scores of each event. In this case, the analysis no longer needs some dedicated and complicated reconstruction tools, such as lepton/photon isolation, jet-clustering, τ finder, etc.

IV. SUMMARY AND DISCUSSION

In this paper, we presented a study of the classification of the Higgs decays with the state-of-the-art ML approaches at electron–positron colliders. We deploy the ML techniques and try to classify both the signal and background events with only particle-level information and to obtain the confusion matrices, which can be used in further data analysis. This approach is the basis of an efficient and balanced "one-stop" analysis, which makes it possible to measure all Higgs branching fractions using all detector information and taking all the commonalities and correlations into account. For the analyses of tens or hundreds of channels, they can be repeated using this technique in a few days if all data samples are ready. In contrast, the time could be considerably longer using conventional analysis methods.

This work is only a feasibility study. There are various possibilities to improve and further validate these methods. One is to enhance the performance by taking the sequential decays of W and Z bosons into account and add more categories in the classification, which can adopt more information for each category and enhance the classification performance. Another endeavor with more physical significance is incorporating some physics processes beyond the SM in the analysis, such as invisible and semi-invisible decays of the Higgs boson, which can enhance the sensitivity of an experiment to new physics. In addition, an important issue is to investigate the detailed performance of the classification method based on full simulation. It is also very constructive to take the full SM backgrounds and main systematic uncertainties into account.

ACKNOWLEDGE

The authors present special thanks to Yunxuan Song, Congqiao Li, Dr. Yu Bai, and Dr. Huilin Qu for useful discussion and advice. The authors thank the IHEP Computing Center for its firm support.

Reference (39)

[1]	G. Aad et al. (ATLAS Collaboration), Phys. Lett. B 716, 1-29 (2012) doi: 10.1016/j.physletb.2012.08.020
[2]	S. Chatrchyan et al. (CMS Collaboration), Phys. Lett. B 716, 30-61 (2012) doi: 10.1016/j.physletb.2012.08.021
[3]	R. Lafaye, T. Plehn, M. Rauch et al., JHEP 08, 009 (2009), arXiv:0904.3866 doi: 10.1088/1126-6708/2009/08/009
[4]	C. Englert, A. Freitas, M. M. Mühlleitner et al., J. Phys. G 41, 113001 (2014), arXiv:1403.7191 doi: 10.1088/0954-3899/41/11/113001
[5]	H. Baer, T. Barklow, K. Fujii et al., The International Linear Collider Technical Design Report - Volume 2: Physics, arXiv: 1306.6352 (2013)
[6]	M. Aicheler, P. Burrows, M. Draper et al., A Multi-TeV Linear Collider Based on CLIC Technology: CLIC Conceptual Design Report, doi: 10.5170/CERN-2012-007(2012)
[7]	A. Abada et al., Eur. Phys. J. ST 228(2), 261-623 (2019) doi: 10.1140/epjst/e2019-900045-4
[8]	CEPC Study Group, CEPC Conceptual Design Report: Volume 1 - Accelerator, arXiv: 1809.00285 (2018)
[9]	CEPC Study Group, CEPC Conceptual Design Report: Volume 2 - Physics & Detector, arXiv: 1811.10545 (2018)
[10]	Y. bai et al., Chin. Phys. C 44(1), 013001 (2020), arXiv:1905.12903 doi: 10.1088/1674-1137/44/1/013001
[11]	F. An et al., Chin. Phys. C 43(4), 043002 (2019), arXiv:1810.09037 doi: 10.1088/1674-1137/43/4/043002
[12]	M. D. Schwartz, Modern Machine Learning and Particle Physics, arXiv: 2103.12226 (2021)
[13]	A. J. Larkoski, I. Moult, and B. Nachman, Phys. Rept. 841, 1 (2020), arXiv:1709.04464 doi: 10.1016/j.physrep.2019.11.001
[14]	R. Kogler et al., Rev. Mod. Phys 91, 045003 (2019), arXiv:1803.06991 doi: 10.1103/RevModPhys.91.045003
[15]	J. Cogan, M. Kagan, E. Strauss et al., JHEP 02, 118 (2015), arXiv:1407.5675
[16]	L. G. Almeida, M. Backović, M. Cliche et al., JHEP 07, 086 (2015), arXiv:1501.05968
[17]	L. de Oliveira, M. Kagan, L. Mackey et al., JHEP 07, 069 (2016), arXiv:1511.05190
[18]	P. Baldi, K. Bauer, C. Eng et al., Phys. Rev. D 93, 094034 (2016), arXiv:1603.09349 doi: 10.1103/PhysRevD.93.094034
[19]	D. Guest, J. Collado, P. Baldi et al., Phys. Rev. D 94, 112002 (2016), arXiv:1607.08633 doi: 10.1103/PhysRevD.94.112002
[20]	J. Pearkes, W. Fedorko, A. Lister et al., Jet Constituents for Deep Neural Network Based Top Quark Tagging, arXiv: 1704.02124 (2017)
[21]	S. Egan, W. Fedorko, A. Lister et al., Long Short-Term Memory (LSTM) networks with jet constituents for boosted top tagging at the LHC, arXiv: 1711.09059 (2017)
[22]	K. Fraser and M. D. Schwartz, JHEP 10, 093 (2018), arXiv:1803.08066
[23]	G. Louppe, K. Cho, C. Becot et al., JHEP 01, 057 (2019), arXiv:1702.00748
[24]	T. Cheng, Comput. Softw. Big Sci. 2, 3 (2018), arXiv:1711.02633 doi: 10.1007/s41781-018-0007-y
[25]	I. Henrion, J. Brehmer, J. Bruna et al., Neural Message Passing for Jet Physics, Deep Learning for Physical Sciences Workshop at the 31st Conference on Neural Information Processing Systems (NIPS) (2017)
[26]	Patrick T. Komiske, Eric M. Metodiev, and Jesse Thaler, Journal of High Energy Physics 01, 121 (2019)
[27]	H. Qu and L. Gouskos, Phys. Rev. D 101(5), 056019 (2020), arXiv:1902.08570 doi: 10.1103/PhysRevD.101.056019
[28]	E. M. Metodiev, B. Nachman, and J. Thaler, JHEP 10, 174 (2017), arXiv:1708.02949
[29]	P. T. Komiske, E. M. Metodiev, B. Nachman et al., Phys. Rev. D 98, 011502(R) (2018), arXiv:1801.10158 doi: 10.1103/PhysRevD.98.011502
[30]	A. Andreassen, I. Feige, C. Frye et al., Eur. Phys. J. C 79, 102 (2019), arXiv:1804.09720 doi: 10.1140/epjc/s10052-019-6607-9
[31]	P. T. Komiske, E. M. Metodiev, and J. Thaler, JHEP 11, 059 (2018), arXiv:1809.01140
[32]	Manzil Zaheer, Satwik Kottur, Siamak Ravanbakhsh et al., Deep Sets, arXiv: 1703.06114 (2017)
[33]	K. He, X. Zhang, S. Ren et al., Delving deep into rectifiers: surpassing human-level performance on ImageNet classification, in 2015 IEEE International Conference on Computer Vision (ICCV), IEEE, Santiago, Chile, pg. 1026.c (2015).
[34]	Kilian, W., Ohl, T. & Reuter, J., Eur. Phys. J. C 71, 1742 (2011) doi: 10.1140/epjc/s10052-011-1742-y
[35]	T. Sjostrand, S. Mrenna, and P. Z. Skands, JHEP 05, 026 (2006), arXiv:0603175 doi: 10.1088/1126-6708/2006/05/026
[36]	Xin Mo, Gang Li, Man-Qi Ruan et al., Chin. Phys. C 40(3), 033001 (2016)
[37]	Diederik P. Kingma, and Jimmy Ba, Adam: A Method for Stochastic Optimization, ICLR ArXiv: 1412.6980 (2015)
[38]	Laurens van der Maaten and Geoffrey Hinton, Journal of Machine Learning Research 9, 2579-2605 (2008)
[39]	H. Qu, Weaver, a streamlined yet flexible machine learning R&D framework for HEP, https://github.com/hqucms/weaver

Mode	Cross section or branching fraction
$ \sigma(e^+e^-\to e^+e^-H) $	7.04 fb
$ \sigma(e^+e^-\to \mu^+\mu^-H) $	6.77 fb
$ \sigma(e^+e^-\to \tau^+\tau^-H) $	6.75 fb
$ \sigma(e^+e^-\to q^+q^-H) $	136.81 fb
$ \sigma(e^+e^-\to ZZ_{l}) $	67.81 fb
$ \sigma(e^+e^-\to ZZ_{sl}) $	516.67 fb
$ \sigma(e^+e^-\to ZZ_{h}) $	556.49 fb
$ B(H\to c\bar{c}) $	2.91%
$ B(H\to b\bar{b}) $	57.7%
$ B(H\to \mu^+\mu^-) $	$ 2.19\times 10^{-4} $
$ B(H\to \tau^+\tau^-) $	6.32%
$ B(H\to gg) $	8.57%
$ B(H\to \gamma\gamma) $	$ 2.28\times 10^{-3} $
$ B(H\to WW^*) $	21.5%
$ B(H\to ZZ^*) $	2.64%
$ B(H\to Z\gamma) $	$ 1.53\times 10^{-3} $

Decay mode	$ e^+e^-H $		$ \mu^+\mu^- H $		$ \tau^+\tau^- H $		$ q\bar{q}H $
Decay mode	EFF	AUC	EFF	AUC	EFF	AUC	EFF	AUC
$ H\to c\bar{c} $	0.880	0.991	0.882	0.991	0.857	0.987	0.755	0.966
$ H\to b\bar{b} $	0.908	0.994	0.893	0.994	0.877	0.991	0.733	0.972
$ H\to \mu^+\mu^- $	0.997	1.000	0.986	1.000	0.981	1.000	0.983	1.000
$ H\to \tau^+\tau^- $	0.993	0.999	0.985	0.999	0.985	0.999	0.982	0.999
$ H\to gg $	0.810	0.985	0.830	0.986	0.816	0.982	0.736	0.954
$ H\to \gamma\gamma $	0.997	1.000	0.999	1.000	1.000	1.000	0.997	1.000
$ H\to ZZ^* $	0.650	0.958	0.667	0.960	0.585	0.947	0.535	0.926
$ H\to WW^* $	0.806	0.981	0.801	0.981	0.771	0.974	0.632	0.952
$ H\to \gamma Z $	0.921	0.996	0.936	0.996	0.910	0.993	0.896	0.993

Classify the Higgs decays with the PFN and ParticleNet at electron–positron colliders

Abstract：

References

Access

Article Metrics

Metrics

通讯作者: 陈斌, bchen63@163.com

Email This Article

Classify the Higgs decays with the PFN and ParticleNet at electron–positron colliders

HTML

A. ML model setup

B. Simulation samples

C. 9-category classification: training and evaluation

D. Attempted 39-category classification

目录