Rapid identification of time-frequency domain gravitational wave signals from binary black holes using deep learning

  • Recent developments in deep learning techniques have provided alternative and complementary approaches to the traditional matched-filtering methods for identifying gravitational wave (GW) signals. The rapid and accurate identification of GW signals is crucial to the advancement of GW physics and multi-messenger astronomy, particularly considering the upcoming fourth and fifth observing runs of LIGO-Virgo-KAGRA. In this study, we used the 2D U-Net algorithm to identify time-frequency domain GW signals from stellar-mass binary black hole (BBH) mergers. We simulated BBH mergers with component masses ranging from 7 to 50 $ M_{\odot}$ and accounted for the LIGO detector noise. We found that the GW events in the first and second observation runs could all be clearly and rapidly identified. For the third observing run, approximately 80% of the GW events could be identified. In contrast to traditional convolutional neural networks, the U-Net algorithm can output time-frequency domain signal images corresponding to probabilities, providing a more intuitive analysis. In conclusion, the U-Net algorithm can rapidly identify the time-frequency domain GW signals from BBH mergers.
  • 加载中
  • [1] B. P. Abbott et al. (LIGO Scientific, Virgo), Phys. Rev. Lett. 116, 061102 (2016), arXiv: 1602.03837[gr-qc] doi: 10.1103/PhysRevLett.116.061102
    [2] A. Einstein, Sitzungsberichte der Königlich Preußischen Akademie der Wissenschaften, 688 (1916)
    [3] B. P. Abbott et al. (LIGO Scientific, Virgo), Phys. Rev. Lett. 119, 161101 (2017), arXiv: 1710.05832[gr-qc] doi: 10.1103/PhysRevLett.119.161101
    [4] B. P. Abbott et al. (LIGO Scientific, Virgo, Fermi GBM, INTEGRAL, IceCube, AstroSat Cadmium Zinc Telluride Imager Team, IPN, Insight-Hxmt, ANTARES, Swift, AGILE Team, 1M2H Team, Dark Energy Camera GW-EM, DES, DLT40, GRAWITA, Fermi-LAT, ATCA, ASKAP, Las Cumbres Observatory Group, OzGrav, DWF (Deeper Wider Faster Program), AST3, CAASTRO, VINROUGE, MASTER, J-GEM, GROWTH, JAGWAR, CaltechNRAO, TTU-NRAO, NuSTAR, Pan-STARRS, MAXI Team, TZAC Consortium, KU, Nordic Optical Telescope, ePESSTO, GROND, Texas Tech University, SALT Group, TOROS, BOOTES, MWA, CALET, IKI-GW Follow-up, H.E.S.S., LOFAR, LWA, HAWC, Pierre Auger, ALMA, Euro VLBI Team, Pi of Sky, Chandra Team at McGill University, DFN, ATLAS Telescopes, High Time Resolution Universe Survey, RIMAS, RATIR, SKA South Africa/MeerKAT), Astrophys. J. Lett. 848(L12), (2017), arXiv: 1710.05833[astro-ph.HE] doi: 10.3847/2041-8213/aa91c9
    [5] J. Aasi et al. (LIGO Scientific), Class. Quant. Grav. 32, 074001 (2015), arXiv: 1411.4547[gr-qc] doi: 10.1088/0264-9381/32/7/074001
    [6] F. Acernese et al. (VIRGO), Class. Quant. Grav. 32, 024001 (2015), arXiv: 1408.3978[gr-qc] doi: 10.1088/0264-9381/32/2/024001
    [7] T. Akutsu et al. (KAGRA), Nature Astron. 3, 35 (2019), arXiv: 1811.08079[gr-qc] doi: 10.1038/s41550-018-0658-y
    [8] https://gwosc.org/eventapi/html/allevents/.
    [9] B. P. Abbott et al. (LIGO Scientific, Virgo), Phys. Rev. X 9, 031040 (2019), arXiv: 1811.12907[astro-ph.HE] doi: 10.1103/PhysRevX.9.031040
    [10] R. Abbott et al. (LIGO Scientific, Virgo), Phys. Rev. X 11, 021053 (2021), arXiv: 2010.14527[gr-qc] doi: 10.1103/PhysRevX.11.021053
    [11] R. Abbott et al. (KAGRA, VIRGO, LIGO Scientific), Phys. Rev. X 13, 041039 (2023), arXiv: 2111.03606[grqc] doi: 10.1103/PhysRevX.13.041039
    [12] R. Abbott et al. (LIGO Scientific, VIRGO), Phys. Rev. D 109, 022001 (2024), arXiv: 2108.01045[gr-qc] doi: 10.1103/PhysRevD.109.022001
    [13] E. Berti et al., Class. Quant. Grav. 32, 243001 (2015), arXiv: 1501.07274[gr-qc] doi: 10.1088/0264-9381/32/24/243001
    [14] B. P. Abbott et al. (LIGO Scientific, Virgo), Phys. Rev. Lett. 116 , 221101 (2016), [Erratum: Phys. Rev. Lett. 121, 129902 (2018)], arXiv: 1602.03841[gr-qc]
    [15] B. P. Abbott et al. (LIGO Scientific, Virgo), Phys. Rev. Lett. 123, 011102 (2019), arXiv: 1811.00364[gr-qc] doi: 10.1103/PhysRevLett.123.011102
    [16] B. P. Abbott et al. (LIGO Scientific, Virgo), Phys. Rev. D 100, 104036 (2019), arXiv: 1903.04467[gr-qc] doi: 10.1103/PhysRevD.100.104036
    [17] R. Abbott et al. (LIGO Scientific, Virgo), Phys. Rev. D 103, 122002 (2021), arXiv: 2010.14529[gr-qc] doi: 10.1103/PhysRevD.103.122002
    [18] R. Abbott et al. (LIGO Scientific, VIRGO, KAGRA), arXiv: 2112.06861[gr-qc]
    [19] C. Gong, T. Zhu, R. Niu et al., Phys. Rev. D 105, 044034 (2022), arXiv: 2112.06446[gr-qc] doi: 10.1103/PhysRevD.105.044034
    [20] C. Gong, T. Zhu, R. Niu et al., Phys. Rev. D 107, 124015 (2023), arXiv: 2302.05077[gr-qc] doi: 10.1103/PhysRevD.107.124015
    [21] I. Mandel and F. S. Broekgaarden, Living Rev. Rel. 25, 1 (2022), arXiv: 2107.14239[astro-ph.HE] doi: 10.1007/s41114-021-00034-3
    [22] L. A. C. van Son, S. E. de Mink, T. Callister et al., Astrophys. J. 931, 17 (2022), arXiv: 2110.01634[astro-ph.HE] doi: 10.3847/1538-4357/ac64a3
    [23] F. S. Broekgaarden et al., Mon. Not. R. Astron. Soc. 516, 5737 (2022) doi: 10.1093/mnras/stac1677
    [24] J. M. Ezquiaga and D. E. Holz, Phys. Rev. Lett. 129, 061102 (2022), arXiv: 2202.08240[astro-ph.CO] doi: 10.1103/PhysRevLett.129.061102
    [25] B. P. Abbott et al. (LIGO Scientific, Virgo, 1M2H, Dark Energy Camera GW-E, DES, DLT40, Las Cumbres Observatory, VINROUGE, MASTER), Nature 551, 85 (2017), arXiv: 1710.05835[astro-ph.CO] doi: 10.1038/nature24471
    [26] H.-Y. Chen, M. Fishbach, and D. E. Holz, Nature 562, 545 (2018), arXiv: 1712.06531[astro-ph.CO] doi: 10.1038/s41586-018-0606-0
    [27] R. Abbott et al. (LIGO Scientific, Virgo, KAGRA), Astrophys. J. 949, 76 (2023), arXiv: 2111.03604 [astroph.CO] doi: 10.3847/1538-4357/ac74bb
    [28] M. Soares-Santos et al. (DES, LIGO Scientific, Virgo), Astrophys. J. Lett. 876, L7 (2019), arXiv: 1901.01540 [astro-ph.CO] doi: 10.3847/2041-8213/ab14f1
    [29] A. Palmese et al. (DES), Astrophys. J. Lett. 900, L33 (2020), arXiv: 2006.14961 [astro-ph.CO] doi: 10.3847/2041-8213/abaeff
    [30] B. P. Abbott et al. (LIGO Scientific, Virgo, VIRGO), Astrophys. J. 909, 218 (2021), arXiv: 1908.06060[astro-ph.CO] doi: 10.3847/1538-4357/abdcb7
    [31] D. E. Holz and S. A. Hughes, Astrophys. J. 629, 15 (2005), arXiv: astro-ph/0504616 doi: 10.1086/431341
    [32] N. Dalal, D. E. Holz, S. A. Hughes et al., Phys. Rev. D 74, 063006 (2006), arXiv: astro-ph/0601275 doi: 10.1103/PhysRevD.74.063006
    [33] S. Nissanke, D. E. Holz, S. A. Hughes et al., Astrophys. J. 725, 496 (2010), arXiv: 0904.1017[astro-ph.CO] doi: 10.1088/0004-637X/725/1/496
    [34] C. Cutler and D. E. Holz, Phys. Rev. D 80, 104009 (2009), arXiv: 0906.3752[astro-ph.CO] doi: 10.1103/PhysRevD.80.104009
    [35] S. Camera and A. Nishizawa, Phys. Rev. Lett. 110, 151103 (2013), arXiv: 1303.5446[astro-ph.CO] doi: 10.1103/PhysRevLett.110.151103
    [36] S. Vitale and H.-Y. Chen, Phys. Rev. Lett. 121, 021303 (2018), arXiv: 1804.07337[astro-ph.CO] doi: 10.1103/PhysRevLett.121.021303
    [37] L. Bian et al., Sci. China Phys. Mech. Astron. 64, 120401 (2021), arXiv: 2106.10235[gr-qc] doi: 10.1007/s11433-021-1781-x
    [38] R.-G. Cai and T. Yang, Phys. Rev. D 95, 044024 (2017), arXiv: 1608.08008[astro-ph.CO] doi: 10.1103/PhysRevD.95.044024
    [39] R.-G. Cai, T.-B. Liu, X.-W. Liu et al., Phys. Rev. D 97, 103005 (2018), arXiv: 1712.00952[astro-ph.CO] doi: 10.1103/PhysRevD.97.103005
    [40] R.-G. Cai and T. Yang, EPJ Web Conf. 168, 01008 (2018), arXiv: 1709.00837[astro-ph.CO] doi: 10.1051/epjconf/201816801008
    [41] X. Zhang, Sci. China Phys. Mech. Astron. 62, 110431 (2019), arXiv: 1905.11122[astro-ph.CO] doi: 10.1007/s11433-019-9445-7
    [42] H.-Y. Chen, Phys. Rev. Lett. 125, 201301 (2020), arXiv: 2006.02779[astro-ph.HE] doi: 10.1103/PhysRevLett.125.201301
    [43] R. Gray et al., Phys. Rev. D 101, 122001 (2020), arXiv: 1908.06050[gr-qc] doi: 10.1103/PhysRevD.101.122001
    [44] W. Zhao, C. Van Den Broeck, D. Baskaran et al., Phys. Rev. D 83, 023005 (2011), arXiv: 1009.0206[astro-ph.CO] doi: 10.1103/PhysRevD.83.023005
    [45] W. Zhao, B. S. Wright, and B. Li, JCAP 10, 052 (2018), arXiv: 1804.03066[astro-ph.CO] doi: 10.1088/1475-7516/2018/10/052
    [46] S.-J. Jin, T.-N. Li, J.-F. Zhang et al., JCAP 08, 070 (2023), arXiv: 2202.11882[gr-qc] doi: 10.1088/1475-7516/2023/08/070
    [47] M. Du, W. Yang, L. Xu et al., Phys. Rev. D 100, 043535 (2019), arXiv: 1812.01440[astroph.CO] doi: 10.1103/PhysRevD.100.043535
    [48] Y.-F. Cai, C. Li, E. N. Saridakis et al., Phys. Rev. D 97, 103513 (2018), arXiv: 1801.05827[gr-qc] doi: 10.1103/PhysRevD.97.103513
    [49] W. Yang, S. Pan, E. Di Valentino et al., JCAP 05, 050 (2020), arXiv: 1904.11980[astro-ph.CO] doi: 10.1088/1475-7516/2020/05/050
    [50] W. Yang, S. Vagnozzi, E. Di Valentino et al., JCAP 07, 037 (2019), arXiv: 1905.08286[astro-ph.CO] doi: 10.1088/1475-7516/2019/07/037
    [51] R. R. A. Bachega, A. A. Costa, E. Abdalla et al., JCAP 05, 021 (2020), arXiv: 1906.08909[astro-ph.CO] doi: 10.1088/1475-7516/2020/05/021
    [52] Z. Chang, Q.-G. Huang, S. Wang et al., Eur. Phys. J. C 79, 177 (2019) doi: 10.1140/epjc/s10052-019-6664-0
    [53] J.-F. Zhang, M. Zhang, S.-J. Jin et al., JCAP 09, 068 (2019), arXiv: 1907.03238[astro-ph.CO] doi: 10.1088/1475-7516/2019/09/068
    [54] S. Mukherjee, G. Lavaux, F. R. Bouchet et al., Astron. Astrophys. 646, A65 (2021), arXiv: 1909.08627[astro-ph.CO] doi: 10.1051/0004-6361/201936724
    [55] J.-h. He, Phys. Rev. D 100, 023527 (2019), arXiv: 1903.11254[astro-ph.CO] doi: 10.1103/PhysRevD.100.023527
    [56] Z.-W. Zhao, L.-F. Wang, J.-F. Zhang et al., Sci. Bull. 65, 1340 (2020), arXiv: 1912.11629[astroph.CO] doi: 10.1016/j.scib.2020.04.032
    [57] L.-F. Wang, S.-J. Jin, J.-F. Zhang et al., Sci. China Phys. Mech. Astron. 65, 210411 (2022), arXiv: 2101.11882[gr-qc] doi: 10.1007/s11433-021-1736-6
    [58] J.-Z. Qi, S.-J. Jin, X.-L. Fan et al., JCAP 12, 042 (2021), arXiv: 2102.01292[astro-ph.CO] doi: 10.1088/1475-7516/2021/12/042
    [59] S.-J. Jin, L.-F. Wang, P.-J. Wu et al., Phys. Rev. D 104, 103507 (2021), arXiv: 2106.01859[astro-ph.CO] doi: 10.1103/PhysRevD.104.103507
    [60] L.-G. Zhu, L.-H. Xie, Y.-M. Hu et al., Sci. China Phys. Mech. Astron. 65, 259811 (2022), arXiv: 2110.05224 [astro-ph.CO] doi: 10.1007/s11433-021-1859-9
    [61] J. M. S. de Souza, R. Sturani, and J. Alcaniz, JCAP 03, 025 (2022), arXiv: 2110.13316 [gr-qc] doi: 10.1088/1475-7516/2022/03/025
    [62] S.-J. Jin, R.-Q. Zhu, L.-F. Wang et al., Commun. Theor. Phys. 74, 105404 (2022), arXiv: 2204.04689[astro-ph.CO] doi: 10.1088/1572-9494/ac7b76
    [63] M.-D. Cao, J. Zheng, J.-Z. Qi et al., Astrophys. J. 934, 108 (2022), arXiv: 2112.14564[astro-ph.CO] doi: 10.3847/1538-4357/ac7ce4
    [64] H. Leandro, V. Marra, and R. Sturani, Phys. Rev. D 105, 023523 (2022), arXiv: 2109.07537[gr-qc] doi: 10.1103/PhysRevD.105.023523
    [65] X. Fu, L. Zhou, J. Yang et al., Chin. Phys. C 45, 065104 (2021) doi: 10.1088/1674-1137/abf48a
    [66] C. Ye and M. Fishbach, Phys. Rev. D 104, 043507 (2021), arXiv: 2103.14038[astro-ph.CO] doi: 10.1103/PhysRevD.104.043507
    [67] H.-Y. Chen, P. S. Cowperthwaite, B. D. Metzger et al., Astrophys. J. Lett. 908, L4 (2021), arXiv: 2011.01211 [astro-ph.CO] doi: 10.3847/2041-8213/abdab0
    [68] A. Mitra, J. Mifsud, D. F. Mota et al., Mon. Not. Roy. Astron. Soc. 502, 5563 (2021), arXiv: 2010.00189[astro-ph.CO] doi: 10.1093/mnras/stab165
    [69] N. B. Hogg, M. Martinelli, and S. Nesseris, JCAP 12, 019 (2020), arXiv: 2007.14335[astro-ph.CO] doi: 10.1088/1475-7516/2020/12/019
    [70] R. C. Nunes, Phys. Rev. D 102, 024071 (2020), arXiv: 2007.07750[gr-qc] doi: 10.1103/PhysRevD.102.024071
    [71] S. Borhanian, A. Dhani, A. Gupta et al., Astrophys. J. Lett. 905, L28 (2020), arXiv: 2007.02883[astro-ph.CO] doi: 10.3847/2041-8213/abcaf5
    [72] S.-J. Jin, D.-Z. He, Y. Xu et al., JCAP 03, 051 (2020), arXiv: 2001.05393[astro-ph.CO] doi: 10.1088/1475-7516/2020/03/051
    [73] J. Yu, Y. Wang, W. Zhao et al., Mon. Not. Roy. Astron. Soc. 498, 1786 (2020), arXiv: 2003.06586[astroph.CO] doi: 10.1093/mnras/staa2465
    [74] S.-J. Jin, S.-S. Xing, Y. Shao et al., Chin. Phys. C 47, 065104 (2023), arXiv: 2301.06722[astro-ph.CO] doi: 10.1088/1674-1137/acc8be
    [75] S.-J. Jin, Y.-Z. Zhang, J.-Y. Song et al., Sci. China Phys. Mech. Astron. 67, 220412 (2024), arXiv: 2305.19714 [astro-ph.CO] doi: 10.1007/s11433-023-2276-1
    [76] S.-J. Jin, R.-Q. Zhu, J.-Y. Song et al., JCAP 08, 050 (2024), arXiv: 2309.11900[astro-ph.CO] doi: 10.1088/1475-7516/2024/08/050
    [77] J.-Y. Song, L.-F. Wang, Y. Li et al., Sci. China Phys. Mech. Astron. 67, 230411 (2024), arXiv: 2212.00531 [astro-ph.CO] doi: 10.1007/s11433-023-2260-2
    [78] T. Han, S.-J. Jin, J.-F. Zhang, and X. Zhang, Eur. Phys. J. C 84, 663 (2024), arXiv: 2309.14965[astroph.CO] doi: 10.1140/epjc/s10052-024-12999-w
    [79] T.-N. Li, S.-J. Jin, H.-L. Li et al., Astrophys. J. 963, 52 (2024), arXiv: 2310.15879[astroph.CO] doi: 10.3847/1538-4357/ad1bc9
    [80] B. P. Abbott et al. (LIGO Scientific, Virgo), Phys. Rev. D 100, 064064 (2019), arXiv: 1906.08000[gr-qc] doi: 10.1103/PhysRevD.100.064064
    [81] R. Magee et al., Astrophys. J. Lett. 910, L21 (2021), arXiv: 2102.04555[astro-ph.HE] doi: 10.3847/2041-8213/abed54
    [82] H. Xia, L. Shao, J. Zhao et al., Phys. Rev. D 103, 024040 (2021), arXiv: 2011.04418[astro-ph.HE] doi: 10.1103/PhysRevD.103.024040
    [83] Y. LeCun, Y. Bengio, and G. Hinton, nature 521, 436 (2015) doi: 10.1038/nature14539
    [84] D. Guest, K. Cranmer, and D. Whiteson, Annual Review of Nuclear and Particle Science 68, 161 (2018) doi: 10.1146/annurev-nucl-101917-021019
    [85] P. Baldi, P. Sadowski, and D. Whiteson, Nature Commun. 5, 4308 (2014), arXiv: 1402.4735[hep-ph] doi: 10.1038/ncomms5308
    [86] W. Wei and E. A. Huerta, Phys. Lett. B 800, 135081 (2020), arXiv: 1901.00869[gr-qc] doi: 10.1016/j.physletb.2019.135081
    [87] D. George, H. Shen, and E. A. Huerta, arXiv: 1706.07446[gr-qc]
    [88] C. Chatterjee, L. Wen, K. Vinsen et al., Phys. Rev. D 100, 103025 (2019), arXiv: 1909.06367[astro-ph.IM] doi: 10.1103/PhysRevD.100.103025
    [89] H. Shen, D. George, E. A. Huerta et al., arXiv: 1903.03105 [astro-ph.CO]
    [90] E. Cuoco et al., Mach. Learn. Sci. Tech. 2, 011002 (2021), arXiv: 2005.03745[astro-ph.HE] doi: 10.1088/2632-2153/abb93a
    [91] J. A. D. Álvares, J. A. Font, F. F. Freitas et al., Class. Quant. Grav. 38, 155010 (2021), arXiv: 2011.10425[gr-qc] doi: 10.1088/1361-6382/ac0455
    [92] S. R. Green, C. Simpson, and J. Gair, Phys. Rev. D 102, 104057 (2020), arXiv: 2002.07656[astro-ph.IM] doi: 10.1103/PhysRevD.102.104057
    [93] S. R. Green and J. Gair, Mach. Learn. Sci. Tech. 2, 03LT01 (2021), arXiv: 2008.03312[astro-ph.IM] doi: 10.1088/2632-2153/abfaed
    [94] M. Dax, S. R. Green, J. Gair et al., Phys. Rev. Lett. 130, 171403 (2023), arXiv: 2210.05686[grqc] doi: 10.1103/PhysRevLett.130.171403
    [95] J. P. Marulanda, C. Santa, and A. E. Romano, Phys. Lett. B 810, 135790 (2020), arXiv: 2004.01050[gr-qc] doi: 10.1016/j.physletb.2020.135790
    [96] S. Singh, A. Singh, A. Prajapati et al., Mon. Not. Roy. Astron. Soc. 508, 1358 (2021), arXiv: 2008.06550[astro-ph.HE] doi: 10.1093/mnras/stab2417
    [97] M. Mould, D. Gerosa, and S. R. Taylor, Phys. Rev. D 106, 103013 (2022), arXiv: 2203.03651[astro-ph.HE] doi: 10.1103/PhysRevD.106.103013
    [98] C. Chatterjee, L. Wen, F. Diakogiannis et al., Phys. Rev. D 104, 064046 (2021), arXiv: 2105.03073[grqc] doi: 10.1103/PhysRevD.104.064046
    [99] A. McLeod, D. Jacobs, C. Chatterjee et al., arXiv: 2201.11126[astro-ph.IM]
    [100] J. Langendorff, A. Kolmus, J. Janquart et al., Phys. Rev. Lett. 130, 171402 (2023), arXiv: 2211.15097[gr-qc] doi: 10.1103/PhysRevLett.130.171402
    [101] C. Chatterjee, M. Kovalam, L. Wen et al., Astrophys. J. 959, 42 (2023), arXiv: 2207.14522[gr-qc] doi: 10.3847/1538-4357/ad08b7
    [102] T.-Y. Sun, C.-Y. Xiong, S.-J. Jin et al., Chin. Phys. C 48, 045108 (2024), arXiv: 2312.08122[gr-qc] doi: 10.1088/1674-1137/ad2a5f
    [103] C.-Y. Xiong, T.-Y. Sun, J.-F. Zhang et al., arXiv: 2405.09475[gr-qc]
    [104] D. George and E. A. Huerta, Phys. Rev. D 97, 044039 (2018), arXiv: 1701.00008[astro-ph.IM] doi: 10.1103/PhysRevD.97.044039
    [105] H. Gabbard, M. Williams, F. Hayes et al., Phys. Rev. Lett. 120, 141103 (2018), arXiv: 1712.06041[astro-ph.IM] doi: 10.1103/PhysRevLett.120.141103
    [106] P. G. Krastev, K. Gill, V. A. Villar et al., Phys. Lett. B 815, 136161 (2021), arXiv: 2012.13101[astro-ph.IM] doi: 10.1016/j.physletb.2021.136161
    [107] M. Cabero, A. Mahabal, and J. McIver, Astrophys. J. Lett. 904, L9 (2020), arXiv: 2010.11829[gr-qc] doi: 10.3847/2041-8213/abc5b5
    [108] S. Jadhav, N. Mukund, B. Gadre et al., Phys. Rev. D 104, 064051 (2021), arXiv: 2010.08584[gr-qc] doi: 10.1103/PhysRevD.104.064051
    [109] D. George and E. A. Huerta, Phys. Lett. B 778, 64 (2018), arXiv: 1711.03121[gr-qc] doi: 10.1016/j.physletb.2017.12.053
    [110] X. Fan, J. Li, X. Li et al., Sci. China Phys. Mech. Astron. 62, 969512 (2019), arXiv: 1811.01380[astro-ph.IM] doi: 10.1007/s11433-018-9321-7
    [111] S. Wu, Z. Cao, X. Liu et al., Phys. Rev. D 101, 104003 (2020), arXiv: 1909.13442[astroph.IM] doi: 10.1103/PhysRevD.101.104003
    [112] P. G. Krastev, Phys. Lett. B 803, 135330 (2020), arXiv: 1908.03151[astro-ph.IM] doi: 10.1016/j.physletb.2020.135330
    [113] W. Wei, A. Khan, E. A. Huerta et al., Phys. Lett. B 812, 136029 (2021), arXiv: 2010.15845[gr-qc] doi: 10.1016/j.physletb.2020.136029
    [114] C. Verma, A. Reza, D. Krishnaswamy et al., AIP Conf. Proc. 2555, 020010 (2022), arXiv: 2110.01883[gr-qc] doi: 10.1063/5.0108682
    [115] E. A. Moreno, B. Borzyszkowski, M. Pierini et al., Mach. Learn. Sci. Tech. 3, 025001 (2022), arXiv: 2107.12698[gr-qc] doi: 10.1088/2632-2153/ac5435
    [116] Y. Zhang, H. Xu, M. Liu et al., Phys. Rev. D 106, 122002 (2022) doi: 10.1103/PhysRevD.106.122002
    [117] R. Qiu, P. G. Krastev, K. Gill et al., Phys. Lett. B 840, 137850 (2023), arXiv: 2210.15888[astroph.IM] doi: 10.1016/j.physletb.2023.137850
    [118] P. Nousi, A. E. Koloniari, N. Passalis et al., Phys. Rev. D 108, 024022 (2023), arXiv: 2211.01520[gr-qc] doi: 10.1103/PhysRevD.108.024022
    [119] C. Ma, W. Wang, H. Wang et al., Phys. Rev. D 105, 083013 (2022), arXiv: 2204.12058[astro-ph.IM] doi: 10.1103/PhysRevD.105.083013
    [120] M. B. Schäfer et al., Phys. Rev. D 107, 023021 (2023), arXiv: 2209.11146[astro-ph.IM] doi: 10.1103/PhysRevD.107.023021
    [121] A. Trovato, E. Chassande-Mottin, M. Bejger e tal., Class. Quant. Grav. 41, 125003 (2024), arXiv: 2307.09268[gr-qc] doi: 10.1088/1361-6382/ad40f0
    [122] M. B. Schäfer, O. Zelenka, A. H. Nitz et al., Phys. Rev. D 105, 043002 (2022), arXiv: 2106.03741[astro-ph.IM] doi: 10.1103/PhysRevD.105.043002
    [123] T. Marianer, D. Poznanski, and J. X. Prochaska, Mon. Not. Roy. Astron. Soc. 500, 5408 (2020), arXiv: 2010.11949[astro-ph.IM] doi: 10.1093/mnras/staa3550
    [124] V. Boudart and M. Fays, Phys. Rev. D 105, 083007 (2022), arXiv: 2201.08727[gr-qc] doi: 10.1103/PhysRevD.105.083007
    [125] A. Ravichandran, A. Vijaykumar, S. J. Kapadia et al., arXiv: 2302.00666[gr-qc]
    [126] O. Ronneberger, P. Fischer, and T. Brox, in Medical Image Computing and Computer-Assisted Intervention– MICCAI 2015, 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18 (Springer, 2015), pp. 234–241.
    [127] S. Ni, Y. Li, L.-Y. Gao et al., Astrophys. J. 934, 83 (2022), arXiv: 2204.02780[astro-ph.IM] doi: 10.3847/1538-4357/ac7a34
    [128] L.-Y. Gao, Y. Li, S. Ni et al., Mon. Not. Roy. Astron. Soc. 525, 5278 (2023), arXiv: 2212.08773[astroph.IM] doi: 10.1093/mnras/stad2646
    [129] G. Pratten et al., Phys. Rev. D 103, 104056 (2021), arXiv: 2004.06503[gr-qc] doi: 10.1103/PhysRevD.103.104056
    [130] A. Nitz, I. Harry, D. Brown et al., "gwastro/pycbc: v2.1.2 release of pycbc," 2023
    [131] G. Baltus, J. Janquart, M. Lopez et al., Phys. Rev. D 106, 042002 (2022), arXiv: 2205.04750[gr-qc] doi: 10.1103/PhysRevD.106.042002
    [132] https://github.com/HarisIqbal88/PlotNeuralNet.
    [133] D. M. Macleod, J. S. Areeda, S. B. Coughlin et al., SoftwareX 13, 100657 (2021) doi: 10.1016/j.softx.2021.100657
    [134] S. J. Reddi, S. Kale, and S. Kumar, arXiv: 1904.09237
    [135] B. Abbott et al. (LIGO Scientific), Phys. Rev. D 77, 062002 (2008), arXiv: 0704.3368[gr-qc] doi: 10.1103/PhysRevD.77.062002
    [136] M. B. Schäfer and A. H. Nitz, Phys. Rev. D 105, 043003 (2022), arXiv: 2108.10715[astro-ph.IM] doi: 10.1103/PhysRevD.105.043003
    [137] E. Marx et al., arXiv: 2403.18661[gr-qc]
    [138] M. Cabero et al., Class. Quant. Grav. 36, 15 (2019), arXiv: 1901.05093[physics.ins-det] doi: 10.1088/1361-6382/ab2e14
    [139] A. H. Nitz, Class. Quant. Grav. 35, 035016 (2018), arXiv: 1709.08974[gr-qc] doi: 10.1088/1361-6382/aaa13d
    [140] S. Vitale, D. Gerosa, C.-J. Haster et al., Phys. Rev. Lett. 119, 251103 (2017), arXiv: 1707.04637[gr-qc] doi: 10.1103/PhysRevLett.119.251103
  • 加载中

Figures(11) / Tables(4)

Get Citation
Yu-Xin Wang, Shang-Jie Jin, Tian-Yang Sun, Jing-Fei Zhang and Xin Zhang. Rapid identification of time-frequency domain gravitational wave signals from binary black holes using deep learning[J]. Chinese Physics C. doi: 10.1088/1674-1137/ad73ac
Yu-Xin Wang, Shang-Jie Jin, Tian-Yang Sun, Jing-Fei Zhang and Xin Zhang. Rapid identification of time-frequency domain gravitational wave signals from binary black holes using deep learning[J]. Chinese Physics C.  doi: 10.1088/1674-1137/ad73ac shu
Milestone
Received: 2024-08-06
Article Metric

Article Views(666)
PDF Downloads(15)
Cited by(0)
Policy on re-use
To reuse of subscription content published by CPC, the users need to request permission from CPC, unless the content was published under an Open Access license which automatically permits that type of reuse.
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Email This Article

Title:
Email:

Rapid identification of time-frequency domain gravitational wave signals from binary black holes using deep learning

    Corresponding author: Xin Zhang, zhangxin@mail.neu.edu.cn
  • 1. Key Laboratory of Cosmology and Astrophysics (Liaoning) & College of Sciences, Northeastern University, Shenyang 110819, China
  • 2. Key Laboratory of Data Analytics and Optimization for Smart Industry (Ministry of Education), Northeastern University, Shenyang 110819, China
  • 3. National Frontiers Science Center for Industrial Intelligence and Systems Optimization, Northeastern University, Shenyang 110819, China

Abstract: Recent developments in deep learning techniques have provided alternative and complementary approaches to the traditional matched-filtering methods for identifying gravitational wave (GW) signals. The rapid and accurate identification of GW signals is crucial to the advancement of GW physics and multi-messenger astronomy, particularly considering the upcoming fourth and fifth observing runs of LIGO-Virgo-KAGRA. In this study, we used the 2D U-Net algorithm to identify time-frequency domain GW signals from stellar-mass binary black hole (BBH) mergers. We simulated BBH mergers with component masses ranging from 7 to 50 $ M_{\odot}$ and accounted for the LIGO detector noise. We found that the GW events in the first and second observation runs could all be clearly and rapidly identified. For the third observing run, approximately 80% of the GW events could be identified. In contrast to traditional convolutional neural networks, the U-Net algorithm can output time-frequency domain signal images corresponding to probabilities, providing a more intuitive analysis. In conclusion, the U-Net algorithm can rapidly identify the time-frequency domain GW signals from BBH mergers.

    HTML

    I.   INTRODUCTION
    • In 2015, the first detection of a gravitational wave (GW) signal (GW150914) from a binary black hole (BBH) merger initiated a new era of GW astronomy [1]. Furthermore, it provided an important test for the existence of GWs, which were predicted by Albert Einstein in 1916 based on his theory of general relativity [2]. On August 17th, 2017, the first binary neutron star (BNS) merger event (GW170817) was detected [3], which together with its electromagnetic (EM) counterparts began the era of multi-messenger astronomy [4]. So far, the LIGO-Virgo-KAGRA collaboration [57] has detected 90 GW events from compact binary coalescences (CBCs) [812]. The study of GW has important implications for fundamental physics, astronomy, and cosmology. For example, GWs can be used to test general relativity [1320], understand the origins and distributions of astrophysical CBC sources [2124], and measure cosmological parameters [2530] (especially providing independent measurements of the Hubble constant using the standard siren method, which is widely discussed in the literature [3179]).

      Traditional matched-filtering techniques have been greatly successful in the detection of GWs [80] and the performance of matched filtering is considered near-optimal in the confines of LIGO detector noise. Searches are performed with two different latencies: online and offline, as described in Ref. [81]. Online searches are performed nearly in real-time during data collection, whereas offline searches are conducted later using the final, calibrated, and cleaned dataset. Online analysis allows for the rapid release of candidate-related public alerts for multi-messenger counterpart searches. Offline analysis includes improved background statistics and extensive data calibration, review, and adjustment. Moreover, with relaxed latency requirements, offline analysis can perform more computationally expensive calculations to enhance the separation of signals from the background noise. Owing to these factors, offline analysis is more sensitive than online analysis. Online searches are rapid, but the ability to separate signals from background noise is not as strong as in offline searches. Furthermore, both types of search demand substantial computational resources. Nevertheless, deep learning can serve as a complementary approach to validate the results generated by the low-latency workflow. In addition, because of the generalization of deep learning [82], GW signals beyond the GW template are hoped to be identified, which is expected to be verified by future GW templates. More importantly, deep learning is known for its low computational resource requirements.

      Deep learning describes a class of machine learning algorithms. The basic principle of deep learning is to use a multi-layer neural network to gradually extract features from the original input data and make predictions. Thanks to the rapid development of GPU technology, deep learning techniques have gradually been adopted by a wide array of fields in recent years [8386]. The main advantage of using deep learning to identify GW signals is that the algorithm can be pre-trained using a library of known waveform templates and detector noises. When running an online search, the trained network can be loaded quickly, allowing the rapid and efficient identification of GW sources.

      Recently, GW astronomy based on deep learning algorithms has been intensively discussed in the literature [82, 86122]. Among these studies, Refs. [82, 104122] focused on the identification of GW signals, with most based on the time-domain analysis. In fact, because the signal strengths of GWs are weak, using 2D data in the analysis of GW detection is sub-optimal [104]. However, many networks work better with 2D images. Hoping to achieve a higher true alarm rate and a lower false alarm rate, analyses are also conducted in the time-frequency domain. Therefore, the identification of the time-frequency domains of GWs using deep learning is also very important, as discussed in Refs. [90, 123125]. The 2D U-Net algorithm [126] has advantages in image processing, performing well in removing foreground contaminations entangled with the systematic effects of radio telescopes in neutral hydrogen $ 21 $ cm intensity mapping surveys [127, 128]. Given the advantage of U-Net in image processing and the challenges in identifying the extremely faint neutral hydrogen signals and distinguishing GW signals from noise (GW detection and neutral hydrogen $ 21 $ cm intensity mapping surveys both extract weak signals from strong background noise), we are naturally motivated to explore the potential of the 2D U-Net algorithm in identifying time-frequency domain GW signals from BBHs.

      In this article, we explore the potential of the U-Net algorithm in identifying GW signals. Unlike traditional convolutional neural network (CNN) algorithms, U-Net can output images corresponding to probabilities, enabling a more intuitive investigation of the time-frequency-domain signal. Our goal was to train the network to quickly identify GW signals in the time-frequency domain. Given that the currently detected GW signals are mainly BBH mergers, we focused on the identification of BBH mergers. We began by training the network using simulated BBH signals and subsequently applied the trained network to real observations. Moreover, this is the first study to perform time-frequency-domain analysis and apply the network to the O3 observations. Through this study, we aim to shed light on the possibility of using U-Net to identify the GW signals from BBH mergers. Neutron star–black hole (NSBH) and BNS mergers have been omitted from this study but will be included in future research.

      This article is organized as follows: In Sec. II, we introduce the methodology used in this study. In Sec. III, we report the identification results of the trained network with the simulation and O1, O2, and O3 data. The conclusion is given in Sec. IV.

    II.   METHODOLOGY

      A.   Dataset assembly

    • In this study, we focused on the GW signals produced by BBH mergers. We generated six sets of data for training six models, each containing 170000 samples of pure background noise and 170000 samples of mixed GW signal and background noise, for different observation runs and detectors. For the simulated GW signals, we adopted the waveform model $ \rm{IMRPhenomXPHM} $ [129] generated by $ {\mathrm{PyCBC}} $ [130]. For the background noise, we selected segments from the publicly available O1, O2, and O3 data of LIGO Hanford and LIGO Livingston, wherein no GW events were detected.

      The synthetic data was described by

      $ s(t) = h(t)+n(t), $

      (1)

      where $ h(t) $ is the GW signal and $ n(t) $ is the background noise.

      The duration of the training dataset was 8 s (we set the merger-time range for the GW signal to $ [6.5, 7.5] $ s) and the sampling rate was 4096 Hz. All the data were whitened and passed through a band-pass filter with a frequency in the range of $ [30,900] $ Hz. Owing to the edge effect of the band-pass filter, the data in the edge 0.25 s were set to zero. Subsequently, we utilized the short-time Fourier transform and applied a Hanning window to transform the signals into the time-frequency domain for analysis. Specifically, the window length was set to 0.1 s and the overlap was half the window length. We preprocessed our data to fit the required network dimensions, setting both the time and frequency bins to 160. Subsequently, we applied maximum normalization to each image by dividing all pixel values by the highest image pixel value, ensuring the maximum pixel value in each image was normalized to 1. The simulated GW parameters are shown in Table 1. The spin magnitudes were uniformly distributed in the range of $ [0, 0.99] $.

      ParameterUniform distribution
      Component masses$m_1,m_2 \in[7,50]~ M_{\odot}$
      Coalescence phase$\Phi \in[0,2 \pi]~ {\rm rad}$
      Polarization$\Psi \in[0,2 \pi] ~ {\rm rad}$
      Inclination$\cos\iota \in[-1,1]$
      Declination$\sin\theta \in[-1,1]$
      Right ascension$\varphi \in[-\pi, \pi]~ {\rm rad}$
      Chirp distance${d}\mathrm{ _{c}^{2} }\in[130^{2},350^{2}]~ \mathrm{Mpc^{2} }$

      Table 1.  Distribution of simulated GW waveform parameters. Parameters not listed were set to zero for simplicity.

      Here we emphasize that the detections of BNS mergers are also important, particularly for observations of the follow-up electromagnetic counterparts, which could greatly advance the fields of multi-messenger astronomy and standard siren cosmology. In this study, we only considered BBH mergers for two reasons. First, to ensure the robustness of the algorithm, as many verification sets as possible were needed. Given that the CBCs currently detected by LIGO-Virgo-KAGRA are mainly BBH mergers, they serve as suitable candidates for testing and validating our methods. Second, owing to the extended duration of BNS merger detections compared to BBH mergers, the data dimension is relatively high. Without employing PCA for dimensionality reduction, the dataset requirements would experience exponential growth; consequently, BNS mergers should be treated with dedicated networks and separately from BBH mergers [117, 131].

    • B.   U-Net architecture

    • The typical use of convolutional networks is on classification tasks, where the output of the image is a single class label. However, in many visual tasks, assigning a single class label to an entire image is not sufficient. Instead, the desired output should include pixel-level localization, where a class label is assigned to each individual pixel. The U-Net algorithm can achieve this through semantic segmentation, which is an approach for identifying the class of an object for each pixel. This approach is particularly useful for tasks such as object detection, where the precise location and shape of the object of interest need to be identified.

      The U-Net network is a CNN originally developed for biomedical image segmentation [126]. Although based on a CNN, it has undergone significant structural modification. Unlike standard CNN architecture, U-Net includes many feature channels in the upsampling, 1 allowing the network to propagate contextual information to higher resolution layers through a series of transpose convolutions. 2 The main idea behind U-Net is to add successive layers to the traditional contracting network, replacing the pooling operation with an upsampling operation. This leads to an increase in output resolution, as these layers produce a U-shaped structure that is almost symmetric with the contracted part.

      The U-Net architecture is illustrated in Fig. 1. It consists of a contracted path (left side) and an extended path (right side). The contracting path follows the typical architecture of a convolutional network. It comprises the repeated application of $ 3\times3 $ convolutions (unpadded convolutions), each followed by a rectified linear unit (ReLU) and a $ 2\times2 $ max pooling 3 operation with stride 2 for downsampling. At each downsampling step, the number of feature channels is doubled. Every step in the expanding path consists of an upsampling of the feature map followed by a $ 2\times2 $ convolution that halves the number of feature channels, a concatenation with the corresponding cropped feature map from the contracting path, and two $ 3\times3 $ convolutions, each followed by a ReLU. The cropping is necessary due to the loss of border pixels in each convolution. In the final layer, a $ 1\times1 $ convolution is used to map each 64-component feature vector to the desired number of classes. In total, the network has 3 convolutional layers.

      Figure 1.  (color online) Training of a CNN with U-Net architecture. Here each color represents a structure in the U-Net network: yellow cubes represent the convolutional layers and ReLU sections, red cubes represent pooling layers in the down-sampling, blue cubes represent the transposed convolutional layers, and grey cubes represent connection layers. This visualization was made using the PlotNeuralNet library [132].

      The network flowchart is shown in Fig. 2. Whenever a time-frequency-domain image is sent to the network, it outputs a noise-reduced image. If the image includes a GW signal, the signal will be highlighted; otherwise, the image will appear almost blank, with pixel values close to zero, rather than exactly zero. For large test datasets, analysis is performed by extracting the maximum pixel value from each image and comparing it with a predetermined threshold. Images with maximum values exceeding this threshold are considered to contain a GW signal, whereas those below are deemed signal-free. We included sigmoid activation in the output layer of the network so that the output of the network is in the range [0, 1], and a threshold of 0.5 was set.

      Figure 2.  (color online) Flowchart of the network used in this study. The observed signal [133] (mixed GW signal and noise or noise only) is processed by the U-Net algorithm. If a GW signal is present, a GW signal image will be output. If there is no signal, an empty signal image will be output.

    • C.   Training

    • During the training process, the coefficients of neural network are determined. To assign initial random values to CNN parameters, we used the “Xavier” initialization, which is designed to keep the scale of gradients roughly the same in all layers. Then, we used the binary cross-entropy loss function to evaluate the deviation between the predicted and the actual values in the training data.

      The key component of a CNN is the convolutional layer, which applies a set of filters to the input. The network consists of a series of stacked layers. In the first convolutional layer, we set the number of convolution kernels to 32. The kernel size determines the convolutional field of view and is fixed at $ 3 \times 3 $. To maintain the output dimensionality, we employed the same padding method for both convolutions and transpose convolutions to handle sample boundaries. The stride determines the kernel traversal step size on the images. We used the default stride settings of 1 for convolutions and 2 for transpose convolutions.

      Our selected U-Net architecture was trained end-to-end for signal identification using the simulated data introduced in Sec. II.A. The details of the hyperparameters used in this work are listed in Table 2. The prior values are those we used during the training process that can achieve a high true alarm rate with a threshold of 0.5, and the optimum values are those we finally selected, with the best ROCs. The NAdam optimizer 4 was used in the analysis with the default TensorFlow parameters [134]. The hyperparameters were carefully fine-tuned to optimize the network. The batch size and number of initial convolution filters were optimized to 32, both of which were limited by the GPU memory. The learning rate 5 was set to $ 10^{-5} $ and the weight decay to$ 10^{-4} $. We set the dropout rate to 0.2 and used batch normalization 6 to maintain the mean and variance of the input data distribution of each layer in the U-Net within a certain range. The total number of trainable parameters was $ 3.1\times10^6 $. We applied a ReLU activation in every convolution. Finally, we adopted the 200 epoch calculation scheme to improve our results.

      HyperparameterDescriptionPrior valueOptimum value
      ηlearning rate for optimizer$10^{-3}, 10^{-4}, 10^{-5},10^{-6}$$10^{-5}$
      ωweight decay for optimizer$10^{-4}, 10^{-5}, 10^{-6}, 10^{-7}$$10^{-5}$
      $n_{\text {filter }}$initial number of convolution filters$16,32$32
      bbatch size32, 6432
      Ωoptimizer for trainingAdam, NAdamNAdam

      Table 2.  Description of the hyperparameters in the U-Net architecture design.

      At the end of every epoch, the network training performance was evaluated using the average accuracy of the validation set. The training process was completed within 6 h on four NVIDIA GeForce RTX A6000 GPUs, each with 48 GB of memory.

      The identification time for 173 GW signals 7 was 0.1868 s, meaning that the identification time for each GW signal was approximately 1 ms. Thus, compared to matched filtering, U-Net can save computation resources.

    • D.   Evaluation

    • Consistent with the method used in Ref. [120], the effective maximum luminosity distance was selected based on the chirp mass; the smaller the chirp mass, the smaller the maximum luminosity distance at which injections were placed. The chirp distance is given by [135]

      $ d_c = d \left( \frac{{\cal{M}}_{c,0}}{{\cal{M}}_c} \right)^{5/6}, $

      (2)

      where d is the luminosity distance, $ {\cal{M}}_c = \left( m_1 m_2 \right)^{3/5} / (m_1 + m_2)^{1/5} $ is the chirp mass, and $ {\cal{M}}_{c,0} = 1.4/2^{1/5} \, M_\odot $ is a fiducial chirp mass used as a basis for calculation.

      When considering a fixed chirp mass, the injections are uniformly distributed within the sphere of the adjusted maximum luminosity distance. The formula we used to estimate the sensitive volume can be written as

      $ V({\cal{F}}) \approx \frac{V(d_{\text{max}})}{N_I} \sum\limits_{i = 1}^{N_{I,{\cal{F}}}} \left( \frac{M_{c,i}}{M_{c,\text{max}}} \right)^{5/2}, $

      (3)

      where $ d_{\text{max}} $ is the maximum luminosity distance, $ V(d_{\text{max}}) $ is the volume of a sphere with radius $ d_{\text{max}} $, $ N_{I,{\cal{F}}} $ is the number of found injections at a FAR of $ {\cal{F}} $, $ N_I $ is the total number of injections performed, $ {\cal{M}}_{c,i} $ is the chirp mass of the ith found injection, and $ {\cal{M}}_{c,\text{max}} $ is the upper limit on the injected chirp masses.

      Before proceeding to the results and discussion, we should note that we trained 6 models based on different noise sources. When analyzing a specific observation run of a specific detector, we used the corresponding model for prediction. We combined the network outputs by adding the outputs of LIGO Hanford and LIGO Livingston at the same moment. Because the output range of a single network is [0, 1], the combined output range is [0, 2]. The advantage of combining the outputs of two detectors in this way is that, when facing glitches, even if the output of one detector indicates a signal, that of the other is unlikely to indicate a signal at the same time. This increases the robustness of the system to glitches, because they typically do not occur at the same time in both detectors and have similar characteristics.

    III.   RESULTS AND DISCUSSION

      A.   Simulation results

    • For the first and second tests, we used the network trained on the O3 data from LIGO-Hanford. We first examined the GW-detection ability of the network for a single mass m and SNR. In the training dataset, we considered equal-mass BBH mergers; the BH mass was sampled in the range of $ [5, 80]\; M_{\odot} $, and SNR in the range of $ [4, 20] $. After 100 epochs of training, the accuracy of the test dataset was 90%. To illustrate the capability of the network, Fig. 3 shows the results with m and SNR in the ranges of $ [5, 80]\; M_{\odot} $ and $ [4, 20] $, respectively. The network performed well in the high-mass and high-SNR regions, but not in the low-mass and low-SNR regions. Therefore, for the lower-SNR region ($ 4<\text{SNR}<5 $), the network also performed poorly. The primary cause for this is that the network is more effective at handling short signals. As the chirp mass decreases, the GW signals last longer and become buried within the characteristics of the random noise. In other words, because the SNR is integrated throughout the GW signals, at the same SNR, short signals have higher strain values than those of long signals. Therefore, it is easier to identify the short signals. For the high-mass and low-SNR regions, the network performs better. The network can achieve an accuracy of 90% when the single mass is larger than $ 9\; M_{\odot} $ and SNR is greater than 8.

      Figure 3.  (color online) Network robustness on m with respect to the SNR. Note that only equal-mass BBH mergers were considered. The dashed red line is the 50% contour line, the dot-dashed black line is the 70% contour line, and the solid blue line is the 90% contour line.

      Second, we tested the robustness of networks where the GW signals were added to noise at different times. We selected several fixed signal added times, $ [6.75,\; 6.9,\; 7,\; 7.1,\; 7.25] $ s, to test whether the network can identify signals with different added times. We generated nearly 20,000 samples. The true alarm probabilities are listed in Table 3. We used the maximum pixel value in the image predicted by the network to compare with a threshold set to 0.5. We also generated nearly 20,000 pure-noise samples with a false alarm probability of 0.0026.

      Time/sTrue alarm probability
      6.750.9722
      6.900.9820
      7.000.9942
      7.100.9896
      7.250.9834

      Table 3.  Testing the robustness of networks to noise with GW signals at different times.

      For the final test, we used two models trained separately on the O3 data from LIGO Hanford and LIGO Livingston. We used O3a noise for evaluation and downloaded six days of data starting from June 6, 2019 (GPS start time 1243877376 and GPS end time 1244340224). All noise was given the “data” quality flag, and none of the ''CBC_CAT1,'' ''CBC_CAT2,'' or “CBC_HW_INJ” flags were active. We considered only segments where the data from both LIGO Hanford and LIGO Livingston met these conditions, with a usable data duration of 3.25 d. We shifted the data from LIGO Livingston by 100 s ten times, while keeping the data from LIGO Hanford fixed. This allowed us to generate 32.5 d of coincident real noise for both detectors.

      We sampled the injections from the distributions defined in Table 1 using the chirp distance. The spin magnitudes were uniformly distributed in the range of $ [0, 0.99] $. The merger time of two subsequent signals was set at 20 s to avoid any overlap.

      In Fig. 4, we show the variation in the sensitive distances of different machine learning methods with the number of sources that can be detected at a particular FAR. Consistent with Ref. [120], the FAR is plotted at 1000 per month. Compared to the Virgo-AUTh [118] and the MFCNN [111], U-Net can reach a higher sensitive distance at a FAR of 1000 per month. However, as the FAR decreases, the sensitive distance of U-Net drops rapidly. Compared to the CNN-Coinc [105, 136], U-Net performs better at FARs of 1000, 100, and 1 per month. Within the mass range of currently detected BBH events, machine learning methods often achieve comparable or superior sensitivity distances to matched filtering at FARs significantly higher than 1 per year [137]. In conclusion, U-Net is not significantly superior to other networks but is strongly competitive.

      Figure 4.  (color online) Sensitive distances of U-Net and other machine learning methods [120] as functions of the FAR. U-Net is shown with a solid line, other methods with dashed lines. The FAR was calculated on a background set that does not contain any injections.

    • B.   Robustness test of U-Net

    • In the process of identifying GW signals, as typical non-stationary noise in GW detectors, glitches often have a large impact on the matched-filtering method. Glitches may also have an impact on the false alarm rate of U-Net. Hence, in this subsection, we report a test on the impact of glitches on the false alarm rate of a network that has not been trained on glitches.

      We chose blip glitches to represent the glitches. Blip glitches are very short-duration transients ($ {\cal{O}}(10) $ ms) and have a wide frequency bandwidth ($ {\cal{O}}(100) $ Hz) [138]. We chose blip glitches for two reasons, (i) they are common in the GW data, and (ii) in the time-frequency domain they appear similar to high-mass BBH mergers, and thus can be mistaken for a real GW signal [138, 139].

      We used the network trained on the O3 data from LIGO-Hanford to identify the blip glitches. We selected blip-like glitches from the LIGO data, then placed the glitches at different times and used the trained network to identify them. As mentioned above, if the network does not identify the signal, an empty signal image will be output. We randomly tested 200 glitches, with ten identified as signals. Thus, the false alarm rate was approximately 5% for blip-like glitches. In Fig. 5, we show three typical blip-like glitches at different times and the predictions of U-Net. The blip glitches are similar to the waveform of a GW at the merge-time, and both resemble the shape of a chirp signal. This might be the reason behind the high false alarm rate.

      Figure 5.  (color online) Blip-like glitches at different times (left) and the prediction results of U-Net (right).

    • C.   Application to real observations

    • For real observations, when analyzing a specific observation run from a specific detector, we used the corresponding model for prediction. In this subsection, we report the identification results. We tested the real GW events by placing the signal at approximately 7 s to make it easier to confirm the results. For each GW event, we identified the signals from LIGO Hanford and LIGO Livingston. Owing to the low SNR of some GW signals, there will be missed detections during identification. Hence, we utilized the union of LIGO Hanford and LIGO Livingston, i.e., we considered the signal identified as long as it was detected by either of the two detectors. Note that in the present study, we only considered the identification of GW signals from BBH mergers; GW events involving NSs were not considered (summarized in Table 4).

      NameEvent type
      GW170817BNS
      GW190425BNS
      GW190814NSBH
      GW190917_114630NSBH
      GW191219_163120NSBH
      GW200115_042309NSBH
      GW200210_092254NSBH

      Table 4.  GW events not considered in this study.

      In Figs. 6 and 7, we show the time-frequency representation of the GW signals of O1 and O2 identified by the trained network. All the GW events were identified by the network. Further, we tested the network by attempting to identify BNS mergers and found that the network is unable to identify them, which is consistent with our expectations. From the figure, we can also see that the identified signals obtained from the network are different for different GW events. This is primarily because the time-frequency signals of GWs for different chirp masses are different. This also means that the time-frequency representation of GW signals identified by U-Net may be used to preliminarily determine the chirp masses of the GW sources. Previous research shows that prior selection of the GW parameter has an important impact on the Bayesian inference [140]. Our results show that the network can not only rapidly and accurately identify the GW signals but also aid with the later Bayesian inferences.

      Figure 6.  (color online) Time-frequency representations of the GW signals of O1 and a portion of O2, identified by the trained network (the remaining O2 signals are presented in Fig. 7). In the event names, H and L represent the signals from LIGO Hanford and LIGO Livingston, respectively.

      Figure 7.  (color online) Same as Fig. 6, but for the remaining time-frequency representations of the GW events from GW170809 to GW170823.

      Subsequently, we applied the trained network to identification in the O3 data. In Figs. 811, we show the time-frequency representations of the GW signals of O3 identified by the trained network. The trained network shows a strong ability to identify GW signals. Fifty-nine GW signals were clearly identified by the network. Note that for O3 we considered a total of 73 GW signals for analysis and only show the identified results. Approximately 80% (80.8%) of GW signals in O3 could be rapidly and accurately identified.

      Figure 8.  (color online) Time-frequency representations of a portion of the GW signals of O3 identified by the trained network (the remaining O3 signals are presented in Figs. 911). The colorbar settings are consistent with Fig. 6. For clarity, only the GW signals from 6 s to 8 s are shown. For each GW signal, only the better-identified signal from LIGO Livingston or LIGO Hanford is shown for simplicity.

      Figure 9.  (color online) Same as Fig. 8, but for the remaining time-frequency representations of the GW events, from GW190620_030421 to GW190915_235702.

      Figure 10.  (color online) Same as Fig. 8, but for the remaining time-frequency representations of the GW events, from GW190916_200658 to GW200128_022011.

      Figure 11.  (color online) Same as Fig. 8, but for the remaining time-frequency representations of the GW events, from GW200129_065458 to GW200322_091133.

    IV.   CONCLUSION
    • In this study, we use the U-Net algorithm to identify the time-frequency GW signals of LIGO O1, O2, and O3 observations. We trained the network using a dataset of pure background noise signals and a combined dataset of GW signals and background noise. In the test dataset, the false alarm probabilities for O1, O2, and O3 were all better than 0.1%. The trained network was then used to identify signals in real observations. The time-frequency representations of the GW signals in O1 and O2 were all identified. For O3, the trained network could identify approximately 80% of GW events. Our results show that the U-Net algorithm can rapidly identify the time-frequency representations of GW signals from BBH mergers.

    ACKNOWLEDGMENTS
    • This research used data or software obtained from the Gravitational Wave Open Science Center (gwosc.org), a service of LIGO Laboratory, the LIGO Scientific Collaboration, the Virgo Collaboration, and KAGRA. We thank He Wang for helpful discussions.

Reference (140)

目录

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return