A1 Refereed original research article in a scientific journal
Euclid preparation LXXV. Estimating galaxy physical properties using CatBoost chained regressors with attention
Authors: Humphrey, A.; Cunha, P. A. C.; Bisigello, L.; Tortora, C.; Bolzonella, M.; Pozzetti, L.; Baes, M.; Granett, B. R.; Amara, A.; Andreon, S.; Auricchio, N.; Baccigalupi, C.; Baldi, M.; Bardelli, S.; Bodendorf, C.; Bonino, D.; Branchini, E.; Brescia, M.; Brinchmann, J.; Camera, S.; Capobianco, V.; Carbone, C.; Carretero, J.; Casas, S.; Castellano, M.; Castignani, G.; Cavuoti, S.; Cimatti, A.; Colodro-Conde, C.; Congedo, G.; Conselice, C. J.; Conversi, L.; Copin, Y.; Courbin, F.; Courtois, H. M.; Da Silva, A.; Degaudenzi, H.; De Lucia, G.; Dinis, J.; Dubath, F.; Dupac, X.; Dusini, S.; Farina, M.; Farrens, S.; Ferriol, S.; Frailis, M.; Franceschi, E.; Galeotta, S.; George, K.; Gillis, B.; Giocoli, C.; Grazian, A.; Grupp, F.; Guzzo, L.; Haugan, S. V. H.; Holmes, W.; Hook, I.; Hormuth, F.; Hornstrup, A.; Jahnke, K.; Joachimi, B.; Keihänen, E.; Kermiche, S.; Kiessling, A.; Kilbinger, M.; Kubik, B.; Kümmel, M.; Kunz, M.; Kurki-Suonio, H.; Ligori, S.; Lilje, P. B.; Lindholm, V.; Lloro, I.; Mainetti, G.; Maino, D.; Maiorano, E.; Mansutti, O.; Marggraf, O.; Markovic, K.; Martinelli, M.; Martinet, N.; Marulli, F.; Massey, R.; McCracken, H. J.; Medinaceli, E.; Mei, S.; Melchior, M.; Mellier, Y.; Meneghetti, M.; Merlin, E.; Meylan, G.; Moresco, M.; Moscardini, L.; Munari, E.; Nakajima, R.; Niemi, S.-M.; Nightingale, J. W.; Padilla, C.; Paltani, S.; Pasian, F.; Pedersen, K.; Pettorino, V.; Pires, S.; Polenta, G.; Poncet, M.; Popa, L. A.; Raison, F.; Rebolo, R.; Renzi, A.; Rhodes, J.; Riccio, G.; Romelli, E.; Roncarelli, M.; Rossetti, E.; Saglia, R.; Sakr, Z.; Sánchez, A. G.; Sapone, D.; Scaramella, R.; Schneider, P.; Schrabback, T.; Scodeggio, M.; Secroun, A.; Sefusatti, E.; Seidel, G.; Serrano, S.; Sirignano, C.; Stanco, L.; Steinwagner, J.; Tallada-Crespí, P.; Taylor, A. N.; Tereno, I.; Toledo-Moreo, R.; Torradeflot, F.; Tutusaus, I.; Valenziano, L.; Vassallo, T.; Veropalumbo, A.; Wang, Y.; Weller, J.; Zamorani, G.; Zoubian, J.; Zucca, E.; Biviano, A.; Boucaud, A.; Bozzo, E.; Burigana, C.; Calabrese, M.; Farinelli, R.; Mauri, N.; Scottez, V.; Tenti, M.; Viel, M.; Wiesmann, M.; Akrami, Y.; Allevato, V.; Anselmi, S.; Ballardini, M.; Blanchard, A.; Borgani, S.; Bruton, S.; Cabanac, R.; Calabro, A.; Cañas-Herrera, G.; Cappi, A.; Carvalho, C. S.; Castro, T.; Chambers, K. C.; Contarini, S.; Cooray, A. R.; Coupon, J.; Cucciati, O.; Desprez, G.; Díaz-Sánchez, A.; Di Domizio, S.; Escartin Vigo, J. A.; Escoffier, S.; Ferrari, A. G.; Ferreira, P. G.; Ferrero, I.; Fornari, F.; Gabarra, L.; Ganga, K.; García-Bellido, J.; Gaztanaga, E.; Giacomini, F.; Gozaliasl, G.; Gregorio, A.; Hall, A.; Hildebrandt, H.; Hjorth, J.; Kajava, J. J. E.; Kansal, V.; Karagiannis, D.; Kirkpatrick, C. C.; Legrand, L.; Libet G.; Loureiro, A.; Maggio, G.; Magliocchetti, M.; Mannucci, F.; Maoli, R.; Martins, C. J. A. P.; Matthew, S.; Maurin, L.; Metcalf, R. B.; Monaco, P.; Moretti, C.; Morgante, G.; Walton, Nicholas A.; Odier, J.; Patrizii, L.; Pöntinen, M.; Popa, V.; Porciani, C.; Potter, D.; Risso, I.; Rocci, P.-F.; Sahlén, M.; Schneider, A.; Sereno, M.; Simon, P.; Spurio Mancini, A.; Tao, C.; Testera, G.; Teyssier, R.; Toft, S.; Tosi, S.; Troja, A.; Tucci, M.; Valieri, C.; Valiviita, J.; Vergani, D.; Verza, G.; Euclid Collaboration
Publisher: EDP Sciences
Publication year: 2025
Journal: Astronomy and Astrophysics
Article number: A74
Volume: 702
ISSN: 0004-6361
eISSN: 1432-0746
DOI: https://doi.org/10.1051/0004-6361/202452468
Publication's open availability at the time of reporting: Open Access
Publication channel's open availability : Open Access publication channel
Web address : https://doi.org/10.1051/0004-6361/202452468
Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/505588811
The Euclid Space Telescope will image about 14 000 deg2 of the extragalactic sky at visible and near-infrared wavelengths, providing a dataset of unprecedented size and richness that will facilitate a multitude of studies into the evolution of galaxies. Although spectroscopy will also be available for some of the galaxies, in the vast majority of cases the main source of information will come from broadband images and data products thereof (i.e. photometry). Therefore, there is a pressing need to identify or develop scalable yet reliable methodologies to estimate the redshift and physical properties of galaxies using broadband photometry from Euclid. Optionally, such methods could also include ground-based optical photometry. To address this need, we present a novel method developed as part of a ‘data challenge’ within the Euclid Collaboration to estimate the redshift, stellar mass, star-formation rate, specific star-formation rate, E(B − V), and age of galaxies using mock Euclid and ground-based photometry. The main novelty of our property-estimation pipeline is its use of the CatBoost implementation of gradient-boosted regression-trees together with chained regression and an intelligent, automatic optimisation of the training data. The pipeline also includes a computationally efficient method to estimate prediction uncertainties, and, in the absence of ground-truth labels, it provides accurate predictions for metrics of model performance up to z ~ 2. We applied our pipeline to several datasets consisting of mock Euclid broadband photometry and mock ground-based ugriz photometry, with the objective of evaluating the performance of our methodology for estimating the redshift and physical properties of galaxies detected in the Euclid Wide Survey. The statistical metrics of prediction residuals vary depending on which mock catalogue and filters are tested. Nonetheless, the quality of our photometric redshift and physical property estimates are highly competitive overall, validating our modelling approach. However, at z ≳ 3.5, the relative sparsity of galaxies resulted in unreliable redshift and physical property estimates, which we argue could be mitigated by building catalogues with better sampling of z ≳ 3.5 galaxies or by switching to the use of spectral energy distribution fitting in this regime. We also find that the inclusion of ground-based optical photometry significantly improves the quality of the property estimation, highlighting the importance of combining Euclid data with ancillary ground-based data from such surveys as the Vera C. Rubin Observatory Legacy Survey of Space and Time and UNIONS.
Downloadable publication This is an electronic reprint of the original article. |
Funding information in the publication:
This work was supported by Fundação para a Ciência e a Tecnologia (FCT) through grants UID/FIS/04434/2019, UIDB/04434/2020, UIDP/04434/2020, and PTDC/FIS-AST/29245/2017, and an FCT-CAPES Transnational Coöperation Project. AH acknowledges support from the NVIDIA Academic Hardware Grant Program. PACC acknowledges financial support from the FCT through grant 2022.11477.